Exponential Family Variational Flow Matching for Tabular Data Generation
Abstract
While denoising diffusion and flow matching have driven major advances in generative modeling, their application to tabular data remains limited, despite its ubiquity in real-world applications. To this end, we develop TabbyFlow, a variational Flow Matching (VFM) method for tabular data generation. To apply VFM to data with mixed continuous and discrete features, we introduce Exponential Family Variational Flow Matching (EF-VFM), which represents heterogeneous data types using a general exponential family distribution. We hereby obtain an efficient, data-driven objective based on moment matching, enabling principled learning of probability paths over mixed continuous and discrete variables. We also establish a connection between variational flow matching and generalized flow matching objectives based on Bregman divergences. Evaluation on tabular data benchmarks demonstrates state-of-the-art performance compared to baselines.
Cite
Text
Guzmán-Cordero et al. "Exponential Family Variational Flow Matching for Tabular Data Generation." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Guzmán-Cordero et al. "Exponential Family Variational Flow Matching for Tabular Data Generation." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/guzmancordero2025icml-exponential/)BibTeX
@inproceedings{guzmancordero2025icml-exponential,
title = {{Exponential Family Variational Flow Matching for Tabular Data Generation}},
author = {Guzmán-Cordero, Andrés and Eijkelboom, Floor and Van De Meent, Jan-Willem},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {21516-21529},
volume = {267},
url = {https://mlanthology.org/icml/2025/guzmancordero2025icml-exponential/}
}