Causal-TGAN: Modeling Tabular Data Using Causally-Aware GAN
Abstract
Generative adversarial net (GAN)-based tabular data generation has recently received significant attention for its power for data augmentation when available data is limited. Most prior works have applied generic GAN frameworks for tabular data generation without explicitly considering inter-variable relationships, which is important for modeling tabular data distribution. In this work, we design Causal-TGAN, a causally-aware generator architecture that can capture the relationships among variables (continuous-type, discrete-type, and mixed-type) by explicitly modeling the pre-defined inter-variable causal relationships. The flexibility of Causal-TGAN is its capability to support different degrees of subject matter expert domain knowledge (e.g., complete or partial) about the inter-variable causal relations. Extensive experimental results on both simulated and real-world datasets demonstrate that exploiting causal relations in deep generative models could improve the generated tabular data quality compared to the state-of-the-art. Code is available at \url{https://github.com/BiggyBing/Causal-TGAN-Public}.
Cite
Text
Wen et al. "Causal-TGAN: Modeling Tabular Data Using Causally-Aware GAN." ICLR 2022 Workshops: DGM4HSD, 2022.Markdown
[Wen et al. "Causal-TGAN: Modeling Tabular Data Using Causally-Aware GAN." ICLR 2022 Workshops: DGM4HSD, 2022.](https://mlanthology.org/iclrw/2022/wen2022iclrw-causaltgan/)BibTeX
@inproceedings{wen2022iclrw-causaltgan,
title = {{Causal-TGAN: Modeling Tabular Data Using Causally-Aware GAN}},
author = {Wen, Bingyang and Cao, Yupeng and Yang, Fan and Subbalakshmi, Koduvayur and Chandramouli, Rajarathnam},
booktitle = {ICLR 2022 Workshops: DGM4HSD},
year = {2022},
url = {https://mlanthology.org/iclrw/2022/wen2022iclrw-causaltgan/}
}