Modeling Causal Mechanisms with Diffusion Models for Interventional and Counterfactual Queries
Abstract
We consider the problem of answering observational, interventional, and counterfactual queries in a causally sufficient setting where only observational data and the causal graph are available. Utilizing the recent developments in diffusion models, we introduce diffusion-based causal models (DCM) to learn causal mechanisms, that generate unique latent encodings. These encodings enable us to directly sample under interventions and perform abduction for counterfactuals. Diffusion models are a natural fit here, since they can encode each node to a latent representation that acts as a proxy for exogenous noise. Our empirical evaluations demonstrate significant improvements over existing state-of-the-art methods for answering causal queries. Furthermore, we provide theoretical results that offer a methodology for analyzing counterfactual estimation in general encoder-decoder models, which could be useful in settings beyond our proposed approach.
Cite
Text
Chao et al. "Modeling Causal Mechanisms with Diffusion Models for Interventional and Counterfactual Queries." Transactions on Machine Learning Research, 2024.Markdown
[Chao et al. "Modeling Causal Mechanisms with Diffusion Models for Interventional and Counterfactual Queries." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/chao2024tmlr-modeling/)BibTeX
@article{chao2024tmlr-modeling,
title = {{Modeling Causal Mechanisms with Diffusion Models for Interventional and Counterfactual Queries}},
author = {Chao, Patrick and Blöbaum, Patrick and Patel, Sapan Kirit and Kasiviswanathan, Shiva},
journal = {Transactions on Machine Learning Research},
year = {2024},
url = {https://mlanthology.org/tmlr/2024/chao2024tmlr-modeling/}
}