Causal Discovery on Dependent Binary Data

Abstract

The assumption of independence between observations (units) in a dataset is prevalent across various methodologies for learning causal graphical models. However, this assumption often finds itself in conflict with real-world data, posing challenges to accurate structure learning. We propose a decorrelation-based approach for causal graph learning on dependent binary data, where the local conditional distribution is defined by a latent utility model with dependent errors across units. We develop a pairwise maximum likelihood method to estimate the covariance matrix for the dependence among the units. Then, leveraging the estimated covariance matrix, we develop an EM-like iterative algorithm to generate and de-correlate samples of the latent utility variables, which serve as de-correlated data. Any standard causal discovery method can be applied on the de-correlated data to learn the underlying causal graph. We demonstrate that the proposed de-correlation approach significantly improves the accuracy in causal graph learning, through numerical experiments on both synthetic and real-world datasets.

Cite

Text

Chen and Zhou. "Causal Discovery on Dependent Binary Data." Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, 2025.

Markdown

[Chen and Zhou. "Causal Discovery on Dependent Binary Data." Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, 2025.](https://mlanthology.org/aistats/2025/chen2025aistats-causal/)

BibTeX

@inproceedings{chen2025aistats-causal,
  title     = {{Causal Discovery on Dependent Binary Data}},
  author    = {Chen, Alex and Zhou, Qing},
  booktitle = {Proceedings of The 28th International Conference on Artificial Intelligence and Statistics},
  year      = {2025},
  pages     = {2773-2781},
  volume    = {258},
  url       = {https://mlanthology.org/aistats/2025/chen2025aistats-causal/}
}