Causal Discovery over Clusters of Variables in Markovian Systems
Abstract
Causal discovery methods are powerful tools for uncovering the structure of relationships among variables, yet they face significant challenges in scalability and interpretability, especially in high-dimensional settings. In many domains, researchers are not only interested in causal links between individual variables, but also in relationships among sets or clusters of variables. Learning causal structure at the cluster level can both reveal higher-order relationships of interest and improve scalability. In this work, we introduce an approach for causal discovery over clusters in Markov causal systems. We propose a new graphical model that encodes knowledge of relationships between user-defined clusters while fully representing independencies and dependencies over clusters, faithful to a given distribution. We then define and characterize a graphical equivalence class of these models that share cluster-level independence information. Lastly, we present a sound and complete algorithm for causal discovery to represent learnable causal relationships between clusters of variables.
Cite
Text
Anand et al. "Causal Discovery over Clusters of Variables in Markovian Systems." Advances in Neural Information Processing Systems, 2025.Markdown
[Anand et al. "Causal Discovery over Clusters of Variables in Markovian Systems." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/anand2025neurips-causal/)BibTeX
@inproceedings{anand2025neurips-causal,
title = {{Causal Discovery over Clusters of Variables in Markovian Systems}},
author = {Anand, Tara Vafai and Ribeiro, Adèle H. and Tian, Jin and Hripcsak, George and Bareinboim, Elias},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/anand2025neurips-causal/}
}