Scalable Causal Discovery with Score Matching

Abstract

This paper demonstrates how to discover the whole causal graph from the second derivative of the log-likelihood in observational non-linear additive Gaussian noise models. Leveraging scalable machine learning approaches to approximate the score function $\nabla \operatorname{log}p(\mathbf{X})$, we extend the work of Rolland et al., 2022, that only recovers the topological order from the score and requires an expensive pruning step to discover the edges. Our analysis leads to DAS, a practical algorithm that reduces the complexity of the pruning by a factor proportional to the graph size. In practice, DAS achieves competitive accuracy with current state-of-the-art while being over an order of magnitude faster. Overall, our approach enables principled and scalable causal discovery, significantly lowering the compute bar.

Cite

Text

Montagna et al. "Scalable Causal Discovery with Score Matching." NeurIPS 2022 Workshops: SBM, 2022.

Markdown

[Montagna et al. "Scalable Causal Discovery with Score Matching." NeurIPS 2022 Workshops: SBM, 2022.](https://mlanthology.org/neuripsw/2022/montagna2022neuripsw-scalable/)

BibTeX

@inproceedings{montagna2022neuripsw-scalable,
  title     = {{Scalable Causal Discovery with Score Matching}},
  author    = {Montagna, Francesco and Noceti, Nicoletta and Rosasco, Lorenzo and Zhang, Kun and Locatello, Francesco},
  booktitle = {NeurIPS 2022 Workshops: SBM},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/montagna2022neuripsw-scalable/}
}