Sequential Causal Discovery with Noisy Language Model Priors

Abstract

Causal discovery from observational data typically assumes access to complete data and to perfect domain experts. In practice, data often arrive in batches and are subject to sampling bias, and expert knowledge is scarce. Language Models (LMs) offer a surrogate for expert knowledge but suffer from hallucinations, inconsistencies, and bias. We present a hybrid framework that bridges these gaps by adaptively integrating sequential batch data with noisy, LM-derived expert knowledge while accounting for both data-induced and LM-induced biases. We propose a representation shift from Directed Acyclic Graphs (DAGs) to Partial Ancestral Graphs (PAGs), which accommodate ambiguities within a coherent framework and allow global LM knowledge to be grounded in local observational data. To guide LM interactions, we use a sequential optimization scheme that adaptively queries the most informative edges. Across varied datasets and LMs, our approach outperforms prior work in structural accuracy, extends to parameter estimation, and is robust to LM noise.

Cite

Text

Verma et al. "Sequential Causal Discovery with Noisy Language Model Priors." Transactions on Machine Learning Research, 2026.

Markdown

[Verma et al. "Sequential Causal Discovery with Noisy Language Model Priors." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/verma2026tmlr-sequential/)

BibTeX

@article{verma2026tmlr-sequential,
  title     = {{Sequential Causal Discovery with Noisy Language Model Priors}},
  author    = {Verma, Prakhar and Arbour, David and Choudhary, Sunav and Chopra, Harshita and Solin, Arno and Sinha, Atanu R.},
  journal   = {Transactions on Machine Learning Research},
  year      = {2026},
  url       = {https://mlanthology.org/tmlr/2026/verma2026tmlr-sequential/}
}