Towards Reverse Causal Inference on Panel Data: Precise Formulation and Challenges
Abstract
Seeking causal explanations in panel (or longitudinal/multivariate time-series) data is a difficult problem of both academic and industrial importance. Although there exists a large amount of literature on forward causal inference, where the treatment/outcome/covariates variables are well-defined, it is unclear how to answer the reverse question: which covariates have effects on the outcome? In this paper, we set forth our expedition on this reverse question from the first principles. We formulate the precise problem definition in terms of causal patterns and causal paths, and propose a linear-time greedy meta algorithm that makes use of forward causal inference estimators. We further identify a set of optimality conditions under which the proposed algorithm is able to find the optimal causal path. To substantiate our greedy algorithm, we propose a generalized version of the synthetic control estimator by fitting both synthetic treatments and controls by conditioning on the partial causal paths. Promising results on on synthetic datasets demonstrate the potential of our method.
Cite
Text
Zhang et al. "Towards Reverse Causal Inference on Panel Data: Precise Formulation and Challenges." NeurIPS 2022 Workshops: CDS, 2022.Markdown
[Zhang et al. "Towards Reverse Causal Inference on Panel Data: Precise Formulation and Challenges." NeurIPS 2022 Workshops: CDS, 2022.](https://mlanthology.org/neuripsw/2022/zhang2022neuripsw-reverse/)BibTeX
@inproceedings{zhang2022neuripsw-reverse,
title = {{Towards Reverse Causal Inference on Panel Data: Precise Formulation and Challenges}},
author = {Zhang, Jiayao and Park, Youngsuk and Maddix, Danielle C. and Roth, Dan and Wang, Bernie},
booktitle = {NeurIPS 2022 Workshops: CDS},
year = {2022},
url = {https://mlanthology.org/neuripsw/2022/zhang2022neuripsw-reverse/}
}