Missing Data as a Causal and Probabilistic Problem

Abstract

Causal inference is often phrased as a missing data problem -- for every unit, only the response to observed treatment assignment is known, the response to other treatment assignments is not. In this paper, we extend the converse approach of (Mohan et al, 2013) of representing missing data problems to restricted causal models (where only interventions on missingness indicators are allowed). We further use this representation to leverage techniques developed for the problem of identification of causal effects to give a general criterion for cases where a joint distribution containing missing variables can be recovered from data actually observed, given assumptions on missingness mechanisms. This criterion is significantly more general than the commonly used ``missing at random'' (MAR) criterion, and generalizes past work which also exploits a graphical representation of missingness. In fact, the relationship of our criterion to MAR is not unlike the relationship between the ID algorithm for identification of causal effects (Tian and Pearl, 2002), (Shpitser and Pearl 2006), and conditional ignorability (Rosenbaum and Rubin, 1983).

Cite

Text

Shpitser et al. "Missing Data as a Causal and Probabilistic Problem." Conference on Uncertainty in Artificial Intelligence, 2015.

Markdown

[Shpitser et al. "Missing Data as a Causal and Probabilistic Problem." Conference on Uncertainty in Artificial Intelligence, 2015.](https://mlanthology.org/uai/2015/shpitser2015uai-missing/)

BibTeX

@inproceedings{shpitser2015uai-missing,
  title     = {{Missing Data as a Causal and Probabilistic Problem}},
  author    = {Shpitser, Ilya and Mohan, Karthika and Pearl, Judea},
  booktitle = {Conference on Uncertainty in Artificial Intelligence},
  year      = {2015},
  pages     = {802-811},
  url       = {https://mlanthology.org/uai/2015/shpitser2015uai-missing/}
}