Adjustment for Confounding Using Pre-Trained Representations

Abstract

There is growing interest in extending average treatment effect (ATE) estimation to incorporate non-tabular data, such as images and text, which may act as sources of confounding. Neglecting these effects risks biased results and flawed scientific conclusions. However, incorporating non-tabular data necessitates sophisticated feature extractors, often in combination with ideas of transfer learning. In this work, we investigate how latent features from pre-trained neural networks can be leveraged to adjust for sources of confounding. We formalize conditions under which these latent features enable valid adjustment and statistical inference in ATE estimation, demonstrating results along the example of double machine learning. In this context, we also discuss critical challenges inherent to latent feature learning and downstream parameter estimation using those. As our results are agnostic to the considered data modality, they represent an important first step towards a theoretical foundation for the usage of latent representation from foundation models in ATE estimation.

Cite

Text

Schulte et al. "Adjustment for Confounding Using Pre-Trained Representations." ICLR 2025 Workshops: FM-Wild, 2025.

Markdown

[Schulte et al. "Adjustment for Confounding Using Pre-Trained Representations." ICLR 2025 Workshops: FM-Wild, 2025.](https://mlanthology.org/iclrw/2025/schulte2025iclrw-adjustment/)

BibTeX

@inproceedings{schulte2025iclrw-adjustment,
  title     = {{Adjustment for Confounding Using Pre-Trained Representations}},
  author    = {Schulte, Rickmer and Rügamer, David and Nagler, Thomas},
  booktitle = {ICLR 2025 Workshops: FM-Wild},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/schulte2025iclrw-adjustment/}
}