Transformers with Endogenous In-Context Learning: Bias Characterization and Mitigation

Wang, Haotian; Zou, Hao; Li, Haoxuan; Chi, Haoang; Shi, Yang; Zhang, Yuanxing; Yang, Wenjing; Liu, Xinwang; Lin, Zhouchen

Transformers with Endogenous In-Context Learning: Bias Characterization and Mitigation

Haotian Wang, Hao Zou, Haoxuan Li, Haoang Chi, Yang Shi, Yuanxing Zhang, Wenjing Yang, Xinwang Liu, Zhouchen Lin

ICLR 2026

/iclr/2026/wang2026iclr-transformers/

Abstract

In-context learning (ICL) enables pre-trained transformers (TFs) to perform few-shot learning across diverse tasks, fostering growing research into its underlying mechanisms. However, existing studies typically assume a causally-sufficient regime, overlooking spurious correlations and prediction bias introduced by hidden confounders (HCs). As HC commonly exists in real-world cases, current ICL understandings may not align with actual data structures. To fill this gap, we contribute the pioneer theoretical analysis towards a novel problem setup termed as ICL-HC, which offers understanding the effect of HC on the pre-training of TFs and the following ICL prediction. Our theoretical results entail that pre-trained TFs exhibits certain prediction bias with proportional to the confounding strength. To migrate such prediction bias, we further propose a gradient-free debiasing method named Double-Debiasing (DDbias) by collecting and prompting with extremely few unconfounded examples, correcting pre-trained TFs with unbiased ICL predictions. Extensive experiments on regression tasks across diverse designs of the TF architectures and data generation protocols verify both our theoretical results and the effectiveness of the proposed DDbias method.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Wang et al. "Transformers with Endogenous In-Context Learning: Bias Characterization and Mitigation." International Conference on Learning Representations, 2026.

Markdown

[Wang et al. "Transformers with Endogenous In-Context Learning: Bias Characterization and Mitigation." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wang2026iclr-transformers/)

BibTeX

@inproceedings{wang2026iclr-transformers,
  title     = {{Transformers with Endogenous In-Context Learning: Bias Characterization and Mitigation}},
  author    = {Wang, Haotian and Zou, Hao and Li, Haoxuan and Chi, Haoang and Shi, Yang and Zhang, Yuanxing and Yang, Wenjing and Liu, Xinwang and Lin, Zhouchen},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/wang2026iclr-transformers/}
}