The First Impression Problem: Internal Bias Triggers Overthinking in Reasoning Models

Abstract

Reasoning models often exhibit overthinking, characterized by redundant reasoning steps. We identify \emph{internal bias} elicited by the input question as a key trigger of such behavior. Upon encountering a problem, the model immediately forms a preliminary guess about the answer, which we term an internal bias since it may not be explicitly generated, and it arises without systematic reasoning. When this guess conflicts with its subsequent reasoning, the model tends to engage in excessive reflection, resulting in wasted computation. We validate the association between internal bias and overthinking across multiple models and diverse reasoning tasks. To demonstrate the causal relationship more rigorously, we conduct two counterfactual interventions, showing that removing the input question after the model reduces the redundant reasoning across various complex reasoning tasks, and manually injecting bias affects overthinking accordingly. Further interpretability experiments suggest that excessive attention to the input question serves as a key mechanism through which internal bias influences subsequent reasoning trajectories. Finally, we evaluated several methods aimed at mitigating overthinking, yet the influence of internal bias persisted under all conditions.

Cite

Text

Dang et al. "The First Impression Problem: Internal Bias Triggers Overthinking in Reasoning Models." International Conference on Learning Representations, 2026.

Markdown

[Dang et al. "The First Impression Problem: Internal Bias Triggers Overthinking in Reasoning Models." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/dang2026iclr-first/)

BibTeX

@inproceedings{dang2026iclr-first,
  title     = {{The First Impression Problem: Internal Bias Triggers Overthinking in Reasoning Models}},
  author    = {Dang, Renfei and Li, Zhening and Huang, Shujian and Chen, Jiajun},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/dang2026iclr-first/}
}