Reasoning or Retrieval? a Study of Answer Attribution on Large Reasoning Models

Abstract

Large reasoning models (LRMs) exhibit unprecedented capabilities in solving complex problems through Chain-of-Thought (CoT) reasoning. However, recent studies reveal that their final answers often contradict their own reasoning traces. We hypothesize that this inconsistency stems from two competing mechanisms for generating answers: CoT reasoning and memory retrieval. To test this hypothesis, we conduct controlled experiments that challenge LRMs with misleading cues during reasoning and/or corrupted answers during retrieval. Our results across models and datasets confirm that both mechanisms operate simultaneously, with their relative dominance influenced by multiple factors: problem domains, model scales, and fine-tuning approaches (e.g., reinforcement learning vs. distillation). The findings reveal a critical limitation in current reasoning fine-tuning paradigms: models can exploit the retrieval mechanism as a shortcut, effectively "hacking" the reward signal and undermining genuine reasoning development. To address this challenge, we introduce FARL, a novel fine-tuning framework that integrates memory unlearning with reinforcement learning. By carefully suppressing retrieval shortcuts during the fine-tuning process, FARL promotes reasoning-dominant behavior and enhances generalizable reasoning capabilities. The code is available at https://github.com/ZJUWYH/FARL.

Cite

Text

Wang et al. "Reasoning or Retrieval? a Study of Answer Attribution on Large Reasoning Models." International Conference on Learning Representations, 2026.

Markdown

[Wang et al. "Reasoning or Retrieval? a Study of Answer Attribution on Large Reasoning Models." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wang2026iclr-reasoning/)

BibTeX

@inproceedings{wang2026iclr-reasoning,
  title     = {{Reasoning or Retrieval? a Study of Answer Attribution on Large Reasoning Models}},
  author    = {Wang, Yuhui and Li, Changjiang and Chen, Guangke and Liang, Jiacheng and Wang, Ting},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/wang2026iclr-reasoning/}
}