Hallucination Begins Where Saliency Drops

Zhang, Xiaofeng; Zhu, Yuanchao; Gu, Chaochen; Yuan, Xiaosong; Zhao, Qiyan; Cao, Jiawei; Tang, Feilong; Fan, Sinan; Shen, Yaomin; Shen, Chen; Tang, Hao

Hallucination Begins Where Saliency Drops

Xiaofeng Zhang, Yuanchao Zhu, Chaochen Gu, Xiaosong Yuan, Qiyan Zhao, Jiawei Cao, Feilong Tang, Sinan Fan, Yaomin Shen, Chen Shen, Hao Tang

ICLR 2026

/iclr/2026/zhang2026iclr-hallucination/

Abstract

Recent studies have investigated attention dynamics in large vision language models (LVLMs), yet existing methods remain limited in reliably distinguishing hallucinated from correct outputs — primarily because they rely solely on forward-pass attention, ignoring gradient-based signals that reveal how token influence propagates through the model. To bridge this gap, we introduce \textbf{LVLMs-Saliency}, an \textit{gradient-aware diagnostic tool} that quantifies the grounding strength of each output token by fusing attention weights with their gradients. Through analysis, we identify a decisive pattern: \textit{Hallucinations occur when prior output tokens shows low saliency to the next token prediction}, indicating a failure of contextual memory. Building on this insight, we propose a dual-mechanism inference-time framework: (1) Saliency-Guided Rejection Sampling (SGRS), which dynamically filters candidate tokens during decoding by rejecting those with saliency below a context-adaptive threshold, thereby preventing coherence-breaking tokens from entering the sequence; and (2) Local Coherence Reinforcement (LocoRE), a lightweight plug-and-play module that strengthens attention from the current token to its most recent outputs, actively counteracting the “forgetting” behavior identified by LVLMs-Saliency. Experimental results demonstrate that our method significantly reduces hallucinations across multiple LVLMs, offering a robust and interpretable solution to improve model reliability.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Zhang et al. "Hallucination Begins Where Saliency Drops." International Conference on Learning Representations, 2026.

Markdown

[Zhang et al. "Hallucination Begins Where Saliency Drops." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/zhang2026iclr-hallucination/)

BibTeX

@inproceedings{zhang2026iclr-hallucination,
  title     = {{Hallucination Begins Where Saliency Drops}},
  author    = {Zhang, Xiaofeng and Zhu, Yuanchao and Gu, Chaochen and Yuan, Xiaosong and Zhao, Qiyan and Cao, Jiawei and Tang, Feilong and Fan, Sinan and Shen, Yaomin and Shen, Chen and Tang, Hao},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/zhang2026iclr-hallucination/}
}