Beyond In-Domain Detection: SpikeScore for Cross-Domain Hallucination Detection

Abstract

Hallucination detection is critical for deploying large language models (LLMs) in real-world applications. Existing hallucination detection methods achieve strong performance when the training and test data come from the same domain, but they suffer from poor cross-domain generalization. In this paper, we study an important yet overlooked problem, termed generalizable hallucination detection (GHD), which aims to train hallucination detectors on data from a single domain while ensuring robust performance across diverse related domains. In studying GHD, we simulate multi-turn dialogues following LLMs' initial response and observe an interesting phenomenon: hallucination-initiated multi-turn dialogues universally exhibit larger uncertainty fluctuations than factual ones across different domains. Based on the phenomenon, we propose a new score SpikeScore, which quantifies abrupt fluctuations in multi-turn dialogues. Through both theoretical analysis and empirical validation, we demonstrate that SpikeScore achieves strong cross-domain separability between hallucinated and non-hallucinated responses. Experiments across multiple LLMs and benchmarks demonstrate that the SpikeScore-based detection method outperforms representative baselines in cross-domain generalization and surpasses advanced generalization-oriented methods, verifying the effectiveness of our method in cross-domain hallucination detection.

Cite

Text

Deng et al. "Beyond In-Domain Detection: SpikeScore for Cross-Domain Hallucination Detection." International Conference on Learning Representations, 2026.

Markdown

[Deng et al. "Beyond In-Domain Detection: SpikeScore for Cross-Domain Hallucination Detection." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/deng2026iclr-beyond-a/)

BibTeX

@inproceedings{deng2026iclr-beyond-a,
  title     = {{Beyond In-Domain Detection: SpikeScore for Cross-Domain Hallucination Detection}},
  author    = {Deng, Yongxin and Fang, Zhen and Li, Sharon and Chen, Ling},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/deng2026iclr-beyond-a/}
}