Tracedet: Hallucination Detection from the Decoding Trace of Diffusion Large Language Models
Abstract
Diffusion large language models (D-LLMs) have recently emerged as a promising alternative to auto-regressive LLMs (AR-LLMs). However, the hallucination problem in D-LLMs remains underexplored, limiting their reliability in real-world applications. Existing hallucination detection methods are designed for AR-LLMs and rely on signals from \emph{single-step} generation, making them ill-suited for D-LLMs where hallucination signals often emerge throughout the \emph{multi-step} denoising process. To bridge this gap, we propose \textbf{TraceDet}, a novel framework that explicitly leverages the intermediate denoising steps of D-LLMs for hallucination detection. TraceDet models the denoising process as an \emph{action trace}, with each action defined as the model’s prediction over the cleaned response, conditioned on the previous intermediate output. By identifying the sub-trace that is maximally informative to the hallucinated responses, TraceDet leverages the key hallucination signals in the multi-step denoising process of D-LLMs for hallucination detection. Extensive experiments on various open source D-LLMs demonstrate that \textbf{TraceDet} consistently improves hallucination detection, achieving an average gain in AUROC of 15.2\% compared to baselines.
Cite
Text
Chang et al. "Tracedet: Hallucination Detection from the Decoding Trace of Diffusion Large Language Models." International Conference on Learning Representations, 2026.Markdown
[Chang et al. "Tracedet: Hallucination Detection from the Decoding Trace of Diffusion Large Language Models." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/chang2026iclr-tracedet/)BibTeX
@inproceedings{chang2026iclr-tracedet,
title = {{Tracedet: Hallucination Detection from the Decoding Trace of Diffusion Large Language Models}},
author = {Chang, Shenxu and Yu, Junchi and Wang, Weixing and Chen, Yongqiang and Yu, Jialin and Torr, Philip and Gu, Jindong},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/chang2026iclr-tracedet/}
}