Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models
Abstract
Despite the outstanding performance in vision-language reasoning, Large Vision-Language Models (LVLMs) might generate hallucinated contents that do not exist in the given image. Most existing LVLM hallucination benchmarks are constrained to evaluate the object-related hallucinations. However, the potential hallucination on the relations between two objects, i.e., relation hallucination, still lacks investigation. To remedy that, we design a unified framework to measure object and relation hallucination in LVLMs simultaneously. The core idea of our framework is to evaluate hallucinations in (object, relation, object) triplets extracted from LVLMs’ responses, making it easily generalizable to various vision-language tasks. Based on our framework, we further introduce Tri-HE, a novel Triplet-level Hallucination Evaluation benchmark which can be used to study both object and relation hallucination at the same time. With comprehensive evaluations on Tri-HE, we observe that the relation hallucination issue is even more serious than object hallucination among existing LVLMs, highlighting a previously neglected problem towards reliable LVLMs. Moreover, based on our findings, we design a simple training-free approach that effectively mitigates hallucinations for LVLMs.
Cite
Text
Wu et al. "Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models." Transactions on Machine Learning Research, 2025.Markdown
[Wu et al. "Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/wu2025tmlr-unified/)BibTeX
@article{wu2025tmlr-unified,
title = {{Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models}},
author = {Wu, Junjie and Chung, Tsz Ting and Chen, Kai and Yeung, Dit-Yan},
journal = {Transactions on Machine Learning Research},
year = {2025},
url = {https://mlanthology.org/tmlr/2025/wu2025tmlr-unified/}
}