KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking

Abstract

As large language models (LLMs) become increasingly integral to a wide array of applications, ensuring the factual accuracy of their outputs and mitigating hallucinations is paramount. Current approaches, which primarily rely on self-consistency checks or post-hoc fact-checking, often fall short by disregarding the nuanced structure of queries and the diverse forms of contextual knowledge required for accurate response generation. To address these shortcomings, we introduce KnowHalu (pronounced "No Halu"), the first multi-form knowledge-based hallucination detection framework. We also introduce a new category of hallucinations, off-target hallucinations, which occur when responses are factually accurate but irrelevant or nonspecific to the query (e.g., answering "What’s the primary language in Barcelona?" with "European language"). In particular, KnowHalu employs a rigorous two-phase process to detect hallucinations. In the first phase, it isolates off-target hallucinations by analyzing the semantic alignment between the response and the query. In the second phase, it conducts a novel multi-form knowledge-based fact-checking through a comprehensive pipeline of reasoning and query decomposition, knowledge retrieval, knowledge form optimization, judgment generation, and judgment aggregation. Extensive evaluations demonstrate that KnowHalu significantly surpasses state-of-the-art (SOTA) baselines across diverse tasks, achieving over 15% improvement in question answering (QA) and 6% in summarization tasks when applied to the same underlying LLM. These results underscore the effectiveness and versatility of KnowHalu, setting a new benchmark for hallucination detection and paving the way for safer and more reliable LLM applications.

Cite

Text

Zhang et al. "KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking." ICLR 2025 Workshops: FM-Wild, 2025.

Markdown

[Zhang et al. "KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking." ICLR 2025 Workshops: FM-Wild, 2025.](https://mlanthology.org/iclrw/2025/zhang2025iclrw-knowhalu/)

BibTeX

@inproceedings{zhang2025iclrw-knowhalu,
  title     = {{KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking}},
  author    = {Zhang, Jiawei and Xu, Chejian and Gai, Yu and Lecue, Freddy and Yang, Shuang and Song, Dawn and Li, Bo},
  booktitle = {ICLR 2025 Workshops: FM-Wild},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/zhang2025iclrw-knowhalu/}
}