Explore What LLM Does Not Know in Complex Question Answering

Lin, Xin; Huang, Zhenya; Zhang, Zhiqiang; Zhou, Jun; Chen, Enhong

doi:10.1609/AAAI.V39I23.34638

Explore What LLM Does Not Know in Complex Question Answering

Xin Lin, Zhenya Huang, Zhiqiang Zhang, Jun Zhou, Enhong Chen

AAAI 2025 pp. 24585-24594

doi:10.1609/AAAI.V39I23.34638 /aaai/2025/lin2025aaai-explore/

Abstract

Complex question answering (QA) is a challenging task in artificial intelligence research which requires reasoning based on related knowledge. The retrieval-augmented generation (RAG) based on large language models (LLMs) have become one promising solution in QA. To facilitate RAG more effectively, the LLM needs to precisely evaluate knowledge required in QA. That is, first, the LLM needs to examine its knowledge boundary (what the LLM does not know) to retrieve external knowledge as supplement. Second, the LLM needs to evaluate the utility of the retrieved knowledge (whether it helps in reasoning) for robust RAG. To this end, in this paper, we propose a novel Question Answering with Knowledge Evaluation (KEQA) framework to promote the effectiveness and efficiency of RAG in QA. First, inspired by quizzes in classroom, we propose a quiz-based method to precisely examine the knowledge state of the uninterpretable LLM for QA. We ask indicative quizzes on each required knowledge, and inspect whether the LLM can consistently answer the quiz to examine its knowledge boundary. Second, we retrieve the unknown knowledge from external source, and evaluate its utility to pick the helpful ones for reasoning. We design a reasoning-based metric to evaluate utility, and construct a demonstration set in training data for reference to guide knowledge picking in inference. We conduct extensive experiments on four widely-used QA datasets, and the results demonstrate the effectiveness of the proposed method.

PDF AAAI Semantic Scholar

Cite

Text

Lin et al. "Explore What LLM Does Not Know in Complex Question Answering." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I23.34638

Markdown

[Lin et al. "Explore What LLM Does Not Know in Complex Question Answering." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/lin2025aaai-explore/) doi:10.1609/AAAI.V39I23.34638

BibTeX

@inproceedings{lin2025aaai-explore,
  title     = {{Explore What LLM Does Not Know in Complex Question Answering}},
  author    = {Lin, Xin and Huang, Zhenya and Zhang, Zhiqiang and Zhou, Jun and Chen, Enhong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {24585-24594},
  doi       = {10.1609/AAAI.V39I23.34638},
  url       = {https://mlanthology.org/aaai/2025/lin2025aaai-explore/}
}