ChatQA: Surpassing GPT-4 on Conversational QA and RAG
Abstract
In this work, we introduce ChatQA, a suite of models that outperform GPT-4 on retrieval-augmented generation (RAG) and conversational question answering (QA). To enhance generation, we propose a two-stage instruction tuning method that significantly boosts the performance of RAG. For effective retrieval, we introduce a dense retriever optimized for conversational QA, which yields results comparable to the alternative state-of-the-art query rewriting models, while substantially reducing deployment costs. We also present the ChatRAG Bench, which encompasses ten datasets covering comprehensive evaluations on RAG, table-related QA, arithmetic calculations, and scenarios involving unanswerable questions. Our ChatQA-1.0-70B (score: 54.14), built on Llama2, a weaker foundation model than GPT-4, can slightly outperform GPT-4-0613 (score: 53.90) and GPT-4-Turbo-2024-04-09 (score: 54.03) on the ChatRAG Bench, without relying on any synthetic data from OpenAI GPT models. Notably, Llama3-ChatQA-1.5-70B model surpasses the accuracy of GPT-4-Turbo-2024-04-09 by a margin. These results demonstrate the exceptional quality of the proposed ChatQA recipe. To advance research in this field, we open-sourced the model weights, instruction tuning data, ChatRAG Bench, and retriever for the community.
Cite
Text
Liu et al. "ChatQA: Surpassing GPT-4 on Conversational QA and RAG." Neural Information Processing Systems, 2024. doi:10.52202/079017-0493Markdown
[Liu et al. "ChatQA: Surpassing GPT-4 on Conversational QA and RAG." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/liu2024neurips-chatqa/) doi:10.52202/079017-0493BibTeX
@inproceedings{liu2024neurips-chatqa,
title = {{ChatQA: Surpassing GPT-4 on Conversational QA and RAG}},
author = {Liu, Zihan and Ping, Wei and Roy, Rajarshi and Xu, Peng and Lee, Chankyu and Shoeybi, Mohammad and Catanzaro, Bryan},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-0493},
url = {https://mlanthology.org/neurips/2024/liu2024neurips-chatqa/}
}