Variational Open-Domain Question Answering
Abstract
Retrieval-augmented models have proven to be effective in natural language processing tasks, yet there remains a lack of research on their optimization using variational inference. We introduce the Variational Open-Domain (VOD) framework for end-to-end training and evaluation of retrieval-augmented models, focusing on open-domain question answering and language modelling. The VOD objective, a self-normalized estimate of the Rényi variational bound, approximates the task marginal likelihood and is evaluated under samples drawn from an auxiliary sampling distribution (cached retriever and/or approximate posterior). It remains tractable, even for retriever distributions defined on large corpora. We demonstrate VOD’s versatility by training reader-retriever BERT-sized models on multiple-choice medical exam questions. On the MedMCQA dataset, we outperform the domain-tuned Med-PaLM by +5.3% despite using 2.500$\times$ fewer parameters. Our retrieval-augmented BioLinkBERT model scored 62.9% on the MedMCQA and 55.0% on the MedQA-USMLE. Last, we show the effectiveness of our learned retriever component in the context of medical semantic search.
Cite
Text
Liévin et al. "Variational Open-Domain Question Answering." International Conference on Machine Learning, 2023.Markdown
[Liévin et al. "Variational Open-Domain Question Answering." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/lievin2023icml-variational/)BibTeX
@inproceedings{lievin2023icml-variational,
title = {{Variational Open-Domain Question Answering}},
author = {Liévin, Valentin and Motzfeldt, Andreas Geert and Jensen, Ida Riis and Winther, Ole},
booktitle = {International Conference on Machine Learning},
year = {2023},
pages = {20950-20977},
volume = {202},
url = {https://mlanthology.org/icml/2023/lievin2023icml-variational/}
}