Causal Retrieval with Semantic Consideration

Abstract

Recent rapid progress in large language models (LLMs) leads to high performance conversational AI system. To extend the the system to expert fields, such as biomedical or legal domains, it becomes standard to combine LLMs with information retrieval (IR) system and generate answer based on retrieved information (documents) for given queries. Thus, it is essential that the IR system should ''understand'' various intents included in the queries including but not limited to semantic similarity, causal relationship etc. However, existing IR systems primarily focuses on retrieving related information based only on semtnic similarities between sentences or documents. Here, we develop CAWAI that can understand causal relation between queries and documents . By training dense retriever with dual constraints, causal loss and semantic loss, CAWAI shows strong generalization capability achieving up to +7.8\% Hit@1 compared to DPR baseline in causal retrieval task where a target sentence is buried in 20m Wikipedia sentences.

Cite

Text

Shin and Hwang. "Causal Retrieval with Semantic Consideration." NeurIPS 2024 Workshops: CRL, 2024.

Markdown

[Shin and Hwang. "Causal Retrieval with Semantic Consideration." NeurIPS 2024 Workshops: CRL, 2024.](https://mlanthology.org/neuripsw/2024/shin2024neuripsw-causal/)

BibTeX

@inproceedings{shin2024neuripsw-causal,
  title     = {{Causal Retrieval with Semantic Consideration}},
  author    = {Shin, Hyunseo and Hwang, Wonseok},
  booktitle = {NeurIPS 2024 Workshops: CRL},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/shin2024neuripsw-causal/}
}