A Survey of Model Architectures in Information Retrieval

Abstract

The period from 2019 to the present has represented one of the biggest paradigm shifts in information retrieval (IR) and natural language processing (NLP), culminating in the emergence of powerful large language models (LLMs) from 2022 onward. Methods leveraging pretrained encoder-only models (e.g., BERT) and decoder-only generative LLMs have outperformed many previous approaches, particularly excelling in zero-shot scenarios and complex reasoning tasks. Our survey study investigates the evolution of model architectures in IR, focusing on two key aspects: backbone models for feature extraction and end-to-end system architectures for relevance estimation. The review intentionally separates architectural considerations from training methodologies, in order to provide a focused analysis of structural innovations in IR systems. We trace the development from traditional term-based methods to modern neural approaches, particularly discussing the impact of transformer-based models and subsequent large language models (LLMs). We conclude with a forward-looking discussion of emerging challenges and future directions, including architectural optimizations for performance and scalability, handling of multimodal, multilingual data, and adaptation to novel application domains such as autonomous search agents that might be the next-generation paradigm of IR.

Cite

Text

Xu et al. "A Survey of Model Architectures in Information Retrieval." Transactions on Machine Learning Research, 2026.

Markdown

[Xu et al. "A Survey of Model Architectures in Information Retrieval." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/xu2026tmlr-survey/)

BibTeX

@article{xu2026tmlr-survey,
  title     = {{A Survey of Model Architectures in Information Retrieval}},
  author    = {Xu, Zhichao and Mo, Fengran and Huang, Zhiqi and Zhang, Crystina and Yu, Puxuan and Phillips, Bei Wang and Lin, Jimmy and Srikumar, Vivek},
  journal   = {Transactions on Machine Learning Research},
  year      = {2026},
  url       = {https://mlanthology.org/tmlr/2026/xu2026tmlr-survey/}
}