MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query
Abstract
Semantic retrieval is crucial for modern applications yet remains underexplored in current research. Existing datasets are limited to single languages, single images, or singular retrieval conditions, often failing to fully exploit the expressive capacity of visual information as evidenced by maintained performance when images are replaced with captions. However, practical retrieval scenarios frequently involve interleaved multi-condition queries with multiple images. Hence, this paper introduces MERIT, the first multilingual dataset for interleaved multi-condition semantic retrieval, comprising 320,000 queries with 135,000 products in 5 languages, covering 7 distinct product categories. Extensive experiments on MERIT identify existing models's critical limitation: focusing solely on global semantic information while neglecting specific conditional elements in queries. Consequently, we propose Coral, a novel fine-tuning framework that adapts pre-trained MLLMs by integrating embedding reconstruction to preserve fine-grained conditional elements and contrastive learning to extract comprehensive global semantics. Experiments demonstrate that Coral achieves a 45.9% performance improvement over conventional approaches on MERIT, with strong generalization capabilities validated across 8 established retrieval benchmarks. Collectively, our contributions—a novel dataset, identification of critical limitations in existing approaches, and an innovative fine-tuning framework—establish a foundation for future research in interleaved multi-condition semantic retrieval. **Data & Code**: [MERIT-2025.github.io](https://merit-2025.github.io)
Cite
Text
Chow et al. "MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query." Advances in Neural Information Processing Systems, 2025.Markdown
[Chow et al. "MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/chow2025neurips-merit/)BibTeX
@inproceedings{chow2025neurips-merit,
title = {{MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query}},
author = {Chow, Wei and Gao, Yuan and Li, Linfeng and Wang, Xian and Xu, Qi and Song, Hang and Kong, Lingdong and Zhou, Ran and Zeng, Yi and Cai, Yidong and Jiang, Botian and Xu, Shilin and Jiajunzhang, and Qiu, Minghui and Li, Xiangtai and Yang, Tianshu and Tang, Siliang and Li, Juncheng},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/chow2025neurips-merit/}
}