Bridging Information Asymmetry in Text-Video Retrieval: A Data-Centric Approach
Abstract
As online video content rapidly grows, the task of text-video retrieval (TVR) becomes increasingly important. A key challenge in TVR is the information asymmetry between video and text: videos are inherently richer in information, while their textual descriptions often capture only fragments of this complexity. This paper introduces a novel, data-centric framework to bridge this gap by enriching textual representations to better match the richness of video content. During training, videos are segmented into event-level clips and captioned to ensure comprehensive coverage. During retrieval, a large language model (LLM) generates semantically diverse queries to capture a broader range of possible matches. To enhance retrieval efficiency, we propose a query selection mechanism that identifies the most relevant and diverse queries, reducing computational cost while improving accuracy. Our method achieves state-of-the-art results across multiple benchmarks, demonstrating the power of data-centric approaches in addressing information asymmetry in TVR. This work paves the way for new research focused on leveraging data to improve cross-modal retrieval.
Cite
Text
Bai et al. "Bridging Information Asymmetry in Text-Video Retrieval: A Data-Centric Approach." International Conference on Learning Representations, 2025.Markdown
[Bai et al. "Bridging Information Asymmetry in Text-Video Retrieval: A Data-Centric Approach." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/bai2025iclr-bridging/)BibTeX
@inproceedings{bai2025iclr-bridging,
title = {{Bridging Information Asymmetry in Text-Video Retrieval: A Data-Centric Approach}},
author = {Bai, Zechen and Xiao, Tianjun and He, Tong and Wang, Pichao and Zhang, Zheng and Brox, Thomas and Shou, Mike Zheng},
booktitle = {International Conference on Learning Representations},
year = {2025},
url = {https://mlanthology.org/iclr/2025/bai2025iclr-bridging/}
}