Bridging Information Asymmetry in Text-Video Retrieval: A Data-Centric Approach

Abstract

As online video content rapidly grows, the task of text-video retrieval (TVR) becomes increasingly important. A key challenge in TVR is the information asymmetry between video and text: videos are inherently richer in information, while their textual descriptions often capture only fragments of this complexity. This paper introduces a novel, data-centric framework to bridge this gap by enriching textual representations to better match the richness of video content. During training, videos are segmented into event-level clips and captioned to ensure comprehensive coverage. During retrieval, a large language model (LLM) generates semantically diverse queries to capture a broader range of possible matches. To enhance retrieval efficiency, we propose a query selection mechanism that identifies the most relevant and diverse queries, reducing computational cost while improving accuracy. Our method achieves state-of-the-art results across multiple benchmarks, demonstrating the power of data-centric approaches in addressing information asymmetry in TVR. This work paves the way for new research focused on leveraging data to improve cross-modal retrieval.

Cite

Text

Bai et al. "Bridging Information Asymmetry in Text-Video Retrieval: A Data-Centric Approach." International Conference on Learning Representations, 2025.

Markdown

[Bai et al. "Bridging Information Asymmetry in Text-Video Retrieval: A Data-Centric Approach." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/bai2025iclr-bridging/)

BibTeX

@inproceedings{bai2025iclr-bridging,
  title     = {{Bridging Information Asymmetry in Text-Video Retrieval: A Data-Centric Approach}},
  author    = {Bai, Zechen and Xiao, Tianjun and He, Tong and Wang, Pichao and Zhang, Zheng and Brox, Thomas and Shou, Mike Zheng},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/bai2025iclr-bridging/}
}