DUQ: Dual Uncertainty Quantification for Text-Video Retrieval
Abstract
Text-video retrieval establishes accurate similarity relationships between text and video through feature enhancement and granularity alignment. However, relying solely on similarity to associate intra-pair features and distinguish inter-pair features is insufficient, \textit{e.g.}, when querying a multi-scene video with sparse text or selecting the most relevant video from many similar candidates. In this paper, we propose a novel Dual Uncertainty Quantification (DUQ) model that separately handles uncertainties in intra-pair interaction and inter-pair exclusion. Specifically, to enhance intra-pair interaction, we propose an intra-pair similarity uncertainty module to provide similarity-based trustworthy predictions and explicitly model this uncertainty. To increase inter-pair exclusion, we propose an inter-pair distance uncertainty module to construct a distance-based diversity probability embeding, thereby widening the gap between similar features. The two components work synergistically, jointly improving the calculation of similarity between features. We evaluate our model on six benchmark datasets: MSRVTT (51.2%), DiDeMo, MSVD, LSMDC, Charades, and VATEX, achieving state-of-the-art retrieval performance.
Cite
Text
Liu et al. "DUQ: Dual Uncertainty Quantification for Text-Video Retrieval." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/643Markdown
[Liu et al. "DUQ: Dual Uncertainty Quantification for Text-Video Retrieval." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/liu2025ijcai-duq/) doi:10.24963/IJCAI.2025/643BibTeX
@inproceedings{liu2025ijcai-duq,
title = {{DUQ: Dual Uncertainty Quantification for Text-Video Retrieval}},
author = {Liu, Xin and Yin, Shibai and Wang, Jun and Zhu, Jiaxin and Wang, Xingyang and Yang, Yee-Hong},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2025},
pages = {5779-5787},
doi = {10.24963/IJCAI.2025/643},
url = {https://mlanthology.org/ijcai/2025/liu2025ijcai-duq/}
}