DREAM: Decoupled Discriminative Learning with Bigraph-Aware Alignment for Semi-Supervised 2D-3D Cross-Modal Retrieval
Abstract
With the burst of big data, 2D-3D cross-modal retrieval has received increasing attention, which aims to retrieve relevant data from one modality given the query from the other modality. In this paper, we study an underexplored yet practical problem of semi-supervised 2D-3D cross-modal retrieval, which could suffer from serious label scarcity in real-world applications. Moreover, the huge heterogeneous gap could deteriorate the process of learning from unlabeled data. In this work, we propose a novel approach named Decoupled Discriminative Learning with Bigraph-aware Alignment (DREAM) for semi-supervised 2D-3D cross-modal retrieval. The core of our DREAM is to decouple the label prediction and reliability measurement processes to reduce overconfident samples in discriminative learning. In particular, we enhance a label prediction module with label propagation from labeled samples and additionally introduce a reliability measurement module to learn the scores of predicted labels. To reduce class-related bias, we compare reliability scores with class-specific adaptive thresholds to identify samples for additional learning. In addition, negative labels are estimated for unselected samples, which guides soft semantic learning to make the best use of all the information. To further minimize the heterogeneous gap, we build a bigraph graph that connects cross-modal similar examples and then conduct learning to cluster with most edges kept for alignment. Extensive experiments on several benchmark datasets validate the superiority of the proposed DREAM.
Cite
Text
Zhang et al. "DREAM: Decoupled Discriminative Learning with Bigraph-Aware Alignment for Semi-Supervised 2D-3D Cross-Modal Retrieval." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I12.33441Markdown
[Zhang et al. "DREAM: Decoupled Discriminative Learning with Bigraph-Aware Alignment for Semi-Supervised 2D-3D Cross-Modal Retrieval." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhang2025aaai-dream/) doi:10.1609/AAAI.V39I12.33441BibTeX
@inproceedings{zhang2025aaai-dream,
title = {{DREAM: Decoupled Discriminative Learning with Bigraph-Aware Alignment for Semi-Supervised 2D-3D Cross-Modal Retrieval}},
author = {Zhang, Fan and Wang, Changhu and Cheng, Zebang and Peng, Xiaojiang and Wang, Dongjie and Xiao, Yijia and Chen, Chong and Hua, Xian-Sheng and Luo, Xiao},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {13206-13214},
doi = {10.1609/AAAI.V39I12.33441},
url = {https://mlanthology.org/aaai/2025/zhang2025aaai-dream/}
}