Cross-Modal Generation and Alignment via Attribute-Guided Prompt for Unsupervised Text-Based Person Retrieval

Li, Zongyi; Li, Jianbo; Shi, Yuxuan; Ling, Hefei; Chen, Jiazhong; Wang, Runsheng; Huang, Shijuan

doi:10.24963/ijcai.2024/116

IJCAI 2024 pp. 1047-1055

Abstract

Visible-infrared person re-identification (VIReID) provides a solution for ReID tasks in 24-hour scenarios; however, achieving satisfactory performance remains challenging because of the substantial discrepancies between the visible (VIS) and infrared (IR) modalities. Existing methods make inadequate use of the information in the different modalities: they focus primarily on mining discriminative features from modality-shared information while neglecting modality-specific details. To fully utilize these differentiated minutiae, we propose a Base-Detail Feature Learning Framework (BDLF) that enhances the learning of both base and detail knowledge, thereby capitalizing on both modality-shared and modality-specific information. Specifically, the proposed BDLF mines detail and base features through a lossless detail feature extraction module and a complementary base embedding generation mechanism, respectively, supported by a novel correlation restriction method that ensures the features learned by BDLF enrich both detail and base knowledge across VIS and IR features. Comprehensive experiments conducted on the SYSU-MM01, RegDB, and LLCM datasets validate the effectiveness of BDLF.

Cite

Text

Li et al. "Cross-Modal Generation and Alignment via Attribute-Guided Prompt for Unsupervised Text-Based Person Retrieval." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/116

Markdown

[Li et al. "Cross-Modal Generation and Alignment via Attribute-Guided Prompt for Unsupervised Text-Based Person Retrieval." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/li2024ijcai-cross/) doi:10.24963/ijcai.2024/116

BibTeX

@inproceedings{li2024ijcai-cross,
  title     = {{Cross-Modal Generation and Alignment via Attribute-Guided Prompt for Unsupervised Text-Based Person Retrieval}},
  author    = {Li, Zongyi and Li, Jianbo and Shi, Yuxuan and Ling, Hefei and Chen, Jiazhong and Wang, Runsheng and Huang, Shijuan},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {1047-1055},
  doi       = {10.24963/ijcai.2024/116},
  url       = {https://mlanthology.org/ijcai/2024/li2024ijcai-cross/}
}