Deep Normalized Cross-Modal Hashing with Bi-Direction Relation Reasoning
Abstract
Due to the continuous growth of large-scale multi-modal data and increasing requirements for retrieval speed, deep cross-modal hashing has gained increasing attention recently. Most of existing studies take a similarity matrix as supervision to optimize their models, and the inner product between continuous surrogates of hash codes is utilized to depict the similarity in the Hamming space. However, all of them merely consider the relevant information to build the similarity matrix, ignoring the contribution of the irrelevant one, i.e., the categories that samples do not belong to. Therefore, they cannot effectively alleviate the effect of dissimilar samples. Moreover, due to the modality distribution difference, directly utilizing continuous surrogates of hash codes to calculate similarity may induce suboptimal retrieval performance. To tackle these issues, in this paper, we propose a novel deep normalized cross-modal hashing scheme with bi-direction relation reasoning, named Bi_NCMH. Specifically, we build the multi-level semantic similarity matrix by considering bi-direction relation, i.e., consistent and inconsistent relation. It hence can holistically characterize relations among instances. Besides, we execute feature normalization on continuous surrogates of hash codes to eliminate the deviation caused by modality gap, which further reduces the negative impact of binarization on retrieval performance. Extensive experiments on two cross-modal benchmark datasets demonstrate the superiority of our model over several state-of-the-art baselines.
Cite
Text
Sun et al. "Deep Normalized Cross-Modal Hashing with Bi-Direction Relation Reasoning." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00541Markdown
[Sun et al. "Deep Normalized Cross-Modal Hashing with Bi-Direction Relation Reasoning." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/sun2022cvprw-deep/) doi:10.1109/CVPRW56347.2022.00541BibTeX
@inproceedings{sun2022cvprw-deep,
title = {{Deep Normalized Cross-Modal Hashing with Bi-Direction Relation Reasoning}},
author = {Sun, Changchang and Latapie, Hugo and Liu, Gaowen and Yan, Yan},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2022},
pages = {4937-4945},
doi = {10.1109/CVPRW56347.2022.00541},
url = {https://mlanthology.org/cvprw/2022/sun2022cvprw-deep/}
}