DLBD: A Self-Supervised Direct-Learned Binary Descriptor

Abstract

For learning-based binary descriptors, the binarization process has not been well addressed. The reason is that the binarization blocks gradient back-propagation. Existing learning-based binary descriptors learn real-valued output, and then it is converted to binary descriptors by their proposed binarization processes. Since their binarization processes are not a component of the network, the learning-based binary descriptor cannot fully utilize the advances of deep learning. To solve this issue, we propose a model-agnostic plugin binary transformation layer (BTL), making the network directly generate binary descriptors. Then, we present the first self-supervised, direct-learned binary descriptor, dubbed DLBD. Furthermore, we propose ultra-wide temperature-scaled cross-entropy loss to adjust the distribution of learned descriptors in a larger range. Experiments demonstrate that the proposed BTL can substitute the previous binarization process. Our proposed DLBD outperforms SOTA on different tasks such as image retrieval and classification.

Cite

Text

Xiao et al. "DLBD: A Self-Supervised Direct-Learned Binary Descriptor." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.01521

Markdown

[Xiao et al. "DLBD: A Self-Supervised Direct-Learned Binary Descriptor." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/xiao2023cvpr-dlbd/) doi:10.1109/CVPR52729.2023.01521

BibTeX

@inproceedings{xiao2023cvpr-dlbd,
  title     = {{DLBD: A Self-Supervised Direct-Learned Binary Descriptor}},
  author    = {Xiao, Bin and Hu, Yang and Liu, Bo and Bi, Xiuli and Li, Weisheng and Gao, Xinbo},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {15846-15855},
  doi       = {10.1109/CVPR52729.2023.01521},
  url       = {https://mlanthology.org/cvpr/2023/xiao2023cvpr-dlbd/}
}