Unsupervised Extraction of Local Image Descriptors via Relative Distance Ranking Loss

Abstract

State-of-the-art supervised local descriptor learning methods heavily rely on accurately labelled patches for training. However, since the process of labelling patches is laborious and inefficient, supervised training is limited by the availability and scale of training datasets. In comparison, unsupervised learning does not require burdensome data labelling; thus it is not restricted to a specific domain. Furthermore, extracting patches from training images in-volves minimal effort. Nevertheless, most of the existing unsupervised learning based methods are inherently inferior to the handcrafted local descriptors, such as the Scale-Invariant Feature Transform (SIFT). In this paper, we aim to leverage unlabelled data to learn descriptors for image patches by a deep convolutional neural network. We introduce a Relative Distance Ranking Loss (RDRL) that measures the deviation of a generated ranking order of patch similarities against a reference one. Specifically, our approach yields a patch similarity ranking based on the learned embedding of a neural network, and the ranking mechanism minimizes the proposed RDRL by mimicking a reference similarity ranking based on a competent handcrafted feature (i.e., SIFT). To our advantage, after the training process, our network is not only able to measure the patch similarity but also able to outperform SIFT by a large margin on several commonly used benchmark datasets as demonstrated in our extensive experiments.

Cite

Text

Yu et al. "Unsupervised Extraction of Local Image Descriptors via Relative Distance Ranking Loss." IEEE/CVF International Conference on Computer Vision Workshops, 2019. doi:10.1109/ICCVW.2019.00351

Markdown

[Yu et al. "Unsupervised Extraction of Local Image Descriptors via Relative Distance Ranking Loss." IEEE/CVF International Conference on Computer Vision Workshops, 2019.](https://mlanthology.org/iccvw/2019/yu2019iccvw-unsupervised/) doi:10.1109/ICCVW.2019.00351

BibTeX

@inproceedings{yu2019iccvw-unsupervised,
  title     = {{Unsupervised Extraction of Local Image Descriptors via Relative Distance Ranking Loss}},
  author    = {Yu, Xin and Tian, Yurun and Porikli, Fatih and Hartley, Richard and Li, Hongdong and Heijnen, Huub and Balntas, Vassileios},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2019},
  pages     = {2893-2902},
  doi       = {10.1109/ICCVW.2019.00351},
  url       = {https://mlanthology.org/iccvw/2019/yu2019iccvw-unsupervised/}
}