Learning Local Descriptors with a CDF-Based Dynamic Soft Margin

Abstract

The triplet loss is adopted by a variety of learning tasks, such as local feature descriptor learning. However, its standard formulation with a hard margin only leverages part of the training data in each mini-batch. Moreover, the margin is often empirically chosen or determined through computationally expensive validation, and stays unchanged during the entire training session. In this work, we propose a simple yet effective method to overcome the above limitations. The core idea is to replace the hard margin with a non-parametric soft margin, which is dynamically updated. The major observation is that the difficulty of a triplet can be inferred from the cumulative distribution function of the triplets' signed distances to the decision boundary. We demonstrate through experiments on both real-valued and binary local feature descriptors that our method leads to state-of-the-art performance on popular benchmarks, while eliminating the need to determine the best margin.
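The contrast drawn in the abstract can be illustrated with a minimal sketch. It is one plausible reading of the idea, not the authors' implementation. It assumes that a triplet's signed distance to the decision boundary is `d_pos - d_neg`, and that each triplet is weighted by the empirical CDF of those signed distances within the current batch. The function names, the default `margin=1.0`, and the per-batch CDF are all assumptions made for illustration. The abstract says the soft margin is dynamically updated but does not say how, so the sketch omits any tracking of the distribution across batches.

```python
from bisect import bisect_right


def hard_margin_triplet_loss(d_pos, d_neg, margin=1.0):
    """Standard hard-margin triplet loss over a batch of distance pairs.

    Triplets that already satisfy d_pos + margin <= d_neg contribute zero,
    so only part of the batch influences the loss.
    """
    losses = [max(0.0, margin + p - n) for p, n in zip(d_pos, d_neg)]
    return sum(losses) / len(losses)


def empirical_cdf_weights(signed):
    """Empirical CDF value for each entry: fraction of the batch <= that entry."""
    ordered = sorted(signed)
    n = len(ordered)
    return [bisect_right(ordered, s) / n for s in signed]


def cdf_weighted_triplet_loss(d_pos, d_neg):
    """Hypothetical soft-margin variant (an assumption, not the paper's code).

    Each triplet's signed distance (d_pos - d_neg) is weighted by its
    empirical CDF value, so harder triplets (larger signed distance) get
    larger weights and no triplet is discarded outright.
    """
    signed = [p - n for p, n in zip(d_pos, d_neg)]
    weights = empirical_cdf_weights(signed)
    return sum(w * s for w, s in zip(weights, signed)) / len(signed)


# Toy batch of anchor-positive and anchor-negative distances (made-up values).
d_pos = [0.2, 0.5, 0.9, 1.2]
d_neg = [1.5, 1.0, 1.0, 0.8]

signed = [p - n for p, n in zip(d_pos, d_neg)]
weights = empirical_cdf_weights(signed)
hard = hard_margin_triplet_loss(d_pos, d_neg)
soft = cdf_weighted_triplet_loss(d_pos, d_neg)
```

In this toy batch the first triplet contributes nothing under the hard margin, whereas the CDF weighting keeps it in the loss with the smallest weight. No margin value has to be chosen for the second function.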

Cite

Text

Zhang and Rusinkiewicz. "Learning Local Descriptors with a CDF-Based Dynamic Soft Margin." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. doi:10.1109/ICCV.2019.00306

Markdown

[Zhang and Rusinkiewicz. "Learning Local Descriptors with a CDF-Based Dynamic Soft Margin." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.](https://mlanthology.org/iccv/2019/zhang2019iccv-learning/) doi:10.1109/ICCV.2019.00306

BibTeX

@inproceedings{zhang2019iccv-learning,
  title     = {{Learning Local Descriptors with a CDF-Based Dynamic Soft Margin}},
  author    = {Zhang, Linguang and Rusinkiewicz, Szymon},
  booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year      = {2019},
  doi       = {10.1109/ICCV.2019.00306},
  url       = {https://mlanthology.org/iccv/2019/zhang2019iccv-learning/}
}