Hard Negative Examples Are Hard, but Useful

Abstract

but useful","Triplet loss is an extremely common approach to distance metric learning. Representations of images from the same class are optimized to be mapped closer together in an embedding space than representations of images from different classes. Much work on triplet losses focuses on selecting the most useful triplets of images to consider, with strategies that select dissimilar examples from the same class or similar examples from different classes. The consensus of previous research is that optimizing with the extit{hardest} negative examples leads to bad training behavior. That's a problem -- these hardest negatives are literally the cases where the distance metric fails to capture semantic similarity. In this paper, we characterize the space of triplets and derive why hard negatives make triplet loss training fail. We offer a simple fix to the loss function and show that, with this fix, optimizing with hard negative examples becomes feasible. This leads to more generalizable features, and image retrieval results that outperform state of the art for datasets with high intra-class variance.

Cite

Text

Xuan et al. "Hard Negative Examples Are Hard, but Useful." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58568-6_8

Markdown

[Xuan et al. "Hard Negative Examples Are Hard, but Useful." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/xuan2020eccv-hard/) doi:10.1007/978-3-030-58568-6_8

BibTeX

@inproceedings{xuan2020eccv-hard,
  title     = {{Hard Negative Examples Are Hard, but Useful}},
  author    = {Xuan, Hong and Stylianou, Abby and Liu, Xiaotong and Pless, Robert},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2020},
  doi       = {10.1007/978-3-030-58568-6_8},
  url       = {https://mlanthology.org/eccv/2020/xuan2020eccv-hard/}
}