PUMP: Pyramidal and Uniqueness Matching Priors for Unsupervised Learning of Local Descriptors

Abstract

Existing approaches for learning local image descriptors have shown remarkable achievements in a wide range of geometric tasks. However, most of them require per-pixel correspondence-level supervision, which is difficult to acquire at scale and in high quality. In this paper, we propose to explicitly integrate two matching priors in a single loss in order to learn local descriptors without supervision. Given two images depicting the same scene, we extract pixel descriptors and build a correlation volume. The first prior enforces the local consistency of matches in this volume via a pyramidal structure iteratively constructed using a non-parametric module. The second prior exploits the fact that each descriptor should match with at most one descriptor from the other image. We combine our unsupervised loss with a standard self-supervised loss trained from synthetic image augmentations. Feature descriptors learned by the proposed approach outperform their fully- and self-supervised counterparts on various geometric benchmarks such as visual localization and image matching, achieving state-of-the-art performance. Project webpage: https://europe.naverlabs.com/research/3d-vision/pump

Cite

Text

Revaud et al. "PUMP: Pyramidal and Uniqueness Matching Priors for Unsupervised Learning of Local Descriptors." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00390

Markdown

[Revaud et al. "PUMP: Pyramidal and Uniqueness Matching Priors for Unsupervised Learning of Local Descriptors." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/revaud2022cvpr-pump/) doi:10.1109/CVPR52688.2022.00390

BibTeX

@inproceedings{revaud2022cvpr-pump,
  title     = {{PUMP: Pyramidal and Uniqueness Matching Priors for Unsupervised Learning of Local Descriptors}},
  author    = {Revaud, Jérome and Leroy, Vincent and Weinzaepfel, Philippe and Chidlovskii, Boris},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {3926-3936},
  doi       = {10.1109/CVPR52688.2022.00390},
  url       = {https://mlanthology.org/cvpr/2022/revaud2022cvpr-pump/}
}