Learning Rotation-Equivariant Features for Visual Correspondence

Abstract

Extracting discriminative local features that are invariant to imaging variations is an integral part of establishing correspondences between images. In this work, we introduce a self-supervised learning framework to extract discriminative rotation-invariant descriptors using group-equivariant CNNs. Thanks to employing group-equivariant CNNs, our method effectively learns to obtain rotation-equivariant features and their orientations explicitly, without having to perform sophisticated data augmentations. The resultant features and their orientations are further processed by group aligning, a novel invariant mapping technique that shifts the group-equivariant features by their orientations along the group dimension. Our group aligning technique achieves rotation-invariance without any collapse of the group dimension and thus eschews loss of discriminability. The proposed method is trained end-to-end in a self-supervised manner, where we use an orientation alignment loss for the orientation estimation and a contrastive descriptor loss for robust local descriptors to geometric/photometric variations. Our method demonstrates state-of-the-art matching accuracy among existing rotation-invariant descriptors under varying rotation and also shows competitive results when transferred to the task of keypoint matching and camera pose estimation.

Cite

Text

Lee et al. "Learning Rotation-Equivariant Features for Visual Correspondence." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.02096

Markdown

[Lee et al. "Learning Rotation-Equivariant Features for Visual Correspondence." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/lee2023cvpr-learning/) doi:10.1109/CVPR52729.2023.02096

BibTeX

@inproceedings{lee2023cvpr-learning,
  title     = {{Learning Rotation-Equivariant Features for Visual Correspondence}},
  author    = {Lee, Jongmin and Kim, Byungjin and Kim, Seungwook and Cho, Minsu},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {21887-21897},
  doi       = {10.1109/CVPR52729.2023.02096},
  url       = {https://mlanthology.org/cvpr/2023/lee2023cvpr-learning/}
}