Siamese Networks: The Tale of Two Manifolds
Abstract
Siamese networks are non-linear deep models that have found their ways into a broad set of problems in learning theory, thanks to their embedding capabilities. In this paper, we study Siamese networks from a new perspective and question the validity of their training procedure. We show that in the majority of cases, the objective of a Siamese network is endowed with an invariance property. Neglecting the invariance property leads to a hindrance in training the Siamese networks. To alleviate this issue, we propose two Riemannian structures and generalize a well-established accelerated stochastic gradient descent method to take into account the proposed Riemannian structures. Our empirical evaluations suggest that by making use of the Riemannian geometry, we achieve state-of-the-art results against several algorithms for the challenging problem of fine-grained image classification.
Cite
Text
Roy et al. "Siamese Networks: The Tale of Two Manifolds." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. doi:10.1109/ICCV.2019.00314Markdown
[Roy et al. "Siamese Networks: The Tale of Two Manifolds." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.](https://mlanthology.org/iccv/2019/roy2019iccv-siamese/) doi:10.1109/ICCV.2019.00314BibTeX
@inproceedings{roy2019iccv-siamese,
title = {{Siamese Networks: The Tale of Two Manifolds}},
author = {Roy, Soumava Kumar and Harandi, Mehrtash and Nock, Richard and Hartley, Richard},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
year = {2019},
doi = {10.1109/ICCV.2019.00314},
url = {https://mlanthology.org/iccv/2019/roy2019iccv-siamese/}
}