Siamese Networks: The Tale of Two Manifolds

Abstract

Siamese networks are non-linear deep models that have found their ways into a broad set of problems in learning theory, thanks to their embedding capabilities. In this paper, we study Siamese networks from a new perspective and question the validity of their training procedure. We show that in the majority of cases, the objective of a Siamese network is endowed with an invariance property. Neglecting the invariance property leads to a hindrance in training the Siamese networks. To alleviate this issue, we propose two Riemannian structures and generalize a well-established accelerated stochastic gradient descent method to take into account the proposed Riemannian structures. Our empirical evaluations suggest that by making use of the Riemannian geometry, we achieve state-of-the-art results against several algorithms for the challenging problem of fine-grained image classification.

Cite

Text

Roy et al. "Siamese Networks: The Tale of Two Manifolds." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. doi:10.1109/ICCV.2019.00314

Markdown

[Roy et al. "Siamese Networks: The Tale of Two Manifolds." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.](https://mlanthology.org/iccv/2019/roy2019iccv-siamese/) doi:10.1109/ICCV.2019.00314

BibTeX

@inproceedings{roy2019iccv-siamese,
  title     = {{Siamese Networks: The Tale of Two Manifolds}},
  author    = {Roy, Soumava Kumar and Harandi, Mehrtash and Nock, Richard and Hartley, Richard},
  booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year      = {2019},
  doi       = {10.1109/ICCV.2019.00314},
  url       = {https://mlanthology.org/iccv/2019/roy2019iccv-siamese/}
}