Group Equivariant Stand-Alone Self-Attention for Vision

Romero, David W.; Cordonnier, Jean-Baptiste

Group Equivariant Stand-Alone Self-Attention for Vision

David W. Romero, Jean-Baptiste Cordonnier

ICLR 2021

/iclr/2021/romero2021iclr-group/

Abstract

We provide a general self-attention formulation to impose group equivariance to arbitrary symmetry groups. This is achieved by defining positional encodings that are invariant to the action of the group considered. Since the group acts on the positional encoding directly, group equivariant self-attention networks (GSA-Nets) are steerable by nature. Our experiments on vision benchmarks demonstrate consistent improvements of GSA-Nets over non-equivariant self-attention networks.

PDF ICLR Code Semantic Scholar

Cite

Text

Romero and Cordonnier. "Group Equivariant Stand-Alone Self-Attention for Vision." International Conference on Learning Representations, 2021.

Markdown

[Romero and Cordonnier. "Group Equivariant Stand-Alone Self-Attention for Vision." International Conference on Learning Representations, 2021.](https://mlanthology.org/iclr/2021/romero2021iclr-group/)

BibTeX

@inproceedings{romero2021iclr-group,
  title     = {{Group Equivariant Stand-Alone Self-Attention for Vision}},
  author    = {Romero, David W. and Cordonnier, Jean-Baptiste},
  booktitle = {International Conference on Learning Representations},
  year      = {2021},
  url       = {https://mlanthology.org/iclr/2021/romero2021iclr-group/}
}