Spatial Transformer Networks

Max Jaderberg, Karen Simonyan, Andrew Zisserman, Koray Kavukcuoglu

NeurIPS 2015 pp. 2017-2025

/neurips/2015/jaderberg2015neurips-spatial/

Abstract

Convolutional Neural Networks define an exceptionallypowerful class of model, but are still limited by the lack of abilityto be spatially invariant to the input data in a computationally and parameterefficient manner. In this work we introduce a new learnable module, theSpatial Transformer, which explicitly allows the spatial manipulation ofdata within the network. This differentiable module can be insertedinto existing convolutional architectures, giving neural networks the ability toactively spatially transform feature maps, conditional on the feature map itself,without any extra training supervision or modification to the optimisation process. We show that the useof spatial transformers results in models which learn invariance to translation,scale, rotation and more generic warping, resulting in state-of-the-artperformance on several benchmarks, and for a numberof classes of transformations.

PDF NeurIPS Semantic Scholar

Cite

Text

Jaderberg et al. "Spatial Transformer Networks." Neural Information Processing Systems, 2015.

Markdown

[Jaderberg et al. "Spatial Transformer Networks." Neural Information Processing Systems, 2015.](https://mlanthology.org/neurips/2015/jaderberg2015neurips-spatial/)

BibTeX

@inproceedings{jaderberg2015neurips-spatial,
  title     = {{Spatial Transformer Networks}},
  author    = {Jaderberg, Max and Simonyan, Karen and Zisserman, Andrew and Kavukcuoglu, Koray},
  booktitle = {Neural Information Processing Systems},
  year      = {2015},
  pages     = {2017-2025},
  url       = {https://mlanthology.org/neurips/2015/jaderberg2015neurips-spatial/}
}