Spatial Transformer Networks
Abstract
Convolutional Neural Networks define an exceptionallypowerful class of model, but are still limited by the lack of abilityto be spatially invariant to the input data in a computationally and parameterefficient manner. In this work we introduce a new learnable module, theSpatial Transformer, which explicitly allows the spatial manipulation ofdata within the network. This differentiable module can be insertedinto existing convolutional architectures, giving neural networks the ability toactively spatially transform feature maps, conditional on the feature map itself,without any extra training supervision or modification to the optimisation process. We show that the useof spatial transformers results in models which learn invariance to translation,scale, rotation and more generic warping, resulting in state-of-the-artperformance on several benchmarks, and for a numberof classes of transformations.
Cite
Text
Jaderberg et al. "Spatial Transformer Networks." Neural Information Processing Systems, 2015.Markdown
[Jaderberg et al. "Spatial Transformer Networks." Neural Information Processing Systems, 2015.](https://mlanthology.org/neurips/2015/jaderberg2015neurips-spatial/)BibTeX
@inproceedings{jaderberg2015neurips-spatial,
title = {{Spatial Transformer Networks}},
author = {Jaderberg, Max and Simonyan, Karen and Zisserman, Andrew and Kavukcuoglu, Koray},
booktitle = {Neural Information Processing Systems},
year = {2015},
pages = {2017-2025},
url = {https://mlanthology.org/neurips/2015/jaderberg2015neurips-spatial/}
}