Semantic Texture for Robust Dense Tracking

Abstract

We argue that robust dense SLAM systems can make valuable use of the layers of features coming from a standard CNN as a pyramid of 'semantic texture' which is suitable for dense alignment while being much more robust to nuisance factors such as lighting than raw RGB values. We use a straightforward Lucas-Kanade formulation of image alignment, with a schedule of iterations over the coarse-to-fine levels of a pyramid, and simply replace the usual image pyramid by the hierarchy of convolutional feature maps from a pre-trained CNN. The resulting dense alignment performance is much more robust to lighting and other variations, as we show by camera rotation tracking experiments on time-lapse sequences captured over many hours. Looking towards the future of scene representation for real-time visual SLAM, we further demonstrate that a selection using simple criteria of a small number of the total set of features output by a CNN gives just as accurate but much more efficient tracking performance.

Cite

Text

Czarnowski et al. "Semantic Texture for Robust Dense Tracking." IEEE/CVF International Conference on Computer Vision Workshops, 2017. doi:10.1109/ICCVW.2017.105

Markdown

[Czarnowski et al. "Semantic Texture for Robust Dense Tracking." IEEE/CVF International Conference on Computer Vision Workshops, 2017.](https://mlanthology.org/iccvw/2017/czarnowski2017iccvw-semantic/) doi:10.1109/ICCVW.2017.105

BibTeX

@inproceedings{czarnowski2017iccvw-semantic,
  title     = {{Semantic Texture for Robust Dense Tracking}},
  author    = {Czarnowski, Jan and Leutenegger, Stefan and Davison, Andrew J.},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2017},
  pages     = {851-859},
  doi       = {10.1109/ICCVW.2017.105},
  url       = {https://mlanthology.org/iccvw/2017/czarnowski2017iccvw-semantic/}
}