Semantic Visual Localization

Abstract

Robust visual localization under a wide range of viewing conditions is a fundamental problem in computer vision. Handling the difficult cases of this problem is not only very challenging but also of high practical relevance, e.g., in the context of life-long localization for augmented reality or autonomous robots. In this paper, we propose a novel approach based on a joint 3D geometric and semantic understanding of the world, enabling it to succeed under conditions where previous approaches failed. Our method leverages a novel generative model for descriptor learning, trained on semantic scene completion as an auxiliary task. The resulting 3D descriptors are robust to missing observations by encoding high-level 3D geometric and semantic information. Experiments on several challenging large-scale localization datasets demonstrate reliable localization under extreme viewpoint, illumination, and geometry changes.

Cite

Text

Schönberger et al. "Semantic Visual Localization." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00721

Markdown

[Schönberger et al. "Semantic Visual Localization." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/schonberger2018cvpr-semantic/) doi:10.1109/CVPR.2018.00721

BibTeX

@inproceedings{schonberger2018cvpr-semantic,
  title     = {{Semantic Visual Localization}},
  author    = {Schönberger, Johannes L. and Pollefeys, Marc and Geiger, Andreas and Sattler, Torsten},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2018},
  doi       = {10.1109/CVPR.2018.00721},
  url       = {https://mlanthology.org/cvpr/2018/schonberger2018cvpr-semantic/}
}