Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization

Abstract

Long-term visual localization is the problem of estimating the camera pose of a given query image in a scene whose appearance changes over time. It is an important problem in practice that is, for example, encountered in autonomous driving. In order to gain robustness to such changes, long-term localization approaches often use segmantic segmentations as an invariant scene representation, as the semantic meaning of each scene part should not be affected by seasonal and other changes. However, these representations are typically not very discriminative due to the very limited number of available classes. In this paper, we propose a novel neural network, the Fine-Grained Segmentation Network (FGSN), that can be used to provide image segmentations with a larger number of labels and can be trained in a self-supervised fashion. In addition, we show how FGSNs can be trained to output consistent labels across seasonal changes. We show through extensive experiments that integrating the fine-grained segmentations produced by our FGSNs into existing localization algorithms leads to substantial improvements in localization performance.

Cite

Text

Larsson et al. "Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. doi:10.1109/ICCV.2019.00012

Markdown

[Larsson et al. "Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.](https://mlanthology.org/iccv/2019/larsson2019iccv-finegrained/) doi:10.1109/ICCV.2019.00012

BibTeX

@inproceedings{larsson2019iccv-finegrained,
  title     = {{Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization}},
  author    = {Larsson, Mans and Stenborg, Erik and Toft, Carl and Hammarstrand, Lars and Sattler, Torsten and Kahl, Fredrik},
  booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year      = {2019},
  doi       = {10.1109/ICCV.2019.00012},
  url       = {https://mlanthology.org/iccv/2019/larsson2019iccv-finegrained/}
}