On-Line Large Scale Semantic Fusion

Abstract

Recent research towards 3D reconstruction has delivered reliable and fast pipelines to obtain accurate volumetric maps of large environments. Alongside, we witness dramatic improvements in the field of semantic segmentation of images due to deployment of deep learning architectures. In this paper, we pursue bridging the semantic gap of purely geometric representations by leveraging on a SLAM pipeline and a deep neural network so to endow surface patches with category labels. In particular, we present the first system that, based on the input stream provided by a commodity RGB-D sensor, can deliver interactively and automatically a map of a large scale environment featuring both geometric as well as semantic information. We also show how the significant computational cost inherent to deployment of a state-of-the-art deep network for semantic labeling does not hinder interactivity thanks to suitable scheduling of the workload on an off-the-shelf PC platform equipped with two GPUs.

Cite

Text

Cavallari and Di Stefano. "On-Line Large Scale Semantic Fusion." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-49409-8_10

Markdown

[Cavallari and Di Stefano. "On-Line Large Scale Semantic Fusion." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/cavallari2016eccv-line/) doi:10.1007/978-3-319-49409-8_10

BibTeX

@inproceedings{cavallari2016eccv-line,
  title     = {{On-Line Large Scale Semantic Fusion}},
  author    = {Cavallari, Tommaso and Di Stefano, Luigi},
  booktitle = {European Conference on Computer Vision},
  year      = {2016},
  pages     = {83-99},
  doi       = {10.1007/978-3-319-49409-8_10},
  url       = {https://mlanthology.org/eccv/2016/cavallari2016eccv-line/}
}