A Three-Layered Approach to Facade Parsing

Abstract

We propose a novel three-layered approach for semantic segmentation of building facades. In the first layer, starting from an oversegmentation of a facade, we employ the recently introduced machine learning technique Recursive Neural Networks (RNN) to obtain a probabilistic interpretation of each segment. In the second layer, initial labeling is augmented with the information coming from specialized facade component detectors. The information is merged using a Markov Random Field. In the third layer, we introduce weak architectural knowledge , which enforces the final reconstruction to be architecturally plausible and consistent. Rigorous tests performed on two existing datasets of building facades demonstrate that we significantly outperform the current-state of the art, even when using outputs from earlier layers of the pipeline. Also, we show how the final output of the third layer can be used to create a procedural reconstruction.

Cite

Text

Martinovic et al. "A Three-Layered Approach to Facade Parsing." European Conference on Computer Vision, 2012. doi:10.1007/978-3-642-33786-4_31

Markdown

[Martinovic et al. "A Three-Layered Approach to Facade Parsing." European Conference on Computer Vision, 2012.](https://mlanthology.org/eccv/2012/martinovic2012eccv-three/) doi:10.1007/978-3-642-33786-4_31

BibTeX

@inproceedings{martinovic2012eccv-three,
  title     = {{A Three-Layered Approach to Facade Parsing}},
  author    = {Martinovic, Andelo and Mathias, Markus and Weissenberg, Julien and Van Gool, Luc},
  booktitle = {European Conference on Computer Vision},
  year      = {2012},
  pages     = {416-429},
  doi       = {10.1007/978-3-642-33786-4_31},
  url       = {https://mlanthology.org/eccv/2012/martinovic2012eccv-three/}
}