A Three-Layered Approach to Facade Parsing
Abstract
We propose a novel three-layered approach for semantic segmentation of building facades. In the first layer, starting from an oversegmentation of a facade, we employ the recently introduced machine learning technique Recursive Neural Networks (RNN) to obtain a probabilistic interpretation of each segment. In the second layer, initial labeling is augmented with the information coming from specialized facade component detectors. The information is merged using a Markov Random Field. In the third layer, we introduce weak architectural knowledge , which enforces the final reconstruction to be architecturally plausible and consistent. Rigorous tests performed on two existing datasets of building facades demonstrate that we significantly outperform the current-state of the art, even when using outputs from earlier layers of the pipeline. Also, we show how the final output of the third layer can be used to create a procedural reconstruction.
Cite
Text
Martinovic et al. "A Three-Layered Approach to Facade Parsing." European Conference on Computer Vision, 2012. doi:10.1007/978-3-642-33786-4_31Markdown
[Martinovic et al. "A Three-Layered Approach to Facade Parsing." European Conference on Computer Vision, 2012.](https://mlanthology.org/eccv/2012/martinovic2012eccv-three/) doi:10.1007/978-3-642-33786-4_31BibTeX
@inproceedings{martinovic2012eccv-three,
title = {{A Three-Layered Approach to Facade Parsing}},
author = {Martinovic, Andelo and Mathias, Markus and Weissenberg, Julien and Van Gool, Luc},
booktitle = {European Conference on Computer Vision},
year = {2012},
pages = {416-429},
doi = {10.1007/978-3-642-33786-4_31},
url = {https://mlanthology.org/eccv/2012/martinovic2012eccv-three/}
}