3D Building Reconstruction from Monocular Remote Sensing Images

Abstract

3D building reconstruction from monocular remote sensing imagery is an important research problem and an economic solution to large-scale city modeling, compared with reconstruction from LiDAR data and multi-view imagery. However, several challenges such as the partial invisibility of building footprints and facades, the serious shadow effect, and the extreme variance of building height in large-scale areas, have restricted the existing monocular image based building reconstruction studies to certain application scenes, i.e., modeling simple low-rise buildings from near-nadir images. In this study, we propose a novel 3D building reconstruction method for monocular remote sensing images, which tackles the above difficulties, thus providing an appealing solution for more complicated scenarios. We design a multi-task building reconstruction network, named MTBR-Net, to learn the geometric property of oblique images, the key components of a 3D building model and their relations via four semantic-related and three offset-related tasks. The network outputs are further integrated by a prior knowledge based 3D model optimization method to produce the the final 3D building models. Results on a public 3D reconstruction dataset and a novel released dataset demonstrate that our method improves the height estimation performance by over 40% and the segmentation F1-score by 2% - 4% compared with current state-of-the-art.

Cite

Text

Li et al. "3D Building Reconstruction from Monocular Remote Sensing Images." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.01232

Markdown

[Li et al. "3D Building Reconstruction from Monocular Remote Sensing Images." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/li2021iccv-3d/) doi:10.1109/ICCV48922.2021.01232

BibTeX

@inproceedings{li2021iccv-3d,
  title     = {{3D Building Reconstruction from Monocular Remote Sensing Images}},
  author    = {Li, Weijia and Meng, Lingxuan and Wang, Jinwang and He, Conghui and Xia, Gui-Song and Lin, Dahua},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {12548-12557},
  doi       = {10.1109/ICCV48922.2021.01232},
  url       = {https://mlanthology.org/iccv/2021/li2021iccv-3d/}
}