Augmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections

Abstract

Image-based 3D reconstruction for Internet photo collections has become a robust technology to produce impressive virtual representations of real-world scenes. However, several fundamental challenges remain for Structure-from-Motion (SfM) pipelines, namely: the placement and reconstruction of transient objects only observed in single views, estimating the absolute scale of the scene, and (suprisingly often) recovering ground surfaces in the scene. We propose a method to jointly address these remaining open problems of SfM. In particular, we focus on detecting people in individual images and accurately placing them into an existing 3D model. As part of this placement, our method also estimates the absolute scale of the scene from object semantics, which in this case constitutes the height distribution of the population. Further, we obtain a smooth approximation of the ground surface and recover the gravity vector of the scene directly from the individual person detections. We demonstrate the results of our approach on a number of unordered Internet photo collections, and we quantitatively evaluate the obtained absolute scene scales.

Cite

Text

Price et al. "Augmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00206

Markdown

[Price et al. "Augmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/price2018cvpr-augmenting/) doi:10.1109/CVPR.2018.00206

BibTeX

@inproceedings{price2018cvpr-augmenting,
  title     = {{Augmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections}},
  author    = {Price, true and Schönberger, Johannes L. and Wei, Zhen and Pollefeys, Marc and Frahm, Jan-Michael},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2018},
  doi       = {10.1109/CVPR.2018.00206},
  url       = {https://mlanthology.org/cvpr/2018/price2018cvpr-augmenting/}
}