SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans

Abstract

We present a novel approach to reconstructing lightweight, CAD-based representations of scanned 3D environments from commodity RGB-D sensors. Our key idea is to jointly optimize for both CAD model alignments as well as layout estimations of the scanned scene, explicitly modeling inter-relationships between objects-to-objects and objects-to-layout. Since object arrangement and scene layout are intrinsically coupled, we show that treating the problem jointly significantly helps to produce globally-consistent representations of a scene. Object CAD models are aligned to the scene by establishing dense correspondences between geometry, and we introduce a hierarchical layout prediction approach to estimate layout planes from corners and edges of the scene. To this end, we propose a message-passing graph neural network to model the inter-relationships between objects and layout, guiding generation of a globally object alignment in a scene. By considering the global scene layout, we achieve significantly improved CAD alignments compared to state-of-the-art methods, improving from 41.83% to 58.41% alignment accuracy on SUNCG and from 50.05% to 61.24% on ScanNet, respectively. The resulting CAD-based representations makes our method well-suited for applications in content creation such as augmented- or virtual reality.

Cite

Text

Avetisyan et al. "SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58542-6_36

Markdown

[Avetisyan et al. "SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/avetisyan2020eccv-scenecad/) doi:10.1007/978-3-030-58542-6_36

BibTeX

@inproceedings{avetisyan2020eccv-scenecad,
  title     = {{SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans}},
  author    = {Avetisyan, Armen and Khanova, Tatiana and Choy, Christopher and Dash, Denver and Dai, Angela and Nießner, Matthias},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2020},
  doi       = {10.1007/978-3-030-58542-6_36},
  url       = {https://mlanthology.org/eccv/2020/avetisyan2020eccv-scenecad/}
}