Holistic Scene Understanding for 3D Object Detection with RGBD Cameras

Abstract

In this paper, we tackle the problem of indoor scene understanding using RGBD data. Towards this goal, we propose a holistic approach that exploits 2D segmentation, 3D geometry, as well as contextual relations between scenes and objects. Specifically, we extend the CPMC [3] framework to 3D in order to generate candidate cuboids, and develop a conditional random field to integrate information from different sources to classify the cuboids. With this formulation, scene classification and 3D object recognition are coupled and can be jointly solved through probabilistic inference. We test the effectiveness of our approach on the challenging NYU v2 dataset. The experimental results demonstrate that through effective evidence integration and holistic reasoning, our approach achieves substantial improvement over the state-of-the-art.

Cite

Text

Lin et al. "Holistic Scene Understanding for 3D Object Detection with RGBD Cameras." International Conference on Computer Vision, 2013. doi:10.1109/ICCV.2013.179

Markdown

[Lin et al. "Holistic Scene Understanding for 3D Object Detection with RGBD Cameras." International Conference on Computer Vision, 2013.](https://mlanthology.org/iccv/2013/lin2013iccv-holistic/) doi:10.1109/ICCV.2013.179

BibTeX

@inproceedings{lin2013iccv-holistic,
  title     = {{Holistic Scene Understanding for 3D Object Detection with RGBD Cameras}},
  author    = {Lin, Dahua and Fidler, Sanja and Urtasun, Raquel},
  booktitle = {International Conference on Computer Vision},
  year      = {2013},
  doi       = {10.1109/ICCV.2013.179},
  url       = {https://mlanthology.org/iccv/2013/lin2013iccv-holistic/}
}