Indoor Scene Structure Analysis for Single Image Depth Estimation

Abstract

We tackle the problem of single image depth estimation, which, without additional knowledge, suffers from many ambiguities. Unlike previous approaches that only reason locally, we propose to exploit the global structure of the scene to estimate its depth. To this end, we introduce a hierarchical representation of the scene, which models local depth jointly with mid-level and global scene structures. We formulate single image depth estimation as inference in a graphical model whose edges let us encode the interactions within and across the different layers of our hierarchy. Our method therefore still produces detailed depth estimates, but also leverages higher-level information about the scene. We demonstrate the benefits of our approach over local depth estimation methods on standard indoor datasets.

Cite

Text

Zhuo et al. "Indoor Scene Structure Analysis for Single Image Depth Estimation." Conference on Computer Vision and Pattern Recognition, 2015. doi:10.1109/CVPR.2015.7298660

Markdown

[Zhuo et al. "Indoor Scene Structure Analysis for Single Image Depth Estimation." Conference on Computer Vision and Pattern Recognition, 2015.](https://mlanthology.org/cvpr/2015/zhuo2015cvpr-indoor/) doi:10.1109/CVPR.2015.7298660

BibTeX

@inproceedings{zhuo2015cvpr-indoor,
  title     = {{Indoor Scene Structure Analysis for Single Image Depth Estimation}},
  author    = {Zhuo, Wei and Salzmann, Mathieu and He, Xuming and Liu, Miaomiao},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2015},
  doi       = {10.1109/CVPR.2015.7298660},
  url       = {https://mlanthology.org/cvpr/2015/zhuo2015cvpr-indoor/}
}