Manhattan Scene Understanding Using Monocular, Stereo, and 3D Features

Flint, Alex; Murray, David William; Reid, Ian D.

doi:10.1109/ICCV.2011.6126501

Manhattan Scene Understanding Using Monocular, Stereo, and 3D Features

Alex Flint, David William Murray, Ian D. Reid

ICCV 2011 pp. 2228-2235

doi:10.1109/ICCV.2011.6126501 /iccv/2011/flint2011iccv-manhattan/

Abstract

This paper addresses scene understanding in the context of a moving camera, integrating semantic reasoning ideas from monocular vision with 3D information available through structure-from-motion. We combine geometric and photometric cues in a Bayesian framework, building on recent successes leveraging the indoor Manhattan assumption in monocular vision. We focus on indoor environments and show how to extract key boundaries while ignoring clutter and decorations. To achieve this we present a graphical model that relates photometric cues learned from labeled data, stereo photo-consistency across multiple views, and depth cues derived from structure-from-motion point clouds. We show how to solve MAP inference using dynamic programming, allowing exact, global inference in ~100 ms (in addition to feature computation of under one second) without using specialized hardware. Experiments show our system out-performing the state-of-the-art.

ICCV Semantic Scholar

Cite

Text

Flint et al. "Manhattan Scene Understanding Using Monocular, Stereo, and 3D Features." IEEE/CVF International Conference on Computer Vision, 2011. doi:10.1109/ICCV.2011.6126501

Markdown

[Flint et al. "Manhattan Scene Understanding Using Monocular, Stereo, and 3D Features." IEEE/CVF International Conference on Computer Vision, 2011.](https://mlanthology.org/iccv/2011/flint2011iccv-manhattan/) doi:10.1109/ICCV.2011.6126501

BibTeX

@inproceedings{flint2011iccv-manhattan,
  title     = {{Manhattan Scene Understanding Using Monocular, Stereo, and 3D Features}},
  author    = {Flint, Alex and Murray, David William and Reid, Ian D.},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2011},
  pages     = {2228-2235},
  doi       = {10.1109/ICCV.2011.6126501},
  url       = {https://mlanthology.org/iccv/2011/flint2011iccv-manhattan/}
}