People Watching: Human Actions as a Cue for Single View Geometry

Abstract

We present an approach that exploits the coupling between human actions and scene geometry. We investigate the use of human pose as a cue for single-view 3D scene understanding. Our method builds upon recent advances in still-image pose estimation to extract functional and geometric constraints about the scene. These constraints are then used to improve state-of-the-art single-view 3D scene understanding approaches. The proposed method is validated on a set of monocular time-lapse sequences collected from YouTube and on a dataset of still images of indoor scenes. We demonstrate that observing people performing different actions can significantly improve estimates of 3D scene geometry.

Cite

Text

Fouhey et al. "People Watching: Human Actions as a Cue for Single View Geometry." European Conference on Computer Vision, 2012. doi:10.1007/978-3-642-33715-4_53

Markdown

[Fouhey et al. "People Watching: Human Actions as a Cue for Single View Geometry." European Conference on Computer Vision, 2012.](https://mlanthology.org/eccv/2012/fouhey2012eccv-people/) doi:10.1007/978-3-642-33715-4_53

BibTeX

@inproceedings{fouhey2012eccv-people,
  title     = {{People Watching: Human Actions as a Cue for Single View Geometry}},
  author    = {Fouhey, David F. and Delaitre, Vincent and Gupta, Abhinav and Efros, Alexei A. and Laptev, Ivan and Sivic, Josef},
  booktitle = {European Conference on Computer Vision},
  year      = {2012},
  pages     = {732--745},
  doi       = {10.1007/978-3-642-33715-4_53},
  url       = {https://mlanthology.org/eccv/2012/fouhey2012eccv-people/}
}