Predicting Actions from Static Scenes

Abstract

Human actions naturally co-occur with scenes. In this work we aim to discover action-scene correlation for a large number of scene categories and to use such correlation for action prediction. Towards this goal, we collect a new SUN Action dataset with manual annotations of typical human actions for 397 scenes. We next discover action-scene associations and demonstrate that scene categories can be well identified from their associated actions. Using discovered associations, we address a new task of predicting human actions for images of static scenes. We evaluate prediction of 23 and 38 action classes for images of indoor and outdoor scenes respectively and show promising results. We also propose a new application of geo-localized action prediction and demonstrate ability of our method to automatically answer queries such as “Where is a good place for a picnic?” or “Can I cycle along this path?”.

Cite

Text

Vu et al. "Predicting Actions from Static Scenes." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10602-1_28

Markdown

[Vu et al. "Predicting Actions from Static Scenes." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/vu2014eccv-predicting/) doi:10.1007/978-3-319-10602-1_28

BibTeX

@inproceedings{vu2014eccv-predicting,
  title     = {{Predicting Actions from Static Scenes}},
  author    = {Vu, Tuan-Hung and Olsson, Catherine and Laptev, Ivan and Oliva, Aude and Sivic, Josef},
  booktitle = {European Conference on Computer Vision},
  year      = {2014},
  pages     = {421-436},
  doi       = {10.1007/978-3-319-10602-1_28},
  url       = {https://mlanthology.org/eccv/2014/vu2014eccv-predicting/}
}