Object, Scene and Actions: Combining Multiple Features for Human Action Recognition

Abstract

In many cases, human actions can be identified not only by the singular observation of the human body in motion, but also properties of the surrounding scene and the related objects. In this paper, we look into this problem and propose an approach for human action recognition that integrates multiple feature channels from several entities such as objects, scenes and people. We formulate the problem in a multiple instance learning (MIL) framework, based on multiple feature channels. By using a discriminative approach, we join multiple feature channels embedded to the MIL space. Our experiments over the large YouTube dataset show that scene and object information can be used to complement person features for human action recognition.

Cite

Text

Ikizler-Cinbis and Sclaroff. "Object, Scene and Actions: Combining Multiple Features for Human Action Recognition." European Conference on Computer Vision, 2010. doi:10.1007/978-3-642-15549-9_36

Markdown

[Ikizler-Cinbis and Sclaroff. "Object, Scene and Actions: Combining Multiple Features for Human Action Recognition." European Conference on Computer Vision, 2010.](https://mlanthology.org/eccv/2010/ikizlercinbis2010eccv-object/) doi:10.1007/978-3-642-15549-9_36

BibTeX

@inproceedings{ikizlercinbis2010eccv-object,
  title     = {{Object, Scene and Actions: Combining Multiple Features for Human Action Recognition}},
  author    = {Ikizler-Cinbis, Nazli and Sclaroff, Stan},
  booktitle = {European Conference on Computer Vision},
  year      = {2010},
  pages     = {494-507},
  doi       = {10.1007/978-3-642-15549-9_36},
  url       = {https://mlanthology.org/eccv/2010/ikizlercinbis2010eccv-object/}
}