Video Event Recognition Using Concept Attributes

Abstract

We propose to use action, scene and object concepts as semantic attributes for classification of video events in InTheWild content, such as YouTube videos. We model events using a variety of complementary semantic attribute features developed in a semantic concept space. Our contribution is to systematically demonstrate the advantages of this concept-based event representation (CBER) in applications of video event classification and understanding. Specifically, CBER has better generalization capability, which enables to recognize events with a few training examples. In addition, CBER makes it possible to recognize a novel event without training examples (i.e., zero-shot learning). We further show our proposed enhanced event model can further improve the zero-shot learning. Furthermore, CBER provides a straightforward way for event recounting/understanding. We use the TRECVID Multimedia Event Detection (MED11) open source event definitions and datasets as our test bed and show results on over 1400 hours of videos.

Cite

Text

Liu et al. "Video Event Recognition Using Concept Attributes." IEEE/CVF Winter Conference on Applications of Computer Vision, 2013. doi:10.1109/WACV.2013.6475038

Markdown

[Liu et al. "Video Event Recognition Using Concept Attributes." IEEE/CVF Winter Conference on Applications of Computer Vision, 2013.](https://mlanthology.org/wacv/2013/liu2013wacv-video/) doi:10.1109/WACV.2013.6475038

BibTeX

@inproceedings{liu2013wacv-video,
  title     = {{Video Event Recognition Using Concept Attributes}},
  author    = {Liu, Jingen and Yu, Qian and Javed, Omar and Ali, Saad and Tamrakar, Amir and Divakaran, Ajay and Cheng, Hui and Sawhney, Harpreet S.},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2013},
  pages     = {339-346},
  doi       = {10.1109/WACV.2013.6475038},
  url       = {https://mlanthology.org/wacv/2013/liu2013wacv-video/}
}