Action Recognition with Motion-Appearance Vocabulary Forest

Abstract

In this paper we propose an approach for action recognition based on a vocabulary forest of local motion-appearance features. Large numbers of features with associated motion vectors are extracted from action data and are represented by many vocabulary trees. Features from a query sequence are matched to the trees and vote for action categories and their locations. Large number of trees make the process efficient and robust. The system is capable of simultaneous categorization and localization of actions using only a few frames per sequence. The approach obtains excellent performance on standard action recognition sequences. We perform large scale experiments on 17 challenging real action categories from Olympic Games <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> . We demonstrate the robustness of our method to appearance variations, camera motion, scale change, asymmetric actions, background clutter and occlusion.

Cite

Text

Mikolajczyk and Uemura. "Action Recognition with Motion-Appearance Vocabulary Forest." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008. doi:10.1109/CVPR.2008.4587628

Markdown

[Mikolajczyk and Uemura. "Action Recognition with Motion-Appearance Vocabulary Forest." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008.](https://mlanthology.org/cvpr/2008/mikolajczyk2008cvpr-action/) doi:10.1109/CVPR.2008.4587628

BibTeX

@inproceedings{mikolajczyk2008cvpr-action,
  title     = {{Action Recognition with Motion-Appearance Vocabulary Forest}},
  author    = {Mikolajczyk, Krystian and Uemura, Hirofumi},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2008},
  doi       = {10.1109/CVPR.2008.4587628},
  url       = {https://mlanthology.org/cvpr/2008/mikolajczyk2008cvpr-action/}
}