A Hierarchical Model of Shape and Appearance for Human Action Classification
Abstract
We present a novel model for human action categorization. A video sequence is represented as a collection of spatial and spatial-temporal features by extracting static and dynamic interest points. We propose a hierarchical model that can be characterized as a constellation of bags-of-features and that is able to combine both spatial and spatial-temporal features. Given a novel video sequence, the model is able to categorize human actions in a frame-by-frame basis. We test the model on a publicly available human action dataset [2] and show that our new method performs well on the classification task. We also conducted control experiments to show that the use of the proposed mixture of hierarchical models improves the classification performance over bag of feature models. An additional experiment shows that using both dynamic and static features provides a richer representation of human actions when compared to the use of a single feature type, as demonstrated by our evaluation in the classification task.
Cite
Text
Niebles and Fei-Fei. "A Hierarchical Model of Shape and Appearance for Human Action Classification." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2007. doi:10.1109/CVPR.2007.383132Markdown
[Niebles and Fei-Fei. "A Hierarchical Model of Shape and Appearance for Human Action Classification." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2007.](https://mlanthology.org/cvpr/2007/niebles2007cvpr-hierarchical/) doi:10.1109/CVPR.2007.383132BibTeX
@inproceedings{niebles2007cvpr-hierarchical,
title = {{A Hierarchical Model of Shape and Appearance for Human Action Classification}},
author = {Niebles, Juan Carlos and Fei-Fei, Li},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2007},
doi = {10.1109/CVPR.2007.383132},
url = {https://mlanthology.org/cvpr/2007/niebles2007cvpr-hierarchical/}
}