Mining Motion Atoms and Phrases for Complex Action Recognition
Abstract
This paper proposes motion atom and phrase as a midlevel temporal "part" for representing and classifying complex action. Motion atom is defined as an atomic part of action, and captures the motion information of action video in a short temporal scale. Motion phrase is a temporal composite of multiple motion atoms with an AND/OR structure, which further enhances the discriminative ability of motion atoms by incorporating temporal constraints in a longer scale. Specifically, given a set of weakly labeled action videos, we firstly design a discriminative clustering method to automatically discover a set of representative motion atoms. Then, based on these motion atoms, we mine effective motion phrases with high discriminative and representative power. We introduce a bottom-up phrase construction algorithm and a greedy selection method for this mining task. We examine the classification performance of the motion atom and phrase based representation on two complex action datasets: Olympic Sports and UCF50. Experimental results show that our method achieves superior performance over recent published methods on both datasets.
Cite
Text
Wang et al. "Mining Motion Atoms and Phrases for Complex Action Recognition." International Conference on Computer Vision, 2013. doi:10.1109/ICCV.2013.333Markdown
[Wang et al. "Mining Motion Atoms and Phrases for Complex Action Recognition." International Conference on Computer Vision, 2013.](https://mlanthology.org/iccv/2013/wang2013iccv-mining/) doi:10.1109/ICCV.2013.333BibTeX
@inproceedings{wang2013iccv-mining,
title = {{Mining Motion Atoms and Phrases for Complex Action Recognition}},
author = {Wang, Limin and Qiao, Yu and Tang, Xiaoou},
booktitle = {International Conference on Computer Vision},
year = {2013},
doi = {10.1109/ICCV.2013.333},
url = {https://mlanthology.org/iccv/2013/wang2013iccv-mining/}
}