Hierarchical Dynamic Parsing and Encoding for Action Recognition

Su, Bing; Zhou, Jiahuan; Ding, Xiaoqing; Wang, Hao; Wu, Ying

doi:10.1007/978-3-319-46493-0_13

Hierarchical Dynamic Parsing and Encoding for Action Recognition

Bing Su, Jiahuan Zhou, Xiaoqing Ding, Hao Wang, Ying Wu

ECCV 2016 pp. 202-217

doi:10.1007/978-3-319-46493-0_13 /eccv/2016/su2016eccv-hierarchical/

Abstract

A video action generally exhibits quite complex rhythms and non-stationary dynamics. To model such non-uniform dynamics, this paper describes a novel hierarchical dynamic encoding method to capture both the locally smooth dynamics and globally drastic dynamic changes. It provides a multi-layer joint representation for temporal modeling for action recognition. At the first layer, the action sequence is parsed in an unsupervised manner into several smooth-changing stages corresponding to different key poses or temporal structures. The dynamics within each stage are encoded by mean-pooling or learning to rank based encoding. At the second layer, the temporal information of the ordered dynamics extracted from the previous layer is encoded again to form the overall representation. Extensive experiments on a gesture action dataset (Chalearn) and several generic action datasets (Olympic Sports and Hollywood2) have demonstrated the effectiveness of the proposed method.

PDF ECCV Semantic Scholar

Cite

Text

Su et al. "Hierarchical Dynamic Parsing and Encoding for Action Recognition." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-46493-0_13

Markdown

[Su et al. "Hierarchical Dynamic Parsing and Encoding for Action Recognition." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/su2016eccv-hierarchical/) doi:10.1007/978-3-319-46493-0_13

BibTeX

@inproceedings{su2016eccv-hierarchical,
  title     = {{Hierarchical Dynamic Parsing and Encoding for Action Recognition}},
  author    = {Su, Bing and Zhou, Jiahuan and Ding, Xiaoqing and Wang, Hao and Wu, Ying},
  booktitle = {European Conference on Computer Vision},
  year      = {2016},
  pages     = {202-217},
  doi       = {10.1007/978-3-319-46493-0_13},
  url       = {https://mlanthology.org/eccv/2016/su2016eccv-hierarchical/}
}