A Tree-Based Approach to Integrated Action Localization, Recognition and Segmentation

Jiang, Zhuolin; Lin, Zhe; Davis, Larry S.

doi:10.1007/978-3-642-35749-7_9

A Tree-Based Approach to Integrated Action Localization, Recognition and Segmentation

Zhuolin Jiang, Zhe Lin, Larry S. Davis

ECCV 2010 pp. 114-127

doi:10.1007/978-3-642-35749-7_9 /eccv/2010/jiang2010eccv-tree/

Abstract

A tree-based approach to integrated action segmentation, localization and recognition is proposed. An action is represented as a sequence of joint hog-flow descriptors extracted independently from each frame. During training, a set of action prototypes is first learned based on a k -means clustering, and then a binary tree model is constructed from the set of action prototypes based on hierarchical k -means clustering. Each tree node is characterized by a shape-motion descriptor and a rejection threshold, and an action segmentation mask is defined for leaf nodes (corresponding to a prototype). During testing, an action is localized by mapping each test frame to a nearest neighbor prototype using a fast matching method to search the learned tree, followed by global filtering refinement. An action is recognized by maximizing the sum of the joint probabilities of the action category and action prototype over test frames. Our approach does not explicitly rely on human tracking and background subtraction, and enables action localization and recognition in realistic and challenging conditions (such as crowded backgrounds). Experimental results show that our approach can achieve recognition rates of 100% on the CMU action dataset and 100% on the Weizmann dataset.

PDF ECCV Semantic Scholar

Cite

Text

Jiang et al. "A Tree-Based Approach to Integrated Action Localization, Recognition and Segmentation." European Conference on Computer Vision, 2010. doi:10.1007/978-3-642-35749-7_9

Markdown

[Jiang et al. "A Tree-Based Approach to Integrated Action Localization, Recognition and Segmentation." European Conference on Computer Vision, 2010.](https://mlanthology.org/eccv/2010/jiang2010eccv-tree/) doi:10.1007/978-3-642-35749-7_9

BibTeX

@inproceedings{jiang2010eccv-tree,
  title     = {{A Tree-Based Approach to Integrated Action Localization, Recognition and Segmentation}},
  author    = {Jiang, Zhuolin and Lin, Zhe and Davis, Larry S.},
  booktitle = {European Conference on Computer Vision},
  year      = {2010},
  pages     = {114-127},
  doi       = {10.1007/978-3-642-35749-7_9},
  url       = {https://mlanthology.org/eccv/2010/jiang2010eccv-tree/}
}