A Piggyback Representation for Action Recognition

Abstract

In video understanding, the spatial patterns formed by local space-time interest points hold discriminative information. We encode these spatial regularities using a word2vec neural network, a tool recently proposed for text processing. Then, building on recent accumulator-based image representation methods, input videos are represented in a hybrid manner: the appearance of local space-time interest points is used to collect and associate the learned descriptors, which capture the spatial patterns. Promising results are shown on recent action recognition benchmarks, using well-established methods as the underlying appearance descriptors.
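
The pipeline the abstract describes can be sketched roughly as follows: quantize local appearance descriptors into a visual vocabulary, treat each video's ordered interest points as a "sentence" of visual words, train word2vec on those sentences so the embeddings capture spatial co-occurrence patterns, and pool the embeddings per video. This is a minimal illustration only, assuming gensim's Word2Vec and scikit-learn's MiniBatchKMeans; the function names, vocabulary size, and mean-pooling choice here are hypothetical and not taken from the paper.

import numpy as np
from sklearn.cluster import MiniBatchKMeans
from gensim.models import Word2Vec

def quantize(per_video_descriptors, n_words=1000):
    # per_video_descriptors: list of (n_points_i, d) arrays of local
    # appearance descriptors (e.g., HOG/HOF around space-time interest
    # points), each assumed ordered by the points' spatial position.
    all_desc = np.vstack(per_video_descriptors)
    km = MiniBatchKMeans(n_clusters=n_words).fit(all_desc)
    # One "sentence" of visual-word tokens per video.
    return [["w%d" % i for i in km.predict(d)] for d in per_video_descriptors]

def embed_videos(sentences, dim=100):
    # word2vec (skip-gram) learns which visual words co-occur within a
    # window, i.e., the spatial regularities of the interest points.
    model = Word2Vec(sentences, vector_size=dim, window=5, min_count=1, sg=1)
    # Mean-pool the word embeddings per video: the appearance-based word
    # assignment "piggybacks" the learned spatial-pattern descriptors.
    return np.stack([np.mean([model.wv[w] for w in s], axis=0)
                     for s in sentences])

The resulting per-video vectors could then be fed to any standard classifier, e.g., a linear SVM.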

Cite

Text

Wolf et al. "A Piggyback Representation for Action Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2014. doi:10.1109/CVPRW.2014.81

Markdown

[Wolf et al. "A Piggyback Representation for Action Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2014.](https://mlanthology.org/cvprw/2014/wolf2014cvprw-piggyback/) doi:10.1109/CVPRW.2014.81

BibTeX

@inproceedings{wolf2014cvprw-piggyback,
  title     = {{A Piggyback Representation for Action Recognition}},
  author    = {Wolf, Lior and Hanani, Yair and Hassner, Tal},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2014},
  pages     = {520--525},
  doi       = {10.1109/CVPRW.2014.81},
  url       = {https://mlanthology.org/cvprw/2014/wolf2014cvprw-piggyback/}
}