Representing Pairwise Spatial and Temporal Relations for Action Recognition

Abstract

The popular bag-of-words paradigm for action recognition tasks is based on building histograms of quantized features, typically at the cost of discarding all information about relationships between them. However, although the beneficial nature of including these relationships seems obvious, in practice finding good representations for feature relationships in video is difficult. We propose a simple and computationally efficient method for expressing pairwise relationships between quantized features that combines the power of discriminative representations with key aspects of Naïve Bayes. We demonstrate how our technique can augment both appearance- and motion-based features, and that it significantly improves performance on both types of features.

Cite

Text

Matikainen et al. "Representing Pairwise Spatial and Temporal Relations for Action Recognition." European Conference on Computer Vision, 2010. doi:10.1007/978-3-642-15549-9_37

Markdown

[Matikainen et al. "Representing Pairwise Spatial and Temporal Relations for Action Recognition." European Conference on Computer Vision, 2010.](https://mlanthology.org/eccv/2010/matikainen2010eccv-representing/) doi:10.1007/978-3-642-15549-9_37

BibTeX

@inproceedings{matikainen2010eccv-representing,
  title     = {{Representing Pairwise Spatial and Temporal Relations for Action Recognition}},
  author    = {Matikainen, Pyry and Hebert, Martial and Sukthankar, Rahul},
  booktitle = {European Conference on Computer Vision},
  year      = {2010},
  pages     = {508-521},
  doi       = {10.1007/978-3-642-15549-9_37},
  url       = {https://mlanthology.org/eccv/2010/matikainen2010eccv-representing/}
}