Action Recognition Using Rank-1 Approximation of Joint Self-Similarity Volume
Abstract
In this paper, we make three main contributions in the area of action recognition: (i) We introduce the concept of Joint Self-Similarity Volume (Joint SSV) for modeling dynamical systems, and show that by using a new optimized rank-1 tensor approximation of Joint SSV one can obtain compact low-dimensional descriptors that very accurately preserve the dynamics of the original system, e.g. an action video sequence; (ii) The descriptor vectors derived from the optimized rank-1 approximation make it possible to recognize actions without explicitly aligning the action sequences of varying speed of execution or different frame rates; (iii) The method is generic and can be applied using different low-level features such as silhouettes, histogram of oriented gradients, etc. Hence, it does not necessarily require explicit tracking of features in the space-time volume. Our experimental results on three public datasets demonstrate that our method produces remarkably good results and outperforms all baseline methods.
Cite
Text
Sun et al. "Action Recognition Using Rank-1 Approximation of Joint Self-Similarity Volume." IEEE/CVF International Conference on Computer Vision, 2011. doi:10.1109/ICCV.2011.6126345Markdown
[Sun et al. "Action Recognition Using Rank-1 Approximation of Joint Self-Similarity Volume." IEEE/CVF International Conference on Computer Vision, 2011.](https://mlanthology.org/iccv/2011/sun2011iccv-action/) doi:10.1109/ICCV.2011.6126345BibTeX
@inproceedings{sun2011iccv-action,
title = {{Action Recognition Using Rank-1 Approximation of Joint Self-Similarity Volume}},
author = {Sun, Chuan and Junejo, Imran N. and Foroosh, Hassan},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2011},
pages = {1007-1012},
doi = {10.1109/ICCV.2011.6126345},
url = {https://mlanthology.org/iccv/2011/sun2011iccv-action/}
}