Human Interaction Recognition Based on the Co-Occurrence of Visual Words
Abstract
This paper describes a novel methodology for the automated recognition of high-level activities. A key aspect of our framework is the concept of co-occurring visual words for describing interactions between several persons. Motivated by the numerous successes of bag-of-words methods in human activity recognition, we extend this paradigm. A 3-D XYT spatio-temporal volume is generated for each interacting person, and a set of visual words is extracted to represent that person's activity. The interaction is then represented by the frequency of co-occurring visual words between persons. For our experiments, we used the UT-Interaction dataset, which contains several complex human-human interactions.
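The co-occurrence representation can be illustrated with a minimal sketch. The abstract does not specify how co-occurrence is measured, so everything below is an assumption made for illustration: each person's volume is taken to be already quantized into visual-word IDs grouped by temporal window, two words are said to co-occur when they fall in the same window for the two persons, and the function name `cooccurrence_histogram` and its parameters are hypothetical.

```python
from collections import Counter
from itertools import product


def cooccurrence_histogram(words_a, words_b, vocab_size):
    """Flattened, L1-normalized co-occurrence histogram of visual words.

    words_a, words_b: one list of visual-word IDs per temporal window,
    for person A and person B respectively (assumed input layout).
    vocab_size: number of visual words in the codebook.
    """
    if len(words_a) != len(words_b):
        raise ValueError("both persons need the same number of windows")

    # Count every (word of A, word of B) pair seen in the same window.
    counts = Counter()
    for window_a, window_b in zip(words_a, words_b):
        for i, j in product(window_a, window_b):
            counts[(i, j)] += 1

    # Flatten the vocab_size x vocab_size table row by row and
    # normalize the counts into frequencies.
    hist = [0.0] * (vocab_size * vocab_size)
    total = sum(counts.values())
    if total == 0:
        return hist
    for (i, j), c in counts.items():
        hist[i * vocab_size + j] = c / total
    return hist


# Toy example: two windows, a codebook of three visual words.
person_a = [[0, 1], [1]]
person_b = [[2], [2, 0]]
hist = cooccurrence_histogram(person_a, person_b, vocab_size=3)
```

Under these assumptions, the resulting vector could be passed to any standard classifier in place of a per-person bag-of-words histogram; the classifier actually used in the paper is not stated in the abstract.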
Cite
Text
el Houda Slimani et al. "Human Interaction Recognition Based on the Co-Occurrence of Visual Words." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2014. doi:10.1109/CVPRW.2014.74
Markdown
[el Houda Slimani et al. "Human Interaction Recognition Based on the Co-Occurrence of Visual Words." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2014.](https://mlanthology.org/cvprw/2014/elhoudaslimani2014cvprw-human/) doi:10.1109/CVPRW.2014.74
BibTeX
@inproceedings{elhoudaslimani2014cvprw-human,
title = {{Human Interaction Recognition Based on the Co-Occurrence of Visual Words}},
author = {el Houda Slimani, Khadidja Nour and Benezeth, Yannick and Souami, Feriel},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2014},
pages = {461-466},
doi = {10.1109/CVPRW.2014.74},
url = {https://mlanthology.org/cvprw/2014/elhoudaslimani2014cvprw-human/}
}