Convolutional Relational Machine for Group Activity Recognition
Abstract
We present an end-to-end deep Convolutional Neural Network called Convolutional Relational Machine (CRM) for recognizing group activities that utilizes the information in spatial relations between individual persons in image or video. It learns to produce an intermediate spatial representation (activity map) based on individual and group activities. A multi-stage refinement component is responsible for decreasing the incorrect predictions in the activity map. Finally, an aggregation component uses the refined information to recognize group activities. Experimental results demonstrate the constructive contribution of the information extracted and represented in the form of the activity map. CRM shows advantages over state-of-the-art models on Volleyball and Collective Activity datasets.
Cite
Text
Azar et al. "Convolutional Relational Machine for Group Activity Recognition." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00808Markdown
[Azar et al. "Convolutional Relational Machine for Group Activity Recognition." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/azar2019cvpr-convolutional/) doi:10.1109/CVPR.2019.00808BibTeX
@inproceedings{azar2019cvpr-convolutional,
title = {{Convolutional Relational Machine for Group Activity Recognition}},
author = {Azar, Sina Mokhtarzadeh and Atigh, Mina Ghadimi and Nickabadi, Ahmad and Alahi, Alexandre},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2019},
doi = {10.1109/CVPR.2019.00808},
url = {https://mlanthology.org/cvpr/2019/azar2019cvpr-convolutional/}
}