Football Action Recognition Using Hierarchical LSTM
Abstract
We present a hierarchical recurrent network for understanding team sports activity in image and location sequences. In the hierarchical model, we integrate proposed multiple person-centered features over a temporal sequence based on LSTM's outputs. To achieve this scheme, we introduce the Keeping state in LSTM as one of externally controllable states, and extend the Hierarchical LSTMs to include mechanism for the integration. Experimental results demonstrate effectiveness of the proposed framework involving hierarchical LSTM and person-centered feature. In this study, we demonstrate improvement over the reference model. Specifically, by incorporating the person-centered feature with meta-information (e.g., location data) in our proposed late fusion framework, we also demonstrate increased discriminability of action categories and enhanced robustness against fluctuation in the number of observed players.
Cite
Text
Tsunoda et al. "Football Action Recognition Using Hierarchical LSTM." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2017. doi:10.1109/CVPRW.2017.25Markdown
[Tsunoda et al. "Football Action Recognition Using Hierarchical LSTM." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2017.](https://mlanthology.org/cvprw/2017/tsunoda2017cvprw-football/) doi:10.1109/CVPRW.2017.25BibTeX
@inproceedings{tsunoda2017cvprw-football,
title = {{Football Action Recognition Using Hierarchical LSTM}},
author = {Tsunoda, Takamasa and Komori, Yasuhiro and Matsugu, Masakazu and Harada, Tatsuya},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2017},
pages = {155-163},
doi = {10.1109/CVPRW.2017.25},
url = {https://mlanthology.org/cvprw/2017/tsunoda2017cvprw-football/}
}