Hockey Action Recognition via Integrated Stacked Hourglass Network
Abstract
A convolutional neural network (CNN) has been designed to interpret player actions in ice hockey video. The hourglass network is employed as the base to generate player pose estimation and layers are added to this network to produce action recognition. As such, the unified architecture is referred to as action recognition hourglass network, or ARHN. ARHN has three components. The first component is the latent pose estimator, the second transforms latent features to a common frame of reference, and the third performs action recognition. Since no benchmark dataset for pose estimation or action recognition is available for hockey players, we generate such an annotated dataset. Experimental results show action recognition accuracy of 65% for four types of actions in hockey. When similar poses are merged to three and two classes, the accuracy rate increases to 71% and 78%, proving the efficacy of the methodology for automated action recognition in hockey.
Cite
Text
Fani et al. "Hockey Action Recognition via Integrated Stacked Hourglass Network." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2017. doi:10.1109/CVPRW.2017.17Markdown
[Fani et al. "Hockey Action Recognition via Integrated Stacked Hourglass Network." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2017.](https://mlanthology.org/cvprw/2017/fani2017cvprw-hockey/) doi:10.1109/CVPRW.2017.17BibTeX
@inproceedings{fani2017cvprw-hockey,
title = {{Hockey Action Recognition via Integrated Stacked Hourglass Network}},
author = {Fani, Mehrnaz and Neher, Helmut and Clausi, David A. and Wong, Alexander and Zelek, John S.},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2017},
pages = {85-93},
doi = {10.1109/CVPRW.2017.17},
url = {https://mlanthology.org/cvprw/2017/fani2017cvprw-hockey/}
}