Fine-Grained Activity Recognition in Baseball Videos
Abstract
In this paper, we introduce a challenging new dataset, MLB-YouTube, designed for fine-grained activity detection. The dataset contains two settings: segmented video classification as well as activity detection in continuous videos. We experimentally compare various recognition approaches capturing temporal structure in activity videos, by classifying segmented videos and extending those approaches to continuous videos. We also compare models on the extremely difficult task of predicting pitch speed and pitch type from broadcast baseball videos. We find that learning temporal structure is valuable for fine-grained activity recognition.
Cite
Text
Piergiovanni and Ryoo. "Fine-Grained Activity Recognition in Baseball Videos." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018. doi:10.1109/CVPRW.2018.00226Markdown
[Piergiovanni and Ryoo. "Fine-Grained Activity Recognition in Baseball Videos." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018.](https://mlanthology.org/cvprw/2018/piergiovanni2018cvprw-finegrained/) doi:10.1109/CVPRW.2018.00226BibTeX
@inproceedings{piergiovanni2018cvprw-finegrained,
title = {{Fine-Grained Activity Recognition in Baseball Videos}},
author = {Piergiovanni, A. J. and Ryoo, Michael S.},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2018},
pages = {1740-1748},
doi = {10.1109/CVPRW.2018.00226},
url = {https://mlanthology.org/cvprw/2018/piergiovanni2018cvprw-finegrained/}
}