SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos

Abstract

In this paper, we introduce SoccerNet, a benchmark for action spotting in soccer videos. The dataset is composed of 500 complete soccer games from six main European leagues, covering three seasons from 2014 to 2017 and a total duration of 764 hours. A total of 6,637 temporal annotations are automatically parsed from online match reports at a one-minute resolution for three main classes of events (Goal, Yellow/Red Card, and Substitution). As such, the dataset is easily scalable. These annotations are manually refined to a one-second resolution by anchoring them at a single timestamp following well-defined soccer rules. With an average of one event every 6.9 minutes, this dataset focuses on the problem of localizing very sparse events within long videos. We define the task of spotting as finding the anchors of soccer events in a video. Making use of recent developments in the realm of generic action recognition and detection in video, we provide strong baselines for detecting soccer events. We show that our best model for classifying temporal segments of length one minute reaches a mean Average Precision (mAP) of 67.8%. For the spotting task, our baseline reaches an Average-mAP of 49.7% for tolerances d ranging from 5 to 60 seconds. Our dataset and models are available at https://silviogiancola.github.io/SoccerNet.
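
The sparsity figure and the tolerance-based notion of spotting in the abstract can be illustrated with a short sketch. It is not the authors' evaluation code: the function names, the greedy one-to-one matching, the convention that a spot is correct when it lies at most `tolerance` seconds from an anchor, and the 5-second step between tolerances are all assumptions made for illustration. Only the totals (764 hours, 6,637 annotations) and the 5 to 60 second tolerance range come from the abstract.

```python
def average_gap_minutes(total_hours, num_events):
    """Average time between consecutive events, in minutes."""
    return total_hours * 60 / num_events


def count_hits(predicted_spots, anchors, tolerance):
    """Count predicted spots that land near an annotated anchor.

    Hypothetical matching rule (an assumption, not taken from the paper):
    spots are processed in the order given, and each one is matched to the
    nearest still-unmatched anchor that is at most `tolerance` seconds away.
    All times are in seconds.
    """
    unmatched = list(anchors)
    hits = 0
    for spot in predicted_spots:
        candidates = [a for a in unmatched if abs(a - spot) <= tolerance]
        if candidates:
            nearest = min(candidates, key=lambda a: abs(a - spot))
            unmatched.remove(nearest)  # each anchor can be matched only once
            hits += 1
    return hits


# Dataset totals quoted in the abstract.
gap = average_gap_minutes(total_hours=764, num_events=6637)

# Tolerances from 5 to 60 seconds; the 5-second step is an assumption.
tolerances = list(range(5, 61, 5))

# Toy example: three predicted spots against three annotated anchors.
predicted = [100, 130, 905]
anchors = [95, 150, 900]
hits_per_tolerance = {t: count_hits(predicted, anchors, t) for t in tolerances}
```

Under these assumptions, a looser tolerance can only keep or increase the number of matched spots, which is why a metric averaged over the whole tolerance range summarizes both coarse and precise localization.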

Cite

Text

Giancola et al. "SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018. doi:10.1109/CVPRW.2018.00223

Markdown

[Giancola et al. "SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018.](https://mlanthology.org/cvprw/2018/giancola2018cvprw-soccernet/) doi:10.1109/CVPRW.2018.00223

BibTeX

@inproceedings{giancola2018cvprw-soccernet,
  title     = {{SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos}},
  author    = {Giancola, Silvio and Amine, Mohieddine and Dghaily, Tarek and Ghanem, Bernard},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2018},
  pages     = {1711-1721},
  doi       = {10.1109/CVPRW.2018.00223},
  url       = {https://mlanthology.org/cvprw/2018/giancola2018cvprw-soccernet/}
}