Jersey Number Recognition with Semi-Supervised Spatial Transformer Network
Abstract
It is still a challenging task to recognize the jersey number of players on the court in soccer match videos, as the jersey numbers are very small in the object detection task and annotated data are not easy to collect. Based on the object detection results of all the players on the court, a CNN model is first introduced to classify these numbers on the deteced players' images. To localize the jersey number more precisely without involving another digit detector and extra consumption, we then improve the former network to an end-to-end framework by fusing with the spatial transformer network (STN). To further improve the accuracy, we bring extra supervision to STN and upgrade the model to a semi-supervised multi-task learning system, by labeling a small portion of the number areas in the dataset by quadrangle. Extensive experiments illustrate the effectiveness of the proposed framework.
Cite
Text
Li et al. "Jersey Number Recognition with Semi-Supervised Spatial Transformer Network." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018. doi:10.1109/CVPRW.2018.00231Markdown
[Li et al. "Jersey Number Recognition with Semi-Supervised Spatial Transformer Network." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018.](https://mlanthology.org/cvprw/2018/li2018cvprw-jersey/) doi:10.1109/CVPRW.2018.00231BibTeX
@inproceedings{li2018cvprw-jersey,
title = {{Jersey Number Recognition with Semi-Supervised Spatial Transformer Network}},
author = {Li, Gen and Xu, Shikun and Liu, Xiang and Li, Lei and Wang, Changhu},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2018},
pages = {1783-1790},
doi = {10.1109/CVPRW.2018.00231},
url = {https://mlanthology.org/cvprw/2018/li2018cvprw-jersey/}
}