Data Distillation: Towards Omni-Supervised Learning
Abstract
We investigate omni-supervised learning, a special regime of semi-supervised learning in which the learner exploits all available labeled data plus internet-scale sources of unlabeled data. Omni-supervised learning is lower-bounded by performance on existing labeled datasets, offering the potential to surpass state-of-the-art fully supervised methods. To exploit the omni-supervised setting, we propose data distillation, a method that ensembles predictions from multiple transformations of unlabeled data, using a single model, to automatically generate new training annotations. We argue that visual recognition models have recently become accurate enough that it is now possible to apply classic ideas about self-training to challenging real-world data. Our experimental results show that in the cases of human keypoint detection and general object detection, state-of-the-art models trained with data distillation surpass the performance of using labeled data from the COCO dataset alone.
Cite
Text
Radosavovic et al. "Data Distillation: Towards Omni-Supervised Learning." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00433Markdown
[Radosavovic et al. "Data Distillation: Towards Omni-Supervised Learning." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/radosavovic2018cvpr-data/) doi:10.1109/CVPR.2018.00433BibTeX
@inproceedings{radosavovic2018cvpr-data,
title = {{Data Distillation: Towards Omni-Supervised Learning}},
author = {Radosavovic, Ilija and Dollár, Piotr and Girshick, Ross and Gkioxari, Georgia and He, Kaiming},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2018},
doi = {10.1109/CVPR.2018.00433},
url = {https://mlanthology.org/cvpr/2018/radosavovic2018cvpr-data/}
}