Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning

Abstract

In this paper we revisit the idea of pseudo-labeling in the context of semi-supervised learning where a learning algorithm has access to a small set of labeled samples and a large set of unlabeled samples. Pseudo-labeling works by applying pseudo-labels to samples in the unlabeled set by using a model trained on combination of the labeled samples and any previously pseudo-labeled samples, and iteratively repeating this process in a self-training cycle. Current methods seem to have abandoned this approach in favor of consistency regularization methods that train models under a combination of different styles of self-supervised losses on the unlabeled samples and standard supervised losses on the labeled samples. We empirically demonstrate that pseudo-labeling can in fact be competitive with the state-of-the-art, while being more resilient to out-of-distribution samples in the unlabeled set. We identify two key factors that allow pseudo-labeling to achieve such remarkable results (1) applying curriculum learning principles and (2) avoiding concept drift by restarting model parameters before each self-training cycle. We obtain 94.91% accuracy on CIFAR-10 using only 4,000 labeled samples, and 68.87% top-1 accuracy on Imagenet-ILSVRC using only 10% of the labeled samples.

Cite

Text

Cascante-Bonilla et al. "Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I8.16852

Markdown

[Cascante-Bonilla et al. "Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/cascantebonilla2021aaai-curriculum/) doi:10.1609/AAAI.V35I8.16852

BibTeX

@inproceedings{cascantebonilla2021aaai-curriculum,
  title     = {{Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning}},
  author    = {Cascante-Bonilla, Paola and Tan, Fuwen and Qi, Yanjun and Ordonez, Vicente},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {6912-6920},
  doi       = {10.1609/AAAI.V35I8.16852},
  url       = {https://mlanthology.org/aaai/2021/cascantebonilla2021aaai-curriculum/}
}