Repetitive Reprediction Deep Decipher for Semi-Supervised Learning

Abstract

Most recent semi-supervised deep learning (deep SSL) methods used a similar paradigm: use network predictions to update pseudo-labels and use pseudo-labels to update network parameters iteratively. However, they lack theoretical support and cannot explain why predictions are good candidates for pseudo-labels. In this paper, we propose a principled end-to-end framework named deep decipher (D2) for SSL. Within the D2 framework, we prove that pseudo-labels are related to network predictions by an exponential link function, which gives a theoretical support for using predictions as pseudo-labels. Furthermore, we demonstrate that updating pseudo-labels by network predictions will make them uncertain. To mitigate this problem, we propose a training strategy called repetitive reprediction (R2). Finally, the proposed R2-D2 method is tested on the large-scale ImageNet dataset and outperforms state-of-the-art methods by 5 percentage points.

Cite

Text

Wang and Wu. "Repetitive Reprediction Deep Decipher for Semi-Supervised Learning." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I04.6082

Markdown

[Wang and Wu. "Repetitive Reprediction Deep Decipher for Semi-Supervised Learning." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/wang2020aaai-repetitive/) doi:10.1609/AAAI.V34I04.6082

BibTeX

@inproceedings{wang2020aaai-repetitive,
  title     = {{Repetitive Reprediction Deep Decipher for Semi-Supervised Learning}},
  author    = {Wang, Guo-Hua and Wu, Jianxin},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {6170-6177},
  doi       = {10.1609/AAAI.V34I04.6082},
  url       = {https://mlanthology.org/aaai/2020/wang2020aaai-repetitive/}
}