DivideMix: Learning with Noisy Labels as Semi-Supervised Learning

Li, Junnan; Socher, Richard; Hoi, Steven C. H.

DivideMix: Learning with Noisy Labels as Semi-Supervised Learning

Junnan Li, Richard Socher, Steven C. H. Hoi

ICLR 2020

/iclr/2020/li2020iclr-dividemix/

Abstract

Deep neural networks are known to be annotation-hungry. Numerous efforts have been devoted to reducing the annotation cost when learning with deep networks. Two prominent directions include learning with noisy labels and semi-supervised learning by exploiting unlabeled data. In this work, we propose DivideMix, a novel framework for learning with noisy labels by leveraging semi-supervised learning techniques. In particular, DivideMix models the per-sample loss distribution with a mixture model to dynamically divide the training data into a labeled set with clean samples and an unlabeled set with noisy samples, and trains the model on both the labeled and unlabeled data in a semi-supervised manner. To avoid confirmation bias, we simultaneously train two diverged networks where each network uses the dataset division from the other network. During the semi-supervised training phase, we improve the MixMatch strategy by performing label co-refinement and label co-guessing on labeled and unlabeled samples, respectively. Experiments on multiple benchmark datasets demonstrate substantial improvements over state-of-the-art methods. Code is available at https://github.com/LiJunnan1992/DivideMix .

PDF ICLR Semantic Scholar

Cite

Text

Li et al. "DivideMix: Learning with Noisy Labels as Semi-Supervised Learning." International Conference on Learning Representations, 2020.

Markdown

[Li et al. "DivideMix: Learning with Noisy Labels as Semi-Supervised Learning." International Conference on Learning Representations, 2020.](https://mlanthology.org/iclr/2020/li2020iclr-dividemix/)

BibTeX

@inproceedings{li2020iclr-dividemix,
  title     = {{DivideMix: Learning with Noisy Labels as Semi-Supervised Learning}},
  author    = {Li, Junnan and Socher, Richard and Hoi, Steven C. H.},
  booktitle = {International Conference on Learning Representations},
  year      = {2020},
  url       = {https://mlanthology.org/iclr/2020/li2020iclr-dividemix/}
}