Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting

Abstract

We show that label noise exists in adversarial training. Such label noise is due to the mismatch between the true label distribution of adversarial examples and the label inherited from clean examples – the true label distribution is distorted by the adversarial perturbation, but is neglected by the common practice that inherits labels from clean examples. Recognizing label noise sheds insights on the prevalence of robust overfitting in adversarial training, and explains its intriguing dependence on perturbation radius and data quality. Also, our label noise perspective aligns well with our observations of the epoch-wise double descent in adversarial training. Guided by our analyses, we proposed a method to automatically calibrate the label to address the label noise and robust overfitting. Our method achieves consistent performance improvements across various models and datasets without introducing new hyper-parameters or additional tuning.

Cite

Text

Dong et al. "Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting." Neural Information Processing Systems, 2022.

Markdown

[Dong et al. "Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/dong2022neurips-label/)

BibTeX

@inproceedings{dong2022neurips-label,
  title     = {{Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting}},
  author    = {Dong, Chengyu and Liu, Liyuan and Shang, Jingbo},
  booktitle = {Neural Information Processing Systems},
  year      = {2022},
  url       = {https://mlanthology.org/neurips/2022/dong2022neurips-label/}
}