Well-Classified Examples Are Underestimated in Classification with Deep Neural Networks
Abstract
The conventional wisdom behind learning deep classification models is to focus on poorly classified examples and ignore well-classified examples that are far from the decision boundary. For instance, when training with cross-entropy loss, examples with higher likelihoods (i.e., well-classified examples) contribute smaller gradients in back-propagation. However, we theoretically show that this common practice hinders representation learning, energy optimization, and margin growth. To counteract this deficiency, we propose to reward well-classified examples with additive bonuses to revive their contribution to the learning process. This counterexample theoretically addresses these three issues. We empirically support this claim either by directly verifying the theoretical results or by demonstrating significant performance improvements with our counterexample on diverse tasks, including image classification, graph classification, and machine translation. Furthermore, this paper shows that, because our idea addresses these three issues, it can handle complex scenarios such as imbalanced classification, out-of-distribution (OOD) detection, and applications under adversarial attacks. Code is available at https://github.com/lancopku/well-classified-examples-are-underestimated.
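The gradient behavior described in the abstract can be checked numerically. The sketch below is illustrative only and is not taken from the authors' repository: the helper names (`softmax`, `cross_entropy`, `bonus_loss`, `grad_wrt_target_logit`, `compare`) are hypothetical, and the bonus term `log(1 - p)` is an assumed form of "additive bonus", since the abstract does not state the exact formula. It compares the gradient of each loss with respect to the correct-class logit as the example becomes better classified.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, target):
    """Standard cross-entropy loss: -log p(target)."""
    return -math.log(softmax(logits)[target])


def bonus_loss(logits, target):
    """Cross-entropy plus an ASSUMED additive bonus log(1 - p).

    The bonus form is an illustration, not confirmed by the abstract.
    It is only finite while p < 1, so keep logit margins moderate.
    """
    p = softmax(logits)[target]
    return -math.log(p) + math.log(1.0 - p)


def grad_wrt_target_logit(loss_fn, logits, target, eps=1e-5):
    """Central finite-difference gradient w.r.t. the target-class logit."""
    up = list(logits)
    down = list(logits)
    up[target] += eps
    down[target] -= eps
    return (loss_fn(up, target) - loss_fn(down, target)) / (2 * eps)


def compare(margins=(0.0, 2.0, 4.0)):
    """Return (p, cross-entropy gradient, bonus-loss gradient) per margin."""
    rows = []
    for margin in margins:
        logits = [margin, 0.0, 0.0]  # class 0 is the correct class
        p = softmax(logits)[0]
        g_ce = grad_wrt_target_logit(cross_entropy, logits, 0)
        g_bonus = grad_wrt_target_logit(bonus_loss, logits, 0)
        rows.append((p, g_ce, g_bonus))
    return rows


if __name__ == "__main__":
    for p, g_ce, g_bonus in compare():
        print(f"p={p:.4f}  dCE/dz={g_ce:.4f}  dBonus/dz={g_bonus:.4f}")
```

For cross-entropy the gradient with respect to the correct-class logit is p - 1, so its magnitude shrinks toward zero as p approaches 1. Under the assumed bonus, the extra term contributes -p, and the total gradient stays at -1 regardless of p, which is one way a well-classified example could keep contributing to learning.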
Cite
Text
Zhao et al. "Well-Classified Examples Are Underestimated in Classification with Deep Neural Networks." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I8.20904
Markdown
[Zhao et al. "Well-Classified Examples Are Underestimated in Classification with Deep Neural Networks." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/zhao2022aaai-well/) doi:10.1609/AAAI.V36I8.20904
BibTeX
@inproceedings{zhao2022aaai-well,
title = {{Well-Classified Examples Are Underestimated in Classification with Deep Neural Networks}},
author = {Zhao, Guangxiang and Yang, Wenkai and Ren, Xuancheng and Li, Lei and Wu, Yunfang and Sun, Xu},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2022},
pages = {9180-9189},
doi = {10.1609/AAAI.V36I8.20904},
url = {https://mlanthology.org/aaai/2022/zhao2022aaai-well/}
}