Confidence-Aware Training of Smoothed Classifiers for Certified Robustness

Jeong, Jongheon; Kim, Seojin; Shin, Jinwoo

doi:10.1609/AAAI.V37I7.25968

Confidence-Aware Training of Smoothed Classifiers for Certified Robustness

Jongheon Jeong, Seojin Kim, Jinwoo Shin

AAAI 2023 pp. 8005-8013

doi:10.1609/AAAI.V37I7.25968 /aaai/2023/jeong2023aaai-confidence/

Abstract

Any classifier can be "smoothed out" under Gaussian noise to build a new classifier that is provably robust to l2-adversarial perturbations, viz., by averaging its predictions over the noise via randomized smoothing. Under the smoothed classifiers, the fundamental trade-off between accuracy and (adversarial) robustness has been well evidenced in the literature: i.e., increasing the robustness of a classifier for an input can be at the expense of decreased accuracy for some other inputs. In this paper, we propose a simple training method leveraging this trade-off to obtain robust smoothed classifiers, in particular, through a sample-wise control of robustness over the training samples. We make this control feasible by using "accuracy under Gaussian noise" as an easy-to-compute proxy of adversarial robustness for an input. Specifically, we differentiate the training objective depending on this proxy to filter out samples that are unlikely to benefit from the worst-case (adversarial) objective. Our experiments show that the proposed method, despite its simplicity, consistently exhibits improved certified robustness upon state-of-the-art training methods. Somewhat surprisingly, we find these improvements persist even for other notions of robustness, e.g., to various types of common corruptions. Code is available at https://github.com/alinlab/smoothing-catrs.

PDF AAAI Semantic Scholar

Cite

Text

Jeong et al. "Confidence-Aware Training of Smoothed Classifiers for Certified Robustness." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I7.25968

Markdown

[Jeong et al. "Confidence-Aware Training of Smoothed Classifiers for Certified Robustness." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/jeong2023aaai-confidence/) doi:10.1609/AAAI.V37I7.25968

BibTeX

@inproceedings{jeong2023aaai-confidence,
  title     = {{Confidence-Aware Training of Smoothed Classifiers for Certified Robustness}},
  author    = {Jeong, Jongheon and Kim, Seojin and Shin, Jinwoo},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {8005-8013},
  doi       = {10.1609/AAAI.V37I7.25968},
  url       = {https://mlanthology.org/aaai/2023/jeong2023aaai-confidence/}
}