The Analysis of the Expected Change in the Classification Probability of the Predicted Label

Abstract

We present a formalism for estimating the expected change in the probability distribution of the predicted label of an object, with respect to all small perturbations to the object. We first derive analytically an estimate of the expected probability change as a function of the input noise. We then conduct three empirical studies: in the first study, experimental results on image classification show that the proposed measure can be used to distinguish the not-robust label predictions from those that are robust, even when they are all predicted with high confidence. The second study shows that the proposed robustness measure is almost always higher for the predictions on the corrupted images, compared to the predictions on the original versions of them. The final study shows that the proposed measure is lower for models when they are trained using adversarial training approaches.

Cite

Text

Yang et al. "The Analysis of the Expected Change in the Classification Probability of the Predicted Label." Transactions on Machine Learning Research, 2023.

Markdown

[Yang et al. "The Analysis of the Expected Change in the Classification Probability of the Predicted Label." Transactions on Machine Learning Research, 2023.](https://mlanthology.org/tmlr/2023/yang2023tmlr-analysis/)

BibTeX

@article{yang2023tmlr-analysis,
  title     = {{The Analysis of the Expected Change in the Classification Probability of the Predicted Label}},
  author    = {Yang, Ruo and Liu, Ping and Bilgic, Mustafa},
  journal   = {Transactions on Machine Learning Research},
  year      = {2023},
  url       = {https://mlanthology.org/tmlr/2023/yang2023tmlr-analysis/}
}