Rethinking Robustness in Machine Learning: A Posterior Agreement Approach

Abstract

The robustness of algorithms against covariate shifts is a fundamental problem with critical implications for the deployment of machine learning algorithms in the real world. Current evaluation methods predominantly measure robustness through the lens of standard generalization, relying on task performance metrics like accuracy. This approach lacks a theoretical justification and underscores the need for a principled foundation of robustness assessment under distribution shifts. In this work, we set the desiderata for a robustness metric, and we propose a novel principled framework for the robustness assessment problem that directly follows the Posterior Agreement (PA) theory of model validation. Specifically, we extend the PA framework to the covariate shift setting and propose a metric for robustness evaluation. We assess the soundness of our metric in controlled environments and through an empirical robustness analysis in two different covariate shift scenarios: adversarial learning and domain generalization. We illustrate the suitability of PA by evaluating several models under different nature and magnitudes of shift, and proportion of affected observations. The results show that PA offers a reliable analysis of the vulnerabilities in learning algorithms across different shift conditions and provides higher discriminability than accuracy-based metrics, while requiring no supervision.

Cite

Text

Carvalho et al. "Rethinking Robustness in Machine Learning: A Posterior Agreement Approach." Transactions on Machine Learning Research, 2025.

Markdown

[Carvalho et al. "Rethinking Robustness in Machine Learning: A Posterior Agreement Approach." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/carvalho2025tmlr-rethinking/)

BibTeX

@article{carvalho2025tmlr-rethinking,
  title     = {{Rethinking Robustness in Machine Learning: A Posterior Agreement Approach}},
  author    = {Carvalho, João B. S. and Rodríguez, Víctor Jiménez and Torcinovich, Alessandro and Cinà, Antonio Emanuele and Cotrini, Carlos and Schönherr, Lea and Buhmann, Joachim M.},
  journal   = {Transactions on Machine Learning Research},
  year      = {2025},
  url       = {https://mlanthology.org/tmlr/2025/carvalho2025tmlr-rethinking/}
}