Generalization Analysis of Machine Learning Algorithms via the Worst-Case Data-Generating Probability Measure

Abstract

In this paper, the worst-case probability measure over the data is introduced as a tool for characterizing the generalization capabilities of machine learning algorithms. More specifically, the worst-case probability measure is a Gibbs probability measure and the unique solution to the maximization of the expected loss under a relative entropy constraint with respect to a reference probability measure. Fundamental generalization metrics, such as the sensitivity of the expected loss, the sensitivity of the empirical risk, and the generalization gap are shown to have closed-form expressions involving the worst-case data-generating probability measure. Existing results for the Gibbs algorithm, such as characterizing the generalization gap as a sum of mutual information and lautum information, up to a constant factor, are recovered. A novel parallel is established between the worst-case data-generating probability measure and the Gibbs algorithm. Specifically, the Gibbs probability measure is identified as a fundamental commonality of the model space and the data space for machine learning algorithms.

Cite

Text

Zou et al. "Generalization Analysis of Machine Learning Algorithms via the Worst-Case Data-Generating Probability Measure." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I15.29674

Markdown

[Zou et al. "Generalization Analysis of Machine Learning Algorithms via the Worst-Case Data-Generating Probability Measure." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/zou2024aaai-generalization/) doi:10.1609/AAAI.V38I15.29674

BibTeX

@inproceedings{zou2024aaai-generalization,
  title     = {{Generalization Analysis of Machine Learning Algorithms via the Worst-Case Data-Generating Probability Measure}},
  author    = {Zou, Xinying and Perlaza, Samir M. and Esnaola, Iñaki and Altman, Eitan},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {17271-17279},
  doi       = {10.1609/AAAI.V38I15.29674},
  url       = {https://mlanthology.org/aaai/2024/zou2024aaai-generalization/}
}