Statistical Indistinguishability of Learning Algorithms

Abstract

When two different parties use the same learning rule on their own data, how can we test whether the distributions of the two outcomes are similar? In this paper, we study the similarity of outcomes of learning rules through the lens of the Total Variation (TV) distance of distributions. We say that a learning rule is TV indistinguishable if the expected TV distance between the posterior distributions of its outputs, executed on two training data sets drawn independently from the same distribution, is small. We first investigate the learnability of hypothesis classes using TV indistinguishable learners. Our main results are information-theoretic equivalences between TV indistinguishability and existing algorithmic stability notions such as replicability and approximate differential privacy. Then, we provide statistical amplification and boosting algorithms for TV indistinguishable learners.

Cite

Text

Kalavasis et al. "Statistical Indistinguishability of Learning Algorithms." International Conference on Machine Learning, 2023.

Markdown

[Kalavasis et al. "Statistical Indistinguishability of Learning Algorithms." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/kalavasis2023icml-statistical/)

BibTeX

@inproceedings{kalavasis2023icml-statistical,
  title     = {{Statistical Indistinguishability of Learning Algorithms}},
  author    = {Kalavasis, Alkis and Karbasi, Amin and Moran, Shay and Velegkas, Grigoris},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {15586-15622},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/kalavasis2023icml-statistical/}
}