Sharp Bounds for Generalized Uniformity Testing

Abstract

We study the problem of generalized uniformity testing of a discrete probability distribution: Given samples from a probability distribution p over an unknown size discrete domain Ω, we want to distinguish, with probability at least 2/3, between the case that p is uniform on some subset of Ω versus ε-far, in total variation distance, from any such uniform distribution. We establish tight bounds on the sample complexity of generalized uniformity testing. In more detail, we present a computationally efficient tester whose sample complexity is optimal, within constant factors, and a matching worst-case information-theoretic lower bound. Specifically, we show that the sample complexity of generalized uniformity testing is Θ(1/(ε^(4/3) ||p||_3) + 1/(ε^2 ||p||_2 )).

Cite

Text

Diakonikolas et al. "Sharp Bounds for Generalized Uniformity Testing." Neural Information Processing Systems, 2018.

Markdown

[Diakonikolas et al. "Sharp Bounds for Generalized Uniformity Testing." Neural Information Processing Systems, 2018.](https://mlanthology.org/neurips/2018/diakonikolas2018neurips-sharp/)

BibTeX

@inproceedings{diakonikolas2018neurips-sharp,
  title     = {{Sharp Bounds for Generalized Uniformity Testing}},
  author    = {Diakonikolas, Ilias and Kane, Daniel M. and Stewart, Alistair},
  booktitle = {Neural Information Processing Systems},
  year      = {2018},
  pages     = {6201-6210},
  url       = {https://mlanthology.org/neurips/2018/diakonikolas2018neurips-sharp/}
}