On Top-K Selection in Multi-Armed Bandits and Hidden Bipartite Graphs

Abstract

This paper discusses how to efficiently choose from $n$ unknowndistributions the $k$ ones whose means are the greatest by a certainmetric, up to a small relative error. We study the topic under twostandard settings---multi-armed bandits and hidden bipartitegraphs---which differ in the nature of the input distributions. In theformer setting, each distribution can be sampled (in the i.i.d.manner) an arbitrary number of times, whereas in the latter, eachdistribution is defined on a population of a finite size $m$ (andhence, is fully revealed after $m$ samples). For both settings, weprove lower bounds on the total number of samples needed, and proposeoptimal algorithms whose sample complexities match those lower bounds.

Cite

Text

Cao et al. "On Top-K Selection in Multi-Armed Bandits and Hidden Bipartite Graphs." Neural Information Processing Systems, 2015.

Markdown

[Cao et al. "On Top-K Selection in Multi-Armed Bandits and Hidden Bipartite Graphs." Neural Information Processing Systems, 2015.](https://mlanthology.org/neurips/2015/cao2015neurips-topk/)

BibTeX

@inproceedings{cao2015neurips-topk,
  title     = {{On Top-K Selection in Multi-Armed Bandits and Hidden Bipartite Graphs}},
  author    = {Cao, Wei and Li, Jian and Tao, Yufei and Li, Zhize},
  booktitle = {Neural Information Processing Systems},
  year      = {2015},
  pages     = {1036-1044},
  url       = {https://mlanthology.org/neurips/2015/cao2015neurips-topk/}
}