On the Existence of a Complexity in Fixed Budget Bandit Identification
Abstract
In fixed budget bandit identification, an algorithm sequentially observes samples from several distributions up to a given final time.It then answers a query about the set of distributions. A good algorithm will have a small probability of error.While that probability decreases exponentially with the final time, the best attainable rate is not known precisely for most identification tasks.We show that if a fixed budget task admits a complexity, defined as a lower bound on the probability of error which is attained by the same algorithm on all bandit problems, then that complexity is determined by the best non-adaptive sampling procedure for that problem.We show that there is no such complexity for several fixed budget identification tasks including Bernoulli best arm identification with two arms: there is no single algorithm that attains everywhere the best possible rate.
Cite
Text
Degenne. "On the Existence of a Complexity in Fixed Budget Bandit Identification." Conference on Learning Theory, 2023.Markdown
[Degenne. "On the Existence of a Complexity in Fixed Budget Bandit Identification." Conference on Learning Theory, 2023.](https://mlanthology.org/colt/2023/degenne2023colt-existence/)BibTeX
@inproceedings{degenne2023colt-existence,
title = {{On the Existence of a Complexity in Fixed Budget Bandit Identification}},
author = {Degenne, Rémy},
booktitle = {Conference on Learning Theory},
year = {2023},
pages = {1131-1154},
volume = {195},
url = {https://mlanthology.org/colt/2023/degenne2023colt-existence/}
}