Audibert and Bubeck. "Minimax Policies for Adversarial and Stochastic Bandits." Annual Conference on Computational Learning Theory, 2009.
Markdown
[Audibert and Bubeck. "Minimax Policies for Adversarial and Stochastic Bandits." Annual Conference on Computational Learning Theory, 2009.](https://mlanthology.org/colt/2009/audibert2009colt-minimax/)
BibTeX
@inproceedings{audibert2009colt-minimax,
title = {{Minimax Policies for Adversarial and Stochastic Bandits}},
author = {Audibert, Jean-Yves and Bubeck, Sébastien},
booktitle = {Annual Conference on Computational Learning Theory},
year = {2009},
url = {https://mlanthology.org/colt/2009/audibert2009colt-minimax/}
}