An Empirical Evaluation of Thompson Sampling

Chapelle, Olivier; Li, Lihong

An Empirical Evaluation of Thompson Sampling

NeurIPS 2011 pp. 2249-2257

/neurips/2011/chapelle2011neurips-empirical/

Abstract

Thompson sampling is one of oldest heuristic to address the exploration / exploitation trade-off, but it is surprisingly not very popular in the literature. We present here some empirical results using Thompson sampling on simulated and real data, and show that it is highly competitive. And since this heuristic is very easy to implement, we argue that it should be part of the standard baselines to compare against.

PDF NeurIPS Semantic Scholar

Cite

Text

Chapelle and Li. "An Empirical Evaluation of Thompson Sampling." Neural Information Processing Systems, 2011.

Markdown

[Chapelle and Li. "An Empirical Evaluation of Thompson Sampling." Neural Information Processing Systems, 2011.](https://mlanthology.org/neurips/2011/chapelle2011neurips-empirical/)

BibTeX

@inproceedings{chapelle2011neurips-empirical,
  title     = {{An Empirical Evaluation of Thompson Sampling}},
  author    = {Chapelle, Olivier and Li, Lihong},
  booktitle = {Neural Information Processing Systems},
  year      = {2011},
  pages     = {2249-2257},
  url       = {https://mlanthology.org/neurips/2011/chapelle2011neurips-empirical/}
}