UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits

Abstract

In this work, we address the open problem of finding low-complexity near-optimal multi-armed bandit algorithms for sequential decision making problems. Existing bandit algorithms are either sub-optimal and computationally simple (e.g., UCB1) or optimal and computationally complex (e.g., kl-UCB). We propose a boosting approach to Upper Confidence Bound based algorithms for stochastic bandits, that we call UCBoost. Specifically, we propose two types of UCBoost algorithms. We show that UCBoost($D$) enjoys $O(1)$ complexity for each arm per round as well as regret guarantee that is $1/e$-close to that of the kl-UCB algorithm. We propose an approximation-based UCBoost algorithm, UCBoost($\epsilon$), that enjoys a regret guarantee $\epsilon$-close to that of kl-UCB as well as $O(\log(1/\epsilon))$ complexity for each arm per round. Hence, our algorithms provide practitioners a practical way to trade optimality with computational complexity. Finally, we present numerical results which show that UCBoost($\epsilon$) can achieve the same regret performance as the standard kl-UCB while incurring only $1\%$ of the computational cost of kl-UCB.

Cite

Text

Liu et al. "UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/338

Markdown

[Liu et al. "UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/liu2018ijcai-ucboost/) doi:10.24963/IJCAI.2018/338

BibTeX

@inproceedings{liu2018ijcai-ucboost,
  title     = {{UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits}},
  author    = {Liu, Fang and Wang, Sinong and Buccapatnam, Swapna and Shroff, Ness B.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {2440-2446},
  doi       = {10.24963/IJCAI.2018/338},
  url       = {https://mlanthology.org/ijcai/2018/liu2018ijcai-ucboost/}
}