A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits

Abstract

We investigate the piecewise-stationary combinatorial semi-bandit problem. Compared to the original combinatorial semi-bandit problem, our setting assumes the reward distributions of base arms may change in a piecewise-stationary manner at unknown time steps. We propose an algorithm, GLR-CUCB, which incorporates an efficient combinatorial semi-bandit algorithm, CUCB, with an almost parameter-free change-point detector, the Generalized Likelihood Ratio Test (GLRT). Our analysis shows that the regret of GLR-CUCB is upper bounded by O(√NKT log T), where N is the number of piecewise-stationary segments, K is the number of base arms, and T is the number of time steps. As a complement, we also derive a nearly matching regret lower bound on the order of Ω(√NKT), for both piecewise-stationary multi-armed bandits and combinatorial semi-bandits, using information-theoretic techniques and judiciously constructed piecewise-stationary bandit instances. Our lower bound is tighter than the best available regret lower bound, which is Ω(√T). Numerical experiments on both synthetic and real-world datasets demonstrate the superiority of GLR-CUCB compared to other state-of-the-art algorithms.

Cite

Text

Zhou et al. "A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I04.6176

Markdown

[Zhou et al. "A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/zhou2020aaai-near/) doi:10.1609/AAAI.V34I04.6176

BibTeX

@inproceedings{zhou2020aaai-near,
  title     = {{A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits}},
  author    = {Zhou, Huozhi and Wang, Lingda and Varshney, Lav R. and Lim, Ee-Peng},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {6933-6940},
  doi       = {10.1609/AAAI.V34I04.6176},
  url       = {https://mlanthology.org/aaai/2020/zhou2020aaai-near/}
}