Smooth Bandit Optimization: Generalization to Hölder Space
Abstract
We consider bandit optimization of a smooth reward function, where the goal is cumulative regret minimization. This problem has been studied for $\alpha$-Hölder continuous (including Lipschitz) functions with $0<\alpha\leq 1$. Our main result generalizes the reward function class to the Hölder space with exponent $\alpha>1$, bridging the gap between Lipschitz bandits and infinitely-differentiable models such as linear bandits. For Hölder continuous functions, approaches based on random sampling in bins of a discretized domain suffice and are optimal. In contrast, we propose a class of two-layer algorithms that deploy misspecified linear/polynomial bandit algorithms in the bins. We demonstrate that the proposed algorithms can exploit the higher-order smoothness of the function by deriving a regret upper bound of $\tilde{O}(T^\frac{d+\alpha}{d+2\alpha})$ when $\alpha>1$, which matches the existing lower bound. We also study adaptation to unknown function smoothness over a continuous scale of Hölder spaces indexed by $\alpha$, using a bandit model selection approach applied to our proposed two-layer algorithms. We show that it achieves a regret rate matching the existing lower bound for adaptation within the $\alpha\leq 1$ subset.
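The rate interpolates between known regimes: at $\alpha=1$ it recovers the Lipschitz bandit rate $\tilde{O}(T^\frac{d+1}{d+2})$, and as $\alpha\to\infty$ it approaches the $\tilde{O}(\sqrt{T})$ rate of parametric models such as linear bandits. As a rough illustration of the two-layer idea only, and not the authors' algorithm, the sketch below discretizes $[0,1]$ into bins of width on the order of $T^{-1/(d+2\alpha)}$ and runs a local polynomial fit of degree $\lfloor\alpha\rfloor$ with a UCB-style bonus inside each bin; the reward function, the bonus form, and all constants are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical smooth reward on [0, 1] plus Gaussian noise (illustration only).
def reward(x):
    return np.exp(-8 * (x - 0.6) ** 2) + 0.1 * rng.standard_normal()

T = 2000         # horizon
alpha = 2.0      # assumed Holder exponent (alpha > 1)
d = 1            # domain dimension
h = T ** (-1.0 / (d + 2 * alpha))  # bin width suggested by the T^{(d+a)/(d+2a)} rate
n_bins = max(1, int(np.ceil(1.0 / h)))
deg = int(np.floor(alpha))         # local polynomial degree matching the smoothness

# Per-bin history of sampled points and observed rewards.
history = [([], []) for _ in range(n_bins)]

def ucb(b, t):
    """Polynomial-fit estimate at the bin center plus an ad hoc confidence bonus."""
    xs, ys = history[b]
    n = len(xs)
    if n <= deg:                    # not enough data to fit a degree-`deg` polynomial
        return np.inf
    coef = np.polyfit(xs, ys, deg)  # (misspecified) local polynomial regression
    center = (b + 0.5) / n_bins
    return np.polyval(coef, center) + np.sqrt(2 * np.log(t + 1) / n)

for t in range(T):
    b = int(np.argmax([ucb(j, t) for j in range(n_bins)]))
    x = (b + rng.random()) / n_bins  # sample uniformly within the chosen bin
    y = reward(x)
    history[b][0].append(x)
    history[b][1].append(y)

best = max(range(n_bins), key=lambda j: len(history[j][0]))
print(f"bins={n_bins}, poly degree={deg}, most-pulled bin center={(best + 0.5) / n_bins:.3f}")
```

The paper's algorithms come with theoretical guarantees and carefully chosen confidence widths; this sketch only conveys the discretize-then-fit structure of running a polynomial bandit within each bin.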
Cite
Text
Liu et al. "Smooth Bandit Optimization: Generalization to Hölder Space." Artificial Intelligence and Statistics, 2021.
Markdown
[Liu et al. "Smooth Bandit Optimization: Generalization to Hölder Space." Artificial Intelligence and Statistics, 2021.](https://mlanthology.org/aistats/2021/liu2021aistats-smooth/)
BibTeX
@inproceedings{liu2021aistats-smooth,
title = {{Smooth Bandit Optimization: Generalization to H{\"o}lder Space}},
author = {Liu, Yusha and Wang, Yining and Singh, Aarti},
booktitle = {Artificial Intelligence and Statistics},
year = {2021},
pages = {2206-2214},
volume = {130},
url = {https://mlanthology.org/aistats/2021/liu2021aistats-smooth/}
}