Thompson Sampling for Optimizing Stochastic Local Search

Abstract

Stochastic local search (SLS), like many other stochastic optimization algorithms, has several parameters that need to be optimized in order for the algorithm to find high quality solutions within a short amount of time. In this paper, we formulate a stochastic local search bandit ( $\mathtt{SLSB}$ ), which is a novel learning variant of SLS based on multi-armed bandits. $\mathtt{SLSB}$ optimizes SLS over a sequence of stochastic optimization problems and achieves high average cumulative reward. In $\mathtt{SLSB}$ , we study how SLS can be optimized via low degree polynomials in its noise and restart parameters. To determine the coefficients of the polynomials, we present polynomial Thompson Sampling ( $\mathtt{PolyTS}$ ). We derive a regret bound for $\mathtt{PolyTS}$ and validate its performance on synthetic problems of varying difficulty as well as on feature selection problems. Compared to bandits with no assumptions of the reward function and other parameter optimization approaches, our $\mathtt{PolyTS}$ assuming polynomial structure can provide substantially better parameter optimization for SLS. In addition, due to its simple model update, $\mathtt{PolyTS}$ has low computational cost compared to other SLS parameter optimization methods.

Cite

Text

Yu et al. "Thompson Sampling for Optimizing Stochastic Local Search." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017. doi:10.1007/978-3-319-71249-9_30

Markdown

[Yu et al. "Thompson Sampling for Optimizing Stochastic Local Search." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017.](https://mlanthology.org/ecmlpkdd/2017/yu2017ecmlpkdd-thompson/) doi:10.1007/978-3-319-71249-9_30

BibTeX

@inproceedings{yu2017ecmlpkdd-thompson,
  title     = {{Thompson Sampling for Optimizing Stochastic Local Search}},
  author    = {Yu, Tong and Kveton, Branislav and Mengshoel, Ole J.},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2017},
  pages     = {493-510},
  doi       = {10.1007/978-3-319-71249-9_30},
  url       = {https://mlanthology.org/ecmlpkdd/2017/yu2017ecmlpkdd-thompson/}
}