Thompson Sampling for Optimizing Stochastic Local Search
Abstract
Stochastic local search (SLS), like many other stochastic optimization algorithms, has several parameters that need to be optimized in order for the algorithm to find high quality solutions within a short amount of time. In this paper, we formulate a stochastic local search bandit ( $\mathtt{SLSB}$ ), which is a novel learning variant of SLS based on multi-armed bandits. $\mathtt{SLSB}$ optimizes SLS over a sequence of stochastic optimization problems and achieves high average cumulative reward. In $\mathtt{SLSB}$ , we study how SLS can be optimized via low degree polynomials in its noise and restart parameters. To determine the coefficients of the polynomials, we present polynomial Thompson Sampling ( $\mathtt{PolyTS}$ ). We derive a regret bound for $\mathtt{PolyTS}$ and validate its performance on synthetic problems of varying difficulty as well as on feature selection problems. Compared to bandits with no assumptions of the reward function and other parameter optimization approaches, our $\mathtt{PolyTS}$ assuming polynomial structure can provide substantially better parameter optimization for SLS. In addition, due to its simple model update, $\mathtt{PolyTS}$ has low computational cost compared to other SLS parameter optimization methods.
Cite
Text
Yu et al. "Thompson Sampling for Optimizing Stochastic Local Search." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017. doi:10.1007/978-3-319-71249-9_30Markdown
[Yu et al. "Thompson Sampling for Optimizing Stochastic Local Search." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017.](https://mlanthology.org/ecmlpkdd/2017/yu2017ecmlpkdd-thompson/) doi:10.1007/978-3-319-71249-9_30BibTeX
@inproceedings{yu2017ecmlpkdd-thompson,
title = {{Thompson Sampling for Optimizing Stochastic Local Search}},
author = {Yu, Tong and Kveton, Branislav and Mengshoel, Ole J.},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2017},
pages = {493-510},
doi = {10.1007/978-3-319-71249-9_30},
url = {https://mlanthology.org/ecmlpkdd/2017/yu2017ecmlpkdd-thompson/}
}