Gaussian Process Planning with Lipschitz Continuous Reward Functions: Towards Unifying Bayesian Optimization, Active Learning, and Beyond

Abstract

This paper presents a novel nonmyopic adaptive Gaussian process planning (GPP) framework endowed with a general class of Lipschitz continuous reward functions that can unify some active learning/sensing and Bayesian optimization criteria and offer practitioners some flexibility to specify their desired choices for defining new tasks/problems. In particular, it utilizes a principled Bayesian sequential decision problem framework for jointly and naturally optimizing the exploration-exploitation trade-off. In general, the resulting induced GPP policy cannot be derived exactly due to an uncountable set of candidate observations. A key contribution of our work here thus lies in exploiting the Lipschitz continuity of the reward functions to solve for a nonmyopic adaptive epsilon-optimal GPP (epsilon-GPP) policy. To plan in real time, we further propose an asymptotically optimal, branch-and-bound anytime variant of epsilon-GPP with performance guarantee. We empirically demonstrate the effectiveness of our epsilon-GPP policy and its anytime variant in Bayesian optimization and an energy harvesting task.

Cite

Text

Ling et al. "Gaussian Process Planning with Lipschitz Continuous Reward Functions: Towards Unifying Bayesian Optimization, Active Learning, and Beyond." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.10210

Markdown

[Ling et al. "Gaussian Process Planning with Lipschitz Continuous Reward Functions: Towards Unifying Bayesian Optimization, Active Learning, and Beyond." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/ling2016aaai-gaussian/) doi:10.1609/AAAI.V30I1.10210

BibTeX

@inproceedings{ling2016aaai-gaussian,
  title     = {{Gaussian Process Planning with Lipschitz Continuous Reward Functions: Towards Unifying Bayesian Optimization, Active Learning, and Beyond}},
  author    = {Ling, Chun Kai and Low, Kian Hsiang and Jaillet, Patrick},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2016},
  pages     = {1860-1866},
  doi       = {10.1609/AAAI.V30I1.10210},
  url       = {https://mlanthology.org/aaai/2016/ling2016aaai-gaussian/}
}