A Gradient Based Strategy for Hamiltonian Monte Carlo Hyperparameter Optimization
Abstract
Hamiltonian Monte Carlo (HMC) is one of the most successful sampling methods in machine learning. However, its performance is significantly affected by the choice of hyperparameter values. Existing approaches for optimizing the HMC hyperparameters either optimize a proxy for mixing speed or consider the HMC chain as an implicit variational distribution and optimize a tractable lower bound that can be very loose in practice. Instead, we propose to optimize an objective that quantifies directly the speed of convergence to the target distribution. Our objective can be easily optimized using stochastic gradient descent. We evaluate our proposed method and compare to baselines on a variety of problems including sampling from synthetic 2D distributions, reconstructing sparse signals, learning deep latent variable models and sampling molecular configurations from the Boltzmann distribution of a 22 atom molecule. We find that our method is competitive with or improves upon alternative baselines in all these experiments.
Cite
Text
Campbell et al. "A Gradient Based Strategy for Hamiltonian Monte Carlo Hyperparameter Optimization." International Conference on Machine Learning, 2021.Markdown
[Campbell et al. "A Gradient Based Strategy for Hamiltonian Monte Carlo Hyperparameter Optimization." International Conference on Machine Learning, 2021.](https://mlanthology.org/icml/2021/campbell2021icml-gradient/)BibTeX
@inproceedings{campbell2021icml-gradient,
title = {{A Gradient Based Strategy for Hamiltonian Monte Carlo Hyperparameter Optimization}},
author = {Campbell, Andrew and Chen, Wenlong and Stimper, Vincent and Hernandez-Lobato, Jose Miguel and Zhang, Yichuan},
booktitle = {International Conference on Machine Learning},
year = {2021},
pages = {1238-1248},
volume = {139},
url = {https://mlanthology.org/icml/2021/campbell2021icml-gradient/}
}