Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo

Wang, Ziyi; Chen, Yujie; Song, Qifan; Zhang, Ruqi

Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo

Ziyi Wang, Yujie Chen, Qifan Song, Ruqi Zhang

TMLR 2024

/tmlr/2024/wang2024tmlr-enhancing-a/

Abstract

Low-precision training has emerged as a promising low-cost technique to enhance the training efficiency of deep neural networks without sacrificing much accuracy. Its Bayesian counterpart can further provide uncertainty quantification and improved generalization accuracy. This paper investigates low-precision sampling via Stochastic Gradient Hamiltonian Monte Carlo (SGHMC) with low-precision and full-precision gradient accumulators for both strongly log-concave and non-log-concave distributions. Theoretically, our results show that to achieve $\epsilon$-error in the 2-Wasserstein distance for non-log-concave distributions, low-precision SGHMC achieves quadratic improvement ($\tilde{\mathcal{O}}\left({\epsilon^{-2}{\mu^*}^{-2}\log^2\left({\epsilon^{-1}}\right)}\right)$) compared to the state-of-the-art low-precision sampler, Stochastic Gradient Langevin Dynamics (SGLD) ($\tilde{\mathcal{O}}\left({{\epsilon}^{-4}{\lambda^{*}}^{-1}\log^5\left({\epsilon^{-1}}\right)}\right)$). Moreover, we prove that low-precision SGHMC is more robust to the quantization error compared to low-precision SGLD due to the robustness of the momentum-based update w.r.t. gradient noise. Empirically, we conduct experiments on synthetic data, and MNIST, CIFAR-10 \& CIFAR-100 datasets, which validate our theoretical findings. Our study highlights the potential of low-precision SGHMC as an efficient and accurate sampling method for large-scale and resource-limited machine learning.

PDF TMLR Code Semantic Scholar

Cite

Text

Wang et al. "Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo." Transactions on Machine Learning Research, 2024.

Markdown

[Wang et al. "Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/wang2024tmlr-enhancing-a/)

BibTeX

@article{wang2024tmlr-enhancing-a,
  title     = {{Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo}},
  author    = {Wang, Ziyi and Chen, Yujie and Song, Qifan and Zhang, Ruqi},
  journal   = {Transactions on Machine Learning Research},
  year      = {2024},
  url       = {https://mlanthology.org/tmlr/2024/wang2024tmlr-enhancing-a/}
}