Do Bayesian Neural Networks Need to Be Fully Stochastic?
Abstract
We investigate the benefit of treating all the parameters in a Bayesian neural network stochastically and find compelling theoretical and empirical evidence that this standard construction may be unnecessary. To this end, we prove that expressive predictive distributions require only small amounts of stochasticity. In particular, partially stochastic networks with only n stochastic biases are universal probabilistic predictors for n-dimensional predictive problems. In empirical investigations, we find no systematic benefit of full stochasticity across four different inference modalities and eight datasets; partially stochastic networks can match and sometimes even outperform fully stochastic networks, despite their reduced memory costs.
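To make the idea of partial stochasticity concrete, the following is a minimal sketch and not the authors' implementation. It assumes a one-hidden-layer network whose weights are fixed point estimates, with standard Gaussian noise added to only the first `n_stochastic` hidden biases. The function names, the layer sizes, the tanh activation, and the choice of which biases are stochastic are all illustrative assumptions that do not come from the paper.

```python
import numpy as np


def init_params(rng, in_dim, hidden_dim, out_dim):
    """Deterministic (point-estimate) parameters of a one-hidden-layer MLP."""
    return {
        "W1": rng.normal(0.0, 1.0 / np.sqrt(in_dim), (in_dim, hidden_dim)),
        "b1": np.zeros(hidden_dim),
        "W2": rng.normal(0.0, 1.0 / np.sqrt(hidden_dim), (hidden_dim, out_dim)),
        "b2": np.zeros(out_dim),
    }


def sample_predictions(params, x, n_stochastic, num_samples, rng):
    """Draw predictive samples where only n_stochastic hidden biases are random.

    Hypothetical construction: each sample perturbs the first n_stochastic
    hidden biases with standard Gaussian noise; every other parameter is
    shared, deterministic, and identical across samples.
    """
    hidden_dim = params["b1"].shape[0]
    # Noise has shape (num_samples, 1, hidden_dim) so it broadcasts over the batch.
    noise = np.zeros((num_samples, 1, hidden_dim))
    noise[:, :, :n_stochastic] = rng.standard_normal((num_samples, 1, n_stochastic))
    pre_activation = x @ params["W1"] + params["b1"] + noise  # (samples, batch, hidden)
    hidden = np.tanh(pre_activation)
    return hidden @ params["W2"] + params["b2"]  # (samples, batch, out_dim)
```

Calling `sample_predictions` with `n_stochastic=0` returns the same output for every sample, while any positive value gives a distribution over outputs for each input. In this sketch that distribution is produced entirely by a handful of random biases passed through deterministic weights.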
Cite
Text
Sharma et al. "Do Bayesian Neural Networks Need to Be Fully Stochastic?." Artificial Intelligence and Statistics, 2023.
Markdown
[Sharma et al. "Do Bayesian Neural Networks Need to Be Fully Stochastic?." Artificial Intelligence and Statistics, 2023.](https://mlanthology.org/aistats/2023/sharma2023aistats-bayesian/)
BibTeX
@inproceedings{sharma2023aistats-bayesian,
title = {{Do Bayesian Neural Networks Need to Be Fully Stochastic?}},
author = {Sharma, Mrinank and Farquhar, Sebastian and Nalisnick, Eric and Rainforth, Tom},
booktitle = {Artificial Intelligence and Statistics},
year = {2023},
pages = {7694-7722},
volume = {206},
url = {https://mlanthology.org/aistats/2023/sharma2023aistats-bayesian/}
}