Streamlining Prediction in Bayesian Deep Learning

Abstract

The rising interest in Bayesian deep learning (BDL) has led to a plethora of methods for estimating the posterior distribution. However, efficient computation of inferences, such as predictions, has been largely overlooked with Monte Carlo integration remaining the standard. In this work we examine streamlining prediction in BDL through a single forward pass without sampling. For this, we use local linearisation of activation functions and local Gaussian approximations at linear layers. Thus allowing us to analytically compute an approximation of the posterior predictive distribution. We showcase our approach for both MLP and transformers, such as ViT and GPT-2, and assess its performance on regression and classification tasks. Open-source library: https://github.com/AaltoML/SUQ.

Cite

Text

Li et al. "Streamlining Prediction in Bayesian Deep Learning." International Conference on Learning Representations, 2025.

Markdown

[Li et al. "Streamlining Prediction in Bayesian Deep Learning." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/li2025iclr-streamlining/)

BibTeX

@inproceedings{li2025iclr-streamlining,
  title     = {{Streamlining Prediction in Bayesian Deep Learning}},
  author    = {Li, Rui and Klasson, Marcus and Solin, Arno and Trapp, Martin},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/li2025iclr-streamlining/}
}