Masked Bayesian Neural Networks : Theoretical Guarantee and Its Posterior Inference

Abstract

Bayesian approaches for learning deep neural networks (BNN) have been received much attention and successfully applied to various applications. Particularly, BNNs have the merit of having better generalization ability as well as better uncertainty quantification. For the success of BNN, search an appropriate architecture of the neural networks is an important task, and various algorithms to find good sparse neural networks have been proposed. In this paper, we propose a new node-sparse BNN model which has good theoretical properties and is computationally feasible. We prove that the posterior concentration rate to the true model is near minimax optimal and adaptive to the smoothness of the true model. In particular the adaptiveness is the first of its kind for node-sparse BNNs. In addition, we develop a novel MCMC algorithm which makes the Bayesian inference of the node-sparse BNN model feasible in practice.

Cite

Text

Kong et al. "Masked Bayesian Neural Networks : Theoretical Guarantee and Its Posterior Inference." International Conference on Machine Learning, 2023.

Markdown

[Kong et al. "Masked Bayesian Neural Networks : Theoretical Guarantee and Its Posterior Inference." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/kong2023icml-masked/)

BibTeX

@inproceedings{kong2023icml-masked,
  title     = {{Masked Bayesian Neural Networks : Theoretical Guarantee and Its Posterior Inference}},
  author    = {Kong, Insung and Yang, Dongyoon and Lee, Jongjin and Ohn, Ilsang and Baek, Gyuseung and Kim, Yongdai},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {17462-17491},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/kong2023icml-masked/}
}