SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates
Abstract
Uncertainty quantification is a fundamental yet unsolved problem for deep learning. The Bayesian framework provides a principled way of uncertainty estimation but is often not scalable to modern deep neural nets (DNNs) that have a large number of parameters. Non-Bayesian methods are simple to implement but often conflate different sources of uncertainties and require huge computing resources. We propose a new method for quantifying uncertainties of DNNs from a dynamical system perspective. The core of our method is to view DNN transformations as state evolution of a stochastic dynamical system and introduce a Brownian motion term for capturing epistemic uncertainty. Based on this perspective, we propose a neural stochastic differential equation model (SDE-Net) which consists of (1) a drift net that controls the system to fit the predictive function; and (2) a diffusion net that captures epistemic uncertainty. We theoretically analyze the existence and uniqueness of the solution to SDE-Net. Our experiments demonstrate that the SDE-Net model can outperform existing uncertainty estimation methods across a series of tasks where uncertainty plays a fundamental role.
Cite
Text
Kong et al. "SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates." International Conference on Machine Learning, 2020.Markdown
[Kong et al. "SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates." International Conference on Machine Learning, 2020.](https://mlanthology.org/icml/2020/kong2020icml-sdenet/)BibTeX
@inproceedings{kong2020icml-sdenet,
title = {{SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates}},
author = {Kong, Lingkai and Sun, Jimeng and Zhang, Chao},
booktitle = {International Conference on Machine Learning},
year = {2020},
pages = {5405-5415},
volume = {119},
url = {https://mlanthology.org/icml/2020/kong2020icml-sdenet/}
}