Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections

Abstract

Modeling uncertainty in deep neural networks, despite recent important advances, is still an open problem. Bayesian neural networks are a powerful solution, where the prior over network weights is a design choice, often a normal distribution or another distribution that encourages sparsity. However, this prior is agnostic to the generative process of the input data, which might lead to unwarranted generalization for out-of-distribution test data. We suggest the presence of a confounder for the relation between the input data and the discriminative function given the target label. We propose an approach for modeling this confounder by sharing neural connectivity patterns between the generative and discriminative networks. This approach leads to a new deep architecture, where networks are sampled from the posterior of local causal structures and coupled into a compact hierarchy. We demonstrate that sampling networks from this hierarchy, proportionally to their posterior, is efficient and enables estimating various types of uncertainties. Empirical evaluations of our method demonstrate significant improvement compared to state-of-the-art calibration and out-of-distribution detection methods.
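
To make concrete what estimating different types of uncertainty from sampled networks can look like, here is a minimal sketch. It is not the authors' implementation: it assumes each sampled network yields a softmax vector for one input, and it uses the common entropy-based decomposition into total, aleatoric, and epistemic uncertainty, which the abstract does not specify. The function name `uncertainty_from_samples` and its parameters are hypothetical.

```python
import numpy as np


def uncertainty_from_samples(probs, weights=None, eps=1e-12):
    """Entropy-based uncertainty estimates for one input.

    probs:   array of shape (S, C), class probabilities predicted by
             S sampled networks over C classes (assumed input format).
    weights: optional length-S weights, e.g. proportional to each
             network's posterior; uniform if omitted (an assumption).
    """
    probs = np.asarray(probs, dtype=float)
    n_samples = probs.shape[0]
    if weights is None:
        weights = np.full(n_samples, 1.0 / n_samples)
    else:
        weights = np.asarray(weights, dtype=float)
        weights = weights / weights.sum()  # normalize to sum to 1

    # Weighted average prediction across the sampled networks.
    mean_probs = weights @ probs

    # Total uncertainty: entropy of the averaged prediction.
    total = -np.sum(mean_probs * np.log(mean_probs + eps))

    # Aleatoric part: weighted average of each network's own entropy.
    per_sample_entropy = -np.sum(probs * np.log(probs + eps), axis=1)
    aleatoric = weights @ per_sample_entropy

    # Epistemic part: disagreement between networks (mutual information).
    epistemic = total - aleatoric

    return {
        "total": float(total),
        "aleatoric": float(aleatoric),
        "epistemic": float(epistemic),
    }


# Two networks that agree on a 50/50 prediction versus two confident
# networks that disagree with each other.
agree = uncertainty_from_samples([[0.5, 0.5], [0.5, 0.5]])
disagree = uncertainty_from_samples([[1.0, 0.0], [0.0, 1.0]])
```

In this sketch, networks that agree on an ambiguous prediction produce high aleatoric and near-zero epistemic uncertainty, while confident networks that disagree produce the opposite. High epistemic uncertainty is one signal commonly used to flag out-of-distribution inputs.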

Cite

Text

Rohekar et al. "Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections." Neural Information Processing Systems, 2019.

Markdown

[Rohekar et al. "Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections." Neural Information Processing Systems, 2019.](https://mlanthology.org/neurips/2019/rohekar2019neurips-modeling/)

BibTeX

@inproceedings{rohekar2019neurips-modeling,
  title     = {{Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections}},
  author    = {Rohekar, Raanan Yehezkel and Gurwicz, Yaniv and Nisimov, Shami and Novik, Gal},
  booktitle = {Neural Information Processing Systems},
  year      = {2019},
  pages     = {4244--4254},
  url       = {https://mlanthology.org/neurips/2019/rohekar2019neurips-modeling/}
}