Bayesian Backpropagation over I-O Functions Rather than Weights

Abstract

The conventional Bayesian justification of backprop is that it finds the MAP weight vector. As this paper shows, to find the MAP i-o function instead one must add a correction tenn to backprop. That tenn biases one towards i-o functions with small description lengths, and in particular fa(cid:173) vors (some kinds of) feature-selection, pruning, and weight-sharing.

Cite

Text

Wolpert. "Bayesian Backpropagation over I-O Functions Rather than Weights." Neural Information Processing Systems, 1993.

Markdown

[Wolpert. "Bayesian Backpropagation over I-O Functions Rather than Weights." Neural Information Processing Systems, 1993.](https://mlanthology.org/neurips/1993/wolpert1993neurips-bayesian/)

BibTeX

@inproceedings{wolpert1993neurips-bayesian,
  title     = {{Bayesian Backpropagation over I-O Functions Rather than Weights}},
  author    = {Wolpert, David H.},
  booktitle = {Neural Information Processing Systems},
  year      = {1993},
  pages     = {200-207},
  url       = {https://mlanthology.org/neurips/1993/wolpert1993neurips-bayesian/}
}