Generalization Error of Linear Neural Networks in an Empirical Bayes Approach
Abstract
It is well known that in unidentifiable models, Bayes estimation has an advantage in generalization performance over maximum likelihood estimation. However, accurate approximation of the posterior distribution requires substantial computational cost. In this paper, we consider an empirical Bayes approach in which a subset of the parameters is treated as hyperparameters, which we call a subspace Bayes approach, and theoretically analyze the generalization error of three-layer linear neural networks. We show that a subspace Bayes approach is asymptotically equivalent to a positive-part James-Stein type shrinkage estimation, and behaves similarly to Bayes estimation in typical cases.
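For readers unfamiliar with the shrinkage estimator named in the abstract, the following is a minimal sketch of the classical positive-part James-Stein estimator in the Gaussian-mean setting — not the paper's exact subspace Bayes estimator for linear networks. The function name and parameters are illustrative only:

```python
import numpy as np

def positive_part_james_stein(x, sigma2=1.0):
    """Positive-part James-Stein shrinkage of a single observation
    x ~ N(theta, sigma2 * I) in dimension d >= 3.

    Returns max(0, 1 - (d - 2) * sigma2 / ||x||^2) * x, i.e. the
    usual James-Stein factor clipped at zero so the estimate is
    never "over-shrunk" past the origin.
    """
    x = np.asarray(x, dtype=float)
    d = x.size
    if d < 3:
        raise ValueError("James-Stein shrinkage requires dimension >= 3")
    factor = max(0.0, 1.0 - (d - 2) * sigma2 / np.dot(x, x))
    return factor * x
```

When the observation is large relative to the noise, the factor is close to 1 and the estimate is nearly the observation itself; when the observation is small, the positive-part clipping sets the estimate to exactly zero — the same soft-thresholding behavior the paper attributes to the subspace Bayes approach.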
Cite
Text
Nakajima and Watanabe. "Generalization Error of Linear Neural Networks in an Empirical Bayes Approach." International Joint Conference on Artificial Intelligence, 2005.
Markdown
[Nakajima and Watanabe. "Generalization Error of Linear Neural Networks in an Empirical Bayes Approach." International Joint Conference on Artificial Intelligence, 2005.](https://mlanthology.org/ijcai/2005/nakajima2005ijcai-generalization/)
BibTeX
@inproceedings{nakajima2005ijcai-generalization,
title = {{Generalization Error of Linear Neural Networks in an Empirical Bayes Approach}},
author = {Nakajima, Shinichi and Watanabe, Sumio},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2005},
pages = {804-810},
url = {https://mlanthology.org/ijcai/2005/nakajima2005ijcai-generalization/}
}