The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks

Abstract

The backpropagation algorithm is widely used for training multilayer neural networks. In this publication the gain of its activation function(s) is investigated. In specific, it is proven that changing the gain of the activation function is equivalent to changing the learning rate and the weights. This simplifies the backpropagation learning rule by eliminating one of its parameters. The theorem can be extended to hold for some well-known variations on the backpropagation algorithm, such as using a momentum term, flat spot elimination, or adaptive gain. Furthermore, it is successfully applied to compensate for the nonstandard gain of optical sigmoids for optical neural networks.

Cite

Text

Thimm et al. "The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks." Neural Computation, 1996. doi:10.1162/NECO.1996.8.2.451

Markdown

[Thimm et al. "The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks." Neural Computation, 1996.](https://mlanthology.org/neco/1996/thimm1996neco-interchangeability/) doi:10.1162/NECO.1996.8.2.451

BibTeX

@article{thimm1996neco-interchangeability,
  title     = {{The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks}},
  author    = {Thimm, Georg and Moerland, Perry and Fiesler, Emile},
  journal   = {Neural Computation},
  year      = {1996},
  pages     = {451-460},
  doi       = {10.1162/NECO.1996.8.2.451},
  volume    = {8},
  url       = {https://mlanthology.org/neco/1996/thimm1996neco-interchangeability/}
}