The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks
Abstract
The backpropagation algorithm is widely used for training multilayer neural networks. In this publication the gain of its activation function(s) is investigated. Specifically, it is proven that changing the gain of the activation function is equivalent to changing the learning rate and the weights. This simplifies the backpropagation learning rule by eliminating one of its parameters. The theorem can be extended to hold for some well-known variations on the backpropagation algorithm, such as using a momentum term, flat spot elimination, or adaptive gain. Furthermore, it is successfully applied to compensate for the nonstandard gain of optical sigmoids for optical neural networks.
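The equivalence stated in the abstract can be checked numerically. The sketch below is illustrative and not taken from the paper: it assumes a tiny 2-2-1 network without biases, a logistic activation, squared error, and online updates, and all names (`train`, `sigmoid`, the starting weights) are hypothetical. Under these assumptions the chain rule suggests that a network with gain γ, learning rate η, and weights w should behave like a network with gain 1, learning rate γ²η, and weights γw.

```python
import math

def sigmoid(a, gain):
    # Logistic activation with an explicit gain factor (assumed form).
    return 1.0 / (1.0 + math.exp(-gain * a))

def train(w_hidden, w_out, gain, lr, data, epochs):
    """Online backpropagation for a 2-2-1 net without biases (illustrative)."""
    w_hidden = [row[:] for row in w_hidden]
    w_out = w_out[:]
    for _ in range(epochs):
        for x, t in data:
            # Forward pass.
            h = [sigmoid(sum(w * xi for w, xi in zip(row, x)), gain)
                 for row in w_hidden]
            y = sigmoid(sum(w * hj for w, hj in zip(w_out, h)), gain)
            # Backward pass: d/da sigmoid(gain * a) = gain * s * (1 - s).
            delta_out = (y - t) * gain * y * (1.0 - y)
            delta_hid = [delta_out * w_out[j] * gain * h[j] * (1.0 - h[j])
                         for j in range(len(h))]
            # Gradient-descent updates (deltas use the pre-update weights).
            w_out = [w_out[j] - lr * delta_out * h[j] for j in range(len(h))]
            w_hidden = [[row[i] - lr * delta_hid[j] * x[i]
                         for i in range(len(x))]
                        for j, row in enumerate(w_hidden)]
    return w_hidden, w_out

# Hypothetical starting point and toy data (XOR-like targets).
gain, lr = 2.5, 0.1
w_hidden0 = [[0.3, -0.2], [0.1, 0.4]]
w_out0 = [0.5, -0.3]
data = [((0.0, 1.0), 1.0), ((1.0, 0.0), 1.0),
        ((1.0, 1.0), 0.0), ((0.0, 0.0), 0.0)]

# Network A: gain g, learning rate lr, weights w.
wa_h, wa_o = train(w_hidden0, w_out0, gain, lr, data, epochs=50)

# Network B: gain 1, learning rate g^2 * lr, weights g * w.
wb_h, wb_o = train([[gain * w for w in row] for row in w_hidden0],
                   [gain * w for w in w_out0],
                   1.0, gain ** 2 * lr, data, epochs=50)

# If the two runs are equivalent, B's weights stay equal to g times A's.
gap = max(
    max(abs(wb - gain * wa) for ra, rb in zip(wa_h, wb_h)
        for wa, wb in zip(ra, rb)),
    max(abs(wb - gain * wa) for wa, wb in zip(wa_o, wb_o)),
)
print(gap)
```

The printed gap should be at floating-point rounding level, which is consistent with the gain being absorbed into the learning rate and the weights in this simplified setting. Whether the same scaling carries over unchanged to the variants mentioned above (momentum, flat spot elimination, adaptive gain) is addressed by the paper itself and is not tested here.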
Cite
Text
Thimm et al. "The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks." Neural Computation, 1996. doi:10.1162/NECO.1996.8.2.451
Markdown
[Thimm et al. "The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks." Neural Computation, 1996.](https://mlanthology.org/neco/1996/thimm1996neco-interchangeability/) doi:10.1162/NECO.1996.8.2.451
BibTeX
@article{thimm1996neco-interchangeability,
title = {{The Interchangeability of Learning Rate and Gain in Backpropagation Neural Networks}},
author = {Thimm, Georg and Moerland, Perry and Fiesler, Emile},
journal = {Neural Computation},
year = {1996},
pages = {451-460},
doi = {10.1162/NECO.1996.8.2.451},
volume = {8},
url = {https://mlanthology.org/neco/1996/thimm1996neco-interchangeability/}
}