Relating the Slope of the Activation Function and the Learning Rate Within a Recurrent Neural Network

Abstract

A relationship between the learning rate η of the learning algorithm and the slope β of the nonlinear activation function is provided for a class of recurrent neural networks (RNNs) trained by the real-time recurrent learning (RTRL) algorithm. It is shown that an arbitrary RNN can be obtained from a referent RNN by imposing deterministic rules on its weights and learning rate. Such relationships reduce the number of degrees of freedom when solving the nonlinear optimization task of finding the optimal RNN parameters.
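As a concrete illustration of the kind of deterministic rule the abstract refers to: for the logistic activation Φ(x) = 1/(1 + e^{-βx}), a network with slope β and learning rate η can be emulated by a referent network with unit slope, weights scaled by β, and learning rate β²η. The minimal sketch below checks this numerically for a single recurrent neuron trained by RTRL; the function names and the toy input/target signals are illustrative assumptions, not taken from the paper.

# Minimal sketch of the slope/learning-rate equivalence for one recurrent
# neuron trained by RTRL. The specific scaling used here (referent weights
# = beta * weights, referent learning rate = beta^2 * eta, logistic
# activation) is an assumption for illustration, not the paper's full
# derivation.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_rtrl(beta, eta, w0, x, d, steps):
    """RTRL for one recurrent neuron with input x(k), feedback y(k-1), bias.

    Activation: y = sigmoid(beta * net). Returns the weight trajectory.
    """
    w = w0.copy()
    y_prev = 0.0
    pi = np.zeros(3)                         # sensitivities dy/dw
    traj = []
    for k in range(steps):
        u = np.array([x[k], y_prev, 1.0])    # input, feedback, bias
        net = w @ u
        y = sigmoid(beta * net)
        # dy/dnet = beta * y * (1 - y); w[1] is the feedback weight
        pi = beta * y * (1.0 - y) * (u + w[1] * pi)
        w = w + eta * (d[k] - y) * pi        # gradient-descent update
        y_prev = y
        traj.append(w.copy())
    return np.array(traj)

rng = np.random.default_rng(0)
steps = 200
x = rng.standard_normal(steps)
d = 0.5 + 0.25 * np.sin(0.1 * np.arange(steps))  # arbitrary bounded target

beta, eta = 4.0, 0.1
w0 = rng.standard_normal(3) * 0.1

W = train_rtrl(beta, eta, w0, x, d, steps)                  # slope-beta RNN
V = train_rtrl(1.0, beta**2 * eta, beta * w0, x, d, steps)  # referent RNN

# If the equivalence holds, V(k) = beta * W(k) at every training step.
print(np.max(np.abs(V - beta * W)))  # ~0, up to floating-point rounding

Under these assumptions the two weight trajectories coincide exactly (up to rounding), so β is not an independent degree of freedom: any (β, η) pair can be mapped onto a referent network with unit slope, which is the reduction in the optimization task that the abstract describes.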

Cite

Text

Mandic and Chambers. "Relating the Slope of the Activation Function and the Learning Rate Within a Recurrent Neural Network." Neural Computation, 11:1069-1077, 1999. doi:10.1162/089976699300016340

Markdown

[Mandic and Chambers. "Relating the Slope of the Activation Function and the Learning Rate Within a Recurrent Neural Network." Neural Computation, 11:1069-1077, 1999.](https://mlanthology.org/neco/1999/mandic1999neco-relating/) doi:10.1162/089976699300016340

BibTeX

@article{mandic1999neco-relating,
  title     = {{Relating the Slope of the Activation Function and the Learning Rate Within a Recurrent Neural Network}},
  author    = {Mandic, Danilo P. and Chambers, Jonathon A.},
  journal   = {Neural Computation},
  year      = {1999},
  pages     = {1069--1077},
  doi       = {10.1162/089976699300016340},
  volume    = {11},
  url       = {https://mlanthology.org/neco/1999/mandic1999neco-relating/}
}