Evolino: Hybrid Neuroevolution/Optimal Linear Search for Sequence Learning

Abstract

Current neural network learning algorithms are limited in their ability to model non-linear dynamical systems. Most supervised gradient-based recurrent neural networks (RNNs) suffer from a vanishing error signal that prevents learning from inputs far in the past. Those that avoid this problem still struggle when there are numerous local minima. We introduce a general framework for sequence learning, EVOlution of recurrent systems with LINear outputs (Evolino). Evolino uses evolution to discover good RNN hidden node weights, while using methods such as linear regression or quadratic programming to compute optimal linear mappings from hidden state to output. Using the Long Short-Term Memory RNN architecture, the method is tested in three very different problem domains: 1) context-sensitive languages, 2) multiple superimposed sine waves, and 3) the Mackey-Glass system. Evolino performs exceptionally well across all tasks, whereas other methods show notable deficiencies on some of them.
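The division of labour described in the abstract (evolution searches the hidden weights, a linear solver fits the output mapping) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' implementation: it uses a plain tanh RNN instead of Long Short-Term Memory, a simple elitist loop with truncation selection and Gaussian mutation (the abstract does not specify the evolutionary algorithm), least-squares regression via `numpy.linalg.lstsq` as the linear method, and a toy next-step prediction task on two superimposed sine waves with arbitrarily chosen frequencies. The names `hidden_states`, `evaluate`, and `evolve` are hypothetical.

```python
import numpy as np

def hidden_states(w_in, w_rec, inputs):
    """Run a plain tanh RNN over a scalar input sequence; return all hidden states."""
    h = np.zeros(w_rec.shape[0])
    states = []
    for x in inputs:
        h = np.tanh(w_in * x + w_rec @ h)
        states.append(h)
    return np.array(states)  # shape: (timesteps, n_hidden)

def evaluate(genome, n_hidden, inputs, targets):
    """Decode a genome into hidden weights, fit the linear readout, return (mse, w_out)."""
    w_in = genome[:n_hidden]
    w_rec = genome[n_hidden:].reshape(n_hidden, n_hidden)
    states = hidden_states(w_in, w_rec, inputs)
    phi = np.hstack([states, np.ones((len(states), 1))])  # append a bias column
    # Output weights are not evolved: they are the least-squares optimum
    # for the hidden states this genome produces.
    w_out, *_ = np.linalg.lstsq(phi, targets, rcond=None)
    mse = float(np.mean((phi @ w_out - targets) ** 2))
    return mse, w_out

def evolve(inputs, targets, n_hidden=8, pop_size=20, generations=30, sigma=0.1, seed=0):
    """Assumed evolutionary loop: keep the best quarter, refill with mutated copies."""
    rng = np.random.default_rng(seed)
    n_genes = n_hidden + n_hidden * n_hidden
    population = rng.normal(0.0, 0.5, size=(pop_size, n_genes))
    history = []  # best training error per generation
    best_genome = population[0]
    for _ in range(generations):
        scores = np.array([evaluate(g, n_hidden, inputs, targets)[0] for g in population])
        order = np.argsort(scores)
        best_genome = population[order[0]].copy()
        history.append(float(scores[order[0]]))
        parents = population[order[: pop_size // 4]]  # elites survive unchanged
        picks = rng.integers(len(parents), size=pop_size - len(parents))
        children = parents[picks] + rng.normal(0.0, sigma, size=(len(picks), n_genes))
        population = np.vstack([parents, children])
    return best_genome, history

# Toy task (assumed): predict the next value of two superimposed sine waves.
t = np.arange(200)
signal = np.sin(0.2 * t) + np.sin(0.311 * t)
inputs, targets = signal[:-1], signal[1:]

best_genome, history = evolve(inputs, targets)
best_mse, w_out = evaluate(best_genome, 8, inputs, targets)
print(f"training MSE: first generation {history[0]:.5f}, last generation {history[-1]:.5f}")
```

Because the readout is refit for every candidate, the fitness of a genome measures only how useful its hidden dynamics are, which is the point of the hybrid scheme. In this sketch the surviving elites guarantee that the best training error never increases from one generation to the next.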

Cite

Text

Schmidhuber et al. "Evolino: Hybrid Neuroevolution/Optimal Linear Search for Sequence Learning." International Joint Conference on Artificial Intelligence, 2005.

Markdown

[Schmidhuber et al. "Evolino: Hybrid Neuroevolution/Optimal Linear Search for Sequence Learning." International Joint Conference on Artificial Intelligence, 2005.](https://mlanthology.org/ijcai/2005/schmidhuber2005ijcai-evolino/)

BibTeX

@inproceedings{schmidhuber2005ijcai-evolino,
  title     = {{Evolino: Hybrid Neuroevolution/Optimal Linear Search for Sequence Learning}},
  author    = {Schmidhuber, Jürgen and Wierstra, Daan and Gomez, Faustino J.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2005},
  pages     = {853--858},
  url       = {https://mlanthology.org/ijcai/2005/schmidhuber2005ijcai-evolino/}
}