Crossprop: Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks

Veeriah, Vivek; Zhang, Shangtong; Sutton, Richard S.

doi:10.1007/978-3-319-71249-9_27

ECML-PKDD 2017 pp. 445-459

Abstract

Representations are fundamental to artificial intelligence. The performance of a learning system depends on how the data is represented. Typically, these representations are hand-engineered using domain knowledge. The recent trend is to learn these representations through stochastic gradient descent in multi-layer neural networks, which is called backprop. Learning representations directly from the incoming data stream reduces the human labour involved in designing a learning system. More importantly, it allows a learning system to scale up to difficult tasks. In this paper, we introduce a new incremental learning algorithm called crossprop, which learns the incoming weights of hidden units based on the meta-gradient descent approach. This approach was previously introduced by Sutton (1992) and Schraudolph (1999) for learning step-sizes. The final update equation introduces an additional memory parameter for each of these weights and generalizes the backprop update equation. Our empirical experiments show that crossprop learns and reuses its feature representation while tackling new and unseen tasks, whereas backprop relearns a new feature representation.

Cite

Text

Veeriah et al. "Crossprop: Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017. doi:10.1007/978-3-319-71249-9_27

Markdown

[Veeriah et al. "Crossprop: Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017.](https://mlanthology.org/ecmlpkdd/2017/veeriah2017ecmlpkdd-crossprop/) doi:10.1007/978-3-319-71249-9_27

BibTeX

@inproceedings{veeriah2017ecmlpkdd-crossprop,
  title     = {{Crossprop: Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks}},
  author    = {Veeriah, Vivek and Zhang, Shangtong and Sutton, Richard S.},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2017},
  pages     = {445-459},
  doi       = {10.1007/978-3-319-71249-9_27},
  url       = {https://mlanthology.org/ecmlpkdd/2017/veeriah2017ecmlpkdd-crossprop/}
}