Proto-Predictive Representation of States with Simple Recurrent Temporal-Difference Networks

Makino, Takaki

doi:10.1145/1553374.1553464

Proto-Predictive Representation of States with Simple Recurrent Temporal-Difference Networks

Takaki Makino

ICML 2009 pp. 697-704

doi:10.1145/1553374.1553464 /icml/2009/makino2009icml-proto/

Abstract

We propose a new neural network architecture, called Simple Recurrent Temporal-Difference Networks (SR-TDNs), that learns to predict future observations in partially observable environments. SR-TDNs incorporate the structure of simple recurrent neural networks (SRNs) into temporal-difference (TD) networks to use proto-predictive representation of states. Although they deviate from the principle of predictive representations to ground state representations on observations, they follow the same learning strategy as TD networks, i.e., applying TD-learning to general predictions. Simulation experiments revealed that SR-TDNs can correctly represent states with incomplete set of core tests (question networks), and consequently, SR-TDNs have better on-line learning capacity than TD networks in various environments.

PDF ICML Semantic Scholar

Cite

Text

Makino. "Proto-Predictive Representation of States with Simple Recurrent Temporal-Difference Networks." International Conference on Machine Learning, 2009. doi:10.1145/1553374.1553464

Markdown

[Makino. "Proto-Predictive Representation of States with Simple Recurrent Temporal-Difference Networks." International Conference on Machine Learning, 2009.](https://mlanthology.org/icml/2009/makino2009icml-proto/) doi:10.1145/1553374.1553464

BibTeX

@inproceedings{makino2009icml-proto,
  title     = {{Proto-Predictive Representation of States with Simple Recurrent Temporal-Difference Networks}},
  author    = {Makino, Takaki},
  booktitle = {International Conference on Machine Learning},
  year      = {2009},
  pages     = {697-704},
  doi       = {10.1145/1553374.1553464},
  url       = {https://mlanthology.org/icml/2009/makino2009icml-proto/}
}