Navigating Through Temporal Difference

Abstract

Barto, Sutton and Watkins [2] introduced a grid task as a didactic ex(cid:173) ample of temporal difference planning and asynchronous dynamical pre>(cid:173) gramming. This paper considers the effects of changing the coding of the input stimulus, and demonstrates that the self-supervised learning of a particular form of hidden unit representation improves performance.

Cite

Text

Dayan. "Navigating Through Temporal Difference." Neural Information Processing Systems, 1990.

Markdown

[Dayan. "Navigating Through Temporal Difference." Neural Information Processing Systems, 1990.](https://mlanthology.org/neurips/1990/dayan1990neurips-navigating/)

BibTeX

@inproceedings{dayan1990neurips-navigating,
  title     = {{Navigating Through Temporal Difference}},
  author    = {Dayan, Peter},
  booktitle = {Neural Information Processing Systems},
  year      = {1990},
  pages     = {464-470},
  url       = {https://mlanthology.org/neurips/1990/dayan1990neurips-navigating/}
}