Navigating Through Temporal Difference
Abstract
Barto, Sutton and Watkins [2] introduced a grid task as a didactic example of temporal difference planning and asynchronous dynamical programming. This paper considers the effects of changing the coding of the input stimulus, and demonstrates that the self-supervised learning of a particular form of hidden unit representation improves performance.
Cite
Text
Dayan. "Navigating Through Temporal Difference." Neural Information Processing Systems, 1990.
Markdown
[Dayan. "Navigating Through Temporal Difference." Neural Information Processing Systems, 1990.](https://mlanthology.org/neurips/1990/dayan1990neurips-navigating/)
BibTeX
@inproceedings{dayan1990neurips-navigating,
title = {{Navigating Through Temporal Difference}},
author = {Dayan, Peter},
booktitle = {Neural Information Processing Systems},
year = {1990},
pages = {464--470},
url = {https://mlanthology.org/neurips/1990/dayan1990neurips-navigating/}
}