Off-Policy Temporal Difference Learning with Function Approximation
Cite
Text
Precup et al. "Off-Policy Temporal Difference Learning with Function Approximation." International Conference on Machine Learning, 2001.
Markdown
[Precup et al. "Off-Policy Temporal Difference Learning with Function Approximation." International Conference on Machine Learning, 2001.](https://mlanthology.org/icml/2001/precup2001icml-off/)
BibTeX
@inproceedings{precup2001icml-off,
  title = {{Off-Policy Temporal Difference Learning with Function Approximation}},
  author = {Precup, Doina and Sutton, Richard S. and Dasgupta, Sanjoy},
  booktitle = {International Conference on Machine Learning},
  year = {2001},
  pages = {417-424},
  url = {https://mlanthology.org/icml/2001/precup2001icml-off/}
}