Off-Policy Temporal Difference Learning with Function Approximation

Cite

Text

Precup et al. "Off-Policy Temporal Difference Learning with Function Approximation." International Conference on Machine Learning, 2001.

Markdown

[Precup et al. "Off-Policy Temporal Difference Learning with Function Approximation." International Conference on Machine Learning, 2001.](https://mlanthology.org/icml/2001/precup2001icml-off/)

BibTeX

@inproceedings{precup2001icml-off,
  title     = {{Off-Policy Temporal Difference Learning with Function Approximation}},
  author    = {Precup, Doina and Sutton, Richard S. and Dasgupta, Sanjoy},
  booktitle = {International Conference on Machine Learning},
  year      = {2001},
  pages     = {417-424},
  url       = {https://mlanthology.org/icml/2001/precup2001icml-off/}
}