Off-Policy TD(λ) with a True Online Equivalence
Cite
Text
van Hasselt et al. "Off-Policy TD(λ) with a True Online Equivalence." Conference on Uncertainty in Artificial Intelligence, 2014.
Markdown
[van Hasselt et al. "Off-Policy TD(λ) with a True Online Equivalence." Conference on Uncertainty in Artificial Intelligence, 2014.](https://mlanthology.org/uai/2014/vanhasselt2014uai-off/)
BibTeX
@inproceedings{vanhasselt2014uai-off,
title = {{Off-Policy TD($\lambda$) with a True Online Equivalence}},
author = {van Hasselt, Hado and Mahmood, Ashique Rupam and Sutton, Richard S.},
booktitle = {Conference on Uncertainty in Artificial Intelligence},
year = {2014},
pages = {330--339},
url = {https://mlanthology.org/uai/2014/vanhasselt2014uai-off/}
}