Baxter and Bartlett. "Reinforcement Learning in POMDP's via Direct Gradient Ascent." International Conference on Machine Learning, 2000.
Markdown
[Baxter and Bartlett. "Reinforcement Learning in POMDP's via Direct Gradient Ascent." International Conference on Machine Learning, 2000.](https://mlanthology.org/icml/2000/baxter2000icml-reinforcement/)
BibTeX
@inproceedings{baxter2000icml-reinforcement,
title = {{Reinforcement Learning in POMDP's via Direct Gradient Ascent}},
author = {Baxter, Jonathan and Bartlett, Peter L.},
booktitle = {International Conference on Machine Learning},
year = {2000},
pages = {41-48},
url = {https://mlanthology.org/icml/2000/baxter2000icml-reinforcement/}
}