Reinforcement Learning in POMDP's via Direct Gradient Ascent

Cite

Text

Baxter and Bartlett. "Reinforcement Learning in POMDP's via Direct Gradient Ascent." International Conference on Machine Learning, 2000.

Markdown

[Baxter and Bartlett. "Reinforcement Learning in POMDP's via Direct Gradient Ascent." International Conference on Machine Learning, 2000.](https://mlanthology.org/icml/2000/baxter2000icml-reinforcement/)

BibTeX

@inproceedings{baxter2000icml-reinforcement,
  title     = {{Reinforcement Learning in POMDP's via Direct Gradient Ascent}},
  author    = {Baxter, Jonathan and Bartlett, Peter L.},
  booktitle = {International Conference on Machine Learning},
  year      = {2000},
  pages     = {41-48},
  url       = {https://mlanthology.org/icml/2000/baxter2000icml-reinforcement/}
}