Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems

Cite

Text

Sato and Kobayashi. "Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems." International Conference on Machine Learning, 2001.

Markdown

[Sato and Kobayashi. "Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems." International Conference on Machine Learning, 2001.](https://mlanthology.org/icml/2001/sato2001icml-average/)

BibTeX

@inproceedings{sato2001icml-average,
  title     = {{Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems}},
  author    = {Sato, Makoto and Kobayashi, Shigenobu},
  booktitle = {International Conference on Machine Learning},
  year      = {2001},
  pages     = {473--480},
  url       = {https://mlanthology.org/icml/2001/sato2001icml-average/}
}