Sato and Kobayashi. "Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems." International Conference on Machine Learning, 2001.
Markdown
[Sato and Kobayashi. "Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems." International Conference on Machine Learning, 2001.](https://mlanthology.org/icml/2001/sato2001icml-average/)
BibTeX
@inproceedings{sato2001icml-average,
title = {{Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems}},
author = {Sato, Makoto and Kobayashi, Shigenobu},
booktitle = {International Conference on Machine Learning},
year = {2001},
pages = {473-480},
url = {https://mlanthology.org/icml/2001/sato2001icml-average/}
}