Sato, Makoto

1 publications

ICML 2001 Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems Makoto Sato, Shigenobu Kobayashi