Matsubara, Takamitsu

5 publications

MLJ 2023 Cautious Policy Programming: Exploiting KL Regularization for Monotonic Policy Improvement in Reinforcement Learning Lingwei Zhu, Takamitsu Matsubara
ACML 2021 Geometric Value Iteration: Dynamic Error-Aware KL Regularization for Reinforcement Learning Toshinori Kitamura, Lingwei Zhu, Takamitsu Matsubara
UAI 2014 Latent Kullback Leibler Control for Continuous-State Systems Using Probabilistic Graphical Models Takamitsu Matsubara, Vicenç Gómez, Hilbert J. Kappen
ACML 2010 Adaptive Step-Size Policy Gradients with Average Reward Metric Takamitsu Matsubara, Tetsuro Morimura, Jun Morimoto
AAAI 2005 Learning CPG Sensory Feedback with Policy Gradient for Biped Locomotion for a Full-Body Humanoid Gen Endo, Jun Morimoto, Takamitsu Matsubara, Jun Nakanishi, Gordon Cheng