Omura, Motoki

3 publications

ICLR 2026 Rethinking Policy Diversity in Ensemble Policy Gradient in Large-Scale Reinforcement Learning Naoki Shitanda, Motoki Omura, Tatsuya Harada, Takayuki Osa
ICML 2025 Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning Motoki Omura, Kazuki Ota, Takayuki Osa, Yusuke Mukuta, Tatsuya Harada
AAAI 2024 Symmetric Q-Learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning Motoki Omura, Takayuki Osa, Yusuke Mukuta, Tatsuya Harada