Shitanda, Naoki

1 publication

ICLR 2026. Rethinking Policy Diversity in Ensemble Policy Gradient in Large-Scale Reinforcement Learning. Naoki Shitanda, Motoki Omura, Tatsuya Harada, Takayuki Osa.