Nishimori, Soichiro

2 publications

TMLR 2026. On Symmetric Losses for Policy Optimization with Noisy Preferences. Soichiro Nishimori, Yu-Jie Zhang, Thanawat Lodkaew, Masashi Sugiyama.
NeurIPS 2023. Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning. Sotetsu Koyamada, Shinri Okano, Soichiro Nishimori, Yu Murata, Keigo Habara, Haruka Kita, Shin Ishii.