Uchibe, Eiji

5 publications

TMLR 2025 Evaluation of Best-of-N Sampling Strategies for Language Model Alignment Yuki Ichihara, Yuu Jinnai, Tetsuro Morimura, Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Eiji Uchibe
AISTATS 2019 Theoretical Analysis of Efficiency and Robustness of SoftMax and Gap-Increasing Operators in Reinforcement Learning Tadashi Kozuno, Eiji Uchibe, Kenji Doya
NeurIPS 2009 A Generalized Natural Actor-Critic Algorithm Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Kenji Doya
ECML-PKDD 2008 A New Natural Policy Gradient by Stationary Distribution Metric Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Kenji Doya
ICCV 1998 State Space Construction for Behavior Acquisition in Multi Agent Environments with Vision and Action Eiji Uchibe, Minoru Asada, Koh Hosoda