ML Anthology
Uchibe, Eiji
5 publications
TMLR 2025
Evaluation of Best-of-N Sampling Strategies for Language Model Alignment
Yuki Ichihara, Yuu Jinnai, Tetsuro Morimura, Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Eiji Uchibe
AISTATS 2019
Theoretical Analysis of Efficiency and Robustness of SoftMax and Gap-Increasing Operators in Reinforcement Learning
Tadashi Kozuno, Eiji Uchibe, Kenji Doya
NeurIPS 2009
A Generalized Natural Actor-Critic Algorithm
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Kenji Doya
ECML-PKDD 2008
A New Natural Policy Gradient by Stationary Distribution Metric
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Kenji Doya
ICCV 1998
State Space Construction for Behavior Acquisition in Multi Agent Environments with Vision and Action
Eiji Uchibe, Minoru Asada, Koh Hosoda