Liu, Yanbaihui

1 publications

TMLR 2025 LAPP: Large Language Model Feedback for Preference-Driven Reinforcement Learning Pingcheng Jian, Xiao Wei, Yanbaihui Liu, Samuel A. Moore, Michael M. Zavlanos, Boyuan Chen