Zhao, Dezhong

1 publication

NeurIPS 2025. PRIMT: Preference-Based Reinforcement Learning with Multimodal Feedback and Trajectory Synthesis from Foundation Models. Ruiqi Wang, Dezhong Zhao, Ziqin Yuan, Tianyu Shao, Guohua Chen, Dominic Kao, Sungeun Hong, Byung-Cheol Min