He, Yunzhong

3 publications

ICLR 2026 Chasing the Tail: Effective Rubric-Based Reward Modeling for Large Language Model Post-Training Junkai Zhang, Zihao Wang, Lin Gui, Swarnashree Mysore Sathyendra, Jaehwan Jeong, Victor Veitch, Wei Wang, Yunzhong He, Bing Liu, Lifeng Jin
ICLR 2026 Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains Anisha Gunjal, Anthony Wang, Elaine Lau, Vaskar Nath, Yunzhong He, Bing Liu, Sean M. Hendryx
CoRL 2017 Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics Nishant Shukla, Yunzhong He, Frank Chen, Song-Chun Zhu