ML Anthology
Authors
Search
About
He, Yunzhong
3 publications
ICLR
2026
Chasing the Tail: Effective Rubric-Based Reward Modeling for Large Language Model Post-Training
Junkai Zhang
,
Zihao Wang
,
Lin Gui
,
Swarnashree Mysore Sathyendra
,
Jaehwan Jeong
,
Victor Veitch
,
Wei Wang
,
Yunzhong He
,
Bing Liu
,
Lifeng Jin
ICLR
2026
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains
Anisha Gunjal
,
Anthony Wang
,
Elaine Lau
,
Vaskar Nath
,
Yunzhong He
,
Bing Liu
,
Sean M. Hendryx
CoRL
2017
Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics
Nishant Shukla
,
Yunzhong He
,
Frank Chen
,
Song-Chun Zhu