Deng, Yiyun

1 publications

ICLR 2026 Learn to Reason Efficiently with Adaptive Length-Based Reward Shaping Wei Liu, Ruochen Zhou, Yiyun Deng, Yuzhen Huang, Junteng Liu, Yuntian Deng, Yizhe Zhang, Junxian He