ML Anthology
Sun, Yiwen
1 publication
NeurIPS 2025
DAPO: Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage-Based Policy Optimization
Jiacai Liu, Chaojie Wang, Chris Yuhao Liu, Liang Zeng, Rui Yan, Yiwen Sun, Yang Liu