Zeng, Jinliang

1 publication

ICLR 2026. Boosting Multi-Domain Reasoning of LLMs via Curvature-Guided Policy Optimization. Xize Liang, Lin Yang, Jie Wang, Rui Liu, Yang Lu, Jinliang Zeng, Hanzhu Chen, Dong Li, Jianye Hao.