Peng, Yijie
6 publications
ICLR
2026
Half-Order Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer
Tao Ren, Zishi Zhang, Jinyang Jiang, Zehao Li, Shentao Qin, Yi Zheng, Guanghao Li, Qianyou Sun, Yan Li, Jiafeng Liang, Xinping Li, Yijie Peng ICLR
2026
RiskPO: Risk-Based Policy Optimization with Verifiable Reward for LLM Post-Training
Tao Ren, Jinyang Jiang, Hui Yang, Wan Tian, Minhao Zou, Guanghao Li, Zishi Zhang, Qinghao Wang, Shentao Qin, Yanjun Zhao, Rui Tao, Hui Shao, Yijie Peng