Peng, Yijie

6 publications

ICLR 2026 Half-Order Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer Tao Ren, Zishi Zhang, Jinyang Jiang, Zehao Li, Shentao Qin, Yi Zheng, Guanghao Li, Qianyou Sun, Yan Li, Jiafeng Liang, Xinping Li, Yijie Peng
ICLR 2026 RiskPO: Risk-Based Policy Optimization with Verifiable Reward for LLM Post-Training Tao Ren, Jinyang Jiang, Hui Yang, Wan Tian, Minhao Zou, Guanghao Li, Zishi Zhang, Qinghao Wang, Shentao Qin, Yanjun Zhao, Rui Tao, Hui Shao, Yijie Peng
TMLR 2025 CoNNect: Connectivity-Based Regularization for Structural Pruning of Neural Networks Christian P.C. Franssen, Jinyang Jiang, Yijie Peng, Bernd Heidergott
NeurIPS 2025 Exploring and Exploiting Model Uncertainty in Bayesian Optimization Zishi Zhang, Tao Ren, Yijie Peng
ICLR 2025 FLOPS: Forward Learning with OPtimal Sampling Tao Ren, Zishi Zhang, Jinyang Jiang, Guanghao Li, Zeliang Zhang, Mingqian Feng, Yijie Peng
ICLR 2024 One Forward Is Enough for Neural Network Training via Likelihood Ratio Method Jinyang Jiang, Zeliang Zhang, Chenliang Xu, Zhaofei Yu, Yijie Peng