Qin, Shentao

4 publications

ICLR 2026 Half-Order Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer Tao Ren, Zishi Zhang, Jinyang Jiang, Zehao Li, Shentao Qin, Yi Zheng, Guanghao Li, Qianyou Sun, Yan Li, Jiafeng Liang, Xinping Li, Yijie Peng
ICLR 2026 RiskPO: Risk-Based Policy Optimization with Verifiable Reward for LLM Post-Training Tao Ren, Jinyang Jiang, Hui Yang, Wan Tian, Minhao Zou, Guanghao Li, Zishi Zhang, Qinghao Wang, Shentao Qin, Yanjun Zhao, Rui Tao, Hui Shao, Yijie Peng
ICML 2025 LipsNet++: Unifying Filter and Controller into a Policy Network Xujie Song, Liangfa Chen, Tong Liu, Wenxuan Wang, Yinuo Wang, Shentao Qin, Yinsong Ma, Jingliang Duan, Shengbo Eben Li
ICML 2024 Feasible Reachable Policy Iteration Shentao Qin, Yujie Yang, Yao Mu, Jie Li, Wenjun Zou, Jingliang Duan, Shengbo Eben Li