Zhong, Shan

1 publication

NeurIPS 2025. GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning. Shutong Ding, Ke Hu, Shan Zhong, Haoyang Luo, Weinan Zhang, Jingya Wang, Jun Wang, Ye Shi.