Qi, Jiayin

2 publications

ICLR 2026 GEPO: Group Expectation Policy Optimization for Stable Heterogeneous Reinforcement Learning Han Zhang, RuibinZheng, Zexuan Yi, Zhuo Zhang, Hanyang Peng, Hui Wang, Jiayin Qi, Binxing Fang, Ruifeng Xu, Yue Yu
AAAI 2024 Wasserstein Differential Privacy Chengyi Yang, Jiayin Qi, Aimin Zhou