Cai, Qingpeng

15 publications

AAAI 2025 Flow Factorization for Efficient Generative Flow Networks Jiashun Liu, Chunhui Li, Cheng-Hao Liu, Dianbo Liu, Qingpeng Cai, Ling Pan
AAAI 2025 LLM-Powered User Simulator for Recommender System Zijian Zhang, Shuchang Liu, Ziru Liu, Rui Zhong, Qingpeng Cai, Xiangyu Zhao, Chunxu Zhang, Qidong Liu, Peng Jiang
ICML 2025 Random Policy Evaluation Uncovers Policies of Generative Flow Networks Haoran He, Emmanuel Bengio, Qingpeng Cai, Ling Pan
NeurIPS 2023 KuaiSim: A Comprehensive Simulator for Recommender Systems Kesen Zhao, Shuchang Liu, Qingpeng Cai, Xiangyu Zhao, Ziru Liu, Dong Zheng, Peng Jiang, Kun Gai
ICLR 2023 ResAct: Reinforcing Long-Term Engagement in Sequential Recommendation with Residual Actor Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Kun Gai, Bo An
NeurIPS 2023 State Regularized Policy Optimization on Data with Dynamics Shift Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An
CVPR 2022 BoostMIS: Boosting Medical Image Semi-Supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation Wenqiao Zhang, Lei Zhu, James Hallinan, Shengyu Zhang, Andrew Makmur, Qingpeng Cai, Beng Chin Ooi
AAAI 2022 MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning Wenqiao Zhang, Haochen Shi, Jiannan Guo, Shengyu Zhang, Qingpeng Cai, Juncheng Li, Sihui Luo, Yueting Zhuang
AAAI 2020 Deterministic Value-Policy Gradients Qingpeng Cai, Ling Pan, Pingzhong Tang
IJCAI 2020 Reinforcement Learning with Dynamic Boltzmann SoftMax Updates Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang
NeurIPS 2020 SoftMax Deep Double Deterministic Policy Gradients Ling Pan, Qingpeng Cai, Longbo Huang
AAAI 2019 A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang, Longbo Huang
AAAI 2019 Policy Optimization with Model-Based Explorations Feiyang Pan, Qingpeng Cai, Anxiang Zeng, Chun-Xiang Pan, Qing Da, Hua-Lin He, Qing He, Pingzhong Tang
AAAI 2018 Reinforcement Mechanism Design for Fraudulent Behaviour in E-Commerce Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang, Yiwei Zhang
IJCAI 2016 Facility Location with Minimax Envy Qingpeng Cai, Aris Filos-Ratsikas, Pingzhong Tang