ML Anthology
Authors
Search
About
Cai, Qingpeng
15 publications
AAAI
2025
Flow Factorization for Efficient Generative Flow Networks
Jiashun Liu
,
Chunhui Li
,
Cheng-Hao Liu
,
Dianbo Liu
,
Qingpeng Cai
,
Ling Pan
AAAI
2025
LLM-Powered User Simulator for Recommender System
Zijian Zhang
,
Shuchang Liu
,
Ziru Liu
,
Rui Zhong
,
Qingpeng Cai
,
Xiangyu Zhao
,
Chunxu Zhang
,
Qidong Liu
,
Peng Jiang
ICML
2025
Random Policy Evaluation Uncovers Policies of Generative Flow Networks
Haoran He
,
Emmanuel Bengio
,
Qingpeng Cai
,
Ling Pan
NeurIPS
2023
KuaiSim: A Comprehensive Simulator for Recommender Systems
Kesen Zhao
,
Shuchang Liu
,
Qingpeng Cai
,
Xiangyu Zhao
,
Ziru Liu
,
Dong Zheng
,
Peng Jiang
,
Kun Gai
ICLR
2023
ResAct: Reinforcing Long-Term Engagement in Sequential Recommendation with Residual Actor
Wanqi Xue
,
Qingpeng Cai
,
Ruohan Zhan
,
Dong Zheng
,
Peng Jiang
,
Kun Gai
,
Bo An
NeurIPS
2023
State Regularized Policy Optimization on Data with Dynamics Shift
Zhenghai Xue
,
Qingpeng Cai
,
Shuchang Liu
,
Dong Zheng
,
Peng Jiang
,
Kun Gai
,
Bo An
CVPR
2022
BoostMIS: Boosting Medical Image Semi-Supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation
Wenqiao Zhang
,
Lei Zhu
,
James Hallinan
,
Shengyu Zhang
,
Andrew Makmur
,
Qingpeng Cai
,
Beng Chin Ooi
AAAI
2022
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning
Wenqiao Zhang
,
Haochen Shi
,
Jiannan Guo
,
Shengyu Zhang
,
Qingpeng Cai
,
Juncheng Li
,
Sihui Luo
,
Yueting Zhuang
AAAI
2020
Deterministic Value-Policy Gradients
Qingpeng Cai
,
Ling Pan
,
Pingzhong Tang
IJCAI
2020
Reinforcement Learning with Dynamic Boltzmann SoftMax Updates
Ling Pan
,
Qingpeng Cai
,
Qi Meng
,
Wei Chen
,
Longbo Huang
NeurIPS
2020
SoftMax Deep Double Deterministic Policy Gradients
Ling Pan
,
Qingpeng Cai
,
Longbo Huang
AAAI
2019
A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems
Ling Pan
,
Qingpeng Cai
,
Zhixuan Fang
,
Pingzhong Tang
,
Longbo Huang
AAAI
2019
Policy Optimization with Model-Based Explorations
Feiyang Pan
,
Qingpeng Cai
,
Anxiang Zeng
,
Chun-Xiang Pan
,
Qing Da
,
Hua-Lin He
,
Qing He
,
Pingzhong Tang
AAAI
2018
Reinforcement Mechanism Design for Fraudulent Behaviour in E-Commerce
Qingpeng Cai
,
Aris Filos-Ratsikas
,
Pingzhong Tang
,
Yiwei Zhang
IJCAI
2016
Facility Location with Minimax Envy
Qingpeng Cai
,
Aris Filos-Ratsikas
,
Pingzhong Tang