Li, Qingyang

4 publications

ICLRW 2025 Towards Comprehensive Preference Data Collection for Reward Modeling Yulan Hu, Qingyang Li, Sheng Ouyang, Ge Chen, Jinman Zhao, Yong Liu
NeurIPS 2021 Offline Model-Based Adaptable Policy Learning Xiong-Hui Chen, Yang Yu, Qingyang Li, Fan-Ming Luo, Zhiwei Qin, Wenjie Shang, Jieping Ye
MLJ 2021 Partially Observable Environment Estimation with Uplift Inference for Reinforcement Learning Based Recommendation Wenjie Shang, Qingyang Li, Zhiwei Qin, Yang Yu, Yiping Meng, Jieping Ye
ICML 2014 A Highly Scalable Parallel Algorithm for Isotropic Total Variation Models Jie Wang, Qingyang Li, Sen Yang, Wei Fan, Peter Wonka, Jieping Ye