Zheng, Qinqing

15 publications

NeurIPS 2025 D1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning Siyan Zhao, Devaansh Gupta, Qinqing Zheng, Aditya Grover
ICLR 2025 Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces DiJia Su, Sainbayar Sukhbaatar, Michael Rabbat, Yuandong Tian, Qinqing Zheng
ICML 2025 Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning DiJia Su, Hanlin Zhu, Yingchen Xu, Jiantao Jiao, Yuandong Tian, Qinqing Zheng
ICLR 2024 Dual RL: Unification and New Methods for Reinforcement and Imitation Learning Harshit Sikchi, Qinqing Zheng, Amy Zhang, Scott Niekum
ICLR 2023 Latent State Marginalization as a Low-Cost Approach for Improving Exploration Dinghuai Zhang, Aaron Courville, Yoshua Bengio, Qinqing Zheng, Amy Zhang, Ricky T. Q. Chen
JMLR 2023 Minimax Estimation for Personalized Federated Learning: An Alternative Between FedAvg and Local Training? Shuxiao Chen, Qinqing Zheng, Qi Long, Weijie J. Su
ICML 2023 Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories Qinqing Zheng, Mikael Henaff, Brandon Amos, Aditya Grover
ICLRW 2023 Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories Qinqing Zheng, Mikael Henaff, Brandon Amos, Aditya Grover
NeurIPSW 2022 ConserWeightive Behavioral Cloning for Reliable Offline Reinforcement Learning Tung Nguyen, Qinqing Zheng, Aditya Grover
ICML 2022 Online Decision Transformer Qinqing Zheng, Amy Zhang, Aditya Grover
AISTATS 2021 Federated f-Differential Privacy Qinqing Zheng, Shuxiao Chen, Qi Long, Weijie Su
ICML 2021 Near-Optimal Confidence Sequences for Bounded Random Variables Arun K. Kuchibhotla, Qinqing Zheng
ICML 2020 Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion Qinqing Zheng, Jinshuo Dong, Qi Long, Weijie Su
NeurIPS 2015 A Convergent Gradient Descent Algorithm for Rank Minimization and Semidefinite Programming from Random Linear Measurements Qinqing Zheng, John Lafferty
NeurIPS 2015 Interpolating Convex and Non-Convex Tensor Decompositions via the Subspace Norm Qinqing Zheng, Ryota Tomioka