ML Anthology
Authors
Search
About
Zheng, Qinqing
15 publications
NeurIPS
2025
D1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning
Siyan Zhao
,
Devaansh Gupta
,
Qinqing Zheng
,
Aditya Grover
ICLR
2025
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
DiJia Su
,
Sainbayar Sukhbaatar
,
Michael Rabbat
,
Yuandong Tian
,
Qinqing Zheng
ICML
2025
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
Dijia Su
,
Hanlin Zhu
,
Yingchen Xu
,
Jiantao Jiao
,
Yuandong Tian
,
Qinqing Zheng
ICLR
2024
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit Sikchi
,
Qinqing Zheng
,
Amy Zhang
,
Scott Niekum
ICLR
2023
Latent State Marginalization as a Low-Cost Approach for Improving Exploration
Dinghuai Zhang
,
Aaron Courville
,
Yoshua Bengio
,
Qinqing Zheng
,
Amy Zhang
,
Ricky T. Q. Chen
JMLR
2023
Minimax Estimation for Personalized Federated Learning: An Alternative Between FedAvg and Local Training?
Shuxiao Chen
,
Qinqing Zheng
,
Qi Long
,
Weijie J. Su
ICML
2023
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
Qinqing Zheng
,
Mikael Henaff
,
Brandon Amos
,
Aditya Grover
ICLRW
2023
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
Qinqing Zheng
,
Mikael Henaff
,
Brandon Amos
,
Aditya Grover
NeurIPSW
2022
ConserWeightive Behavioral Cloning for Reliable Offline Reinforcement Learning
Tung Nguyen
,
Qinqing Zheng
,
Aditya Grover
ICML
2022
Online Decision Transformer
Qinqing Zheng
,
Amy Zhang
,
Aditya Grover
AISTATS
2021
Federated F-Differential Privacy
Qinqing Zheng
,
Shuxiao Chen
,
Qi Long
,
Weijie Su
ICML
2021
Near-Optimal Confidence Sequences for Bounded Random Variables
Arun K Kuchibhotla
,
Qinqing Zheng
ICML
2020
Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion
Qinqing Zheng
,
Jinshuo Dong
,
Qi Long
,
Weijie Su
NeurIPS
2015
A Convergent Gradient Descent Algorithm for Rank Minimization and Semidefinite Programming from Random Linear Measurements
Qinqing Zheng
,
John Lafferty
NeurIPS
2015
Interpolating Convex and Non-Convex Tensor Decompositions via the Subspace Norm
Qinqing Zheng
,
Ryota Tomioka