Gu, Quanquan

235 publications

TMLR 2026 Multi-Step Alignment as Markov Games: An Optimistic Online Mirror Descent Approach with Convergence Guarantees Yongtao Wu, Luca Viano, Kimon Antonakopoulos, Yihang Chen, Zhenyu Zhu, Quanquan Gu, Volkan Cevher
ICML 2025 An All-Atom Generative Model for Designing Protein Complexes Ruizhe Chen, Dongyu Xue, Xiangxin Zhou, Zaixiang Zheng, Xiangxiang Zeng, Quanquan Gu
ICML 2025 Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment Yifan Zhang, Ge Zhang, Yue Wu, Kangping Xu, Quanquan Gu
ICLRW 2025 Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment Yifan Zhang, Ge Zhang, Yue Wu, Kangping Xu, Quanquan Gu
ICLR 2025 Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration Heyang Zhao, Xingrui Yu, David Mark Bossens, Ivor Tsang, Quanquan Gu
ICLR 2025 Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis Zikun Zhang, Zixiang Chen, Quanquan Gu
ICLR 2025 CryoFM: A Flow-Based Foundation Model for Cryo-EM Densities Yi Zhou, Yilai Li, Jing Yuan, Quanquan Gu
ICLR 2025 DPLM-2: A Multimodal Diffusion Protein Language Model Xinyou Wang, Zaixiang Zheng, Fei Ye, Dongyu Xue, Shujian Huang, Quanquan Gu
TMLR 2025 Decomposed Direct Preference Optimization for Structure-Based Drug Design Xiwei Cheng, Xiangxin Zhou, Yuwei Yang, Yu Bao, Quanquan Gu
ICML 2025 Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling Xiangxin Zhou, Mingyu Li, Yi Xiao, Jiahan Li, Dongyu Xue, Zaixiang Zheng, Jianzhu Ma, Quanquan Gu
ICML 2025 Elucidating the Design Space of Multimodal Protein Language Models Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang, Dongyu Xue, Fei Ye, Shujian Huang, Zaixiang Zheng, Quanquan Gu
ICLR 2025 Energy-Weighted Flow Matching for Offline Reinforcement Learning Shiyuan Zhang, Weitong Zhang, Quanquan Gu
ICLRW 2025 Game-Theoretic Regularized Self-Play Alignment of Large Language Models Xiaohang Tang, Sangwoong Yoon, Seongho Son, Huizhuo Yuan, Quanquan Gu, Ilija Bogunovic
ICML 2025 Global Convergence and Rich Feature Learning in $l$-Layer Infinite-Width Neural Networks Under $μ$ Parametrization Zixiang Chen, Greg Yang, Qingyue Zhao, Quanquan Gu
TMLR 2025 Guided Discrete Diffusion for Electronic Health Record Generation Jun Han, Zixiang Chen, Yongqian Li, Yiwen Kou, Eran Halperin, Robert E. Tillman, Quanquan Gu
CVPR 2025 LLaVA-Critic: Learning to Evaluate Multimodal Models Tianyi Xiong, Xiyao Wang, Dong Guo, Qinghao Ye, Haoqi Fan, Quanquan Gu, Heng Huang, Chunyuan Li
ICML 2025 Logarithmic Regret for Online KL-Regularized Reinforcement Learning Heyang Zhao, Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang
ICML 2025 MARS: Unleashing the Power of Variance Reduction for Training Large Models Huizhuo Yuan, Yifeng Liu, Shuang Wu, Zhou Xun, Quanquan Gu
ICML 2025 Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance Linxi Zhao, Yihe Deng, Weitong Zhang, Quanquan Gu
ICML 2025 Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback Qiwei Di, Jiafan He, Quanquan Gu
AISTATS 2025 On the Power of Multitask Representation Learning with Gradient Descent Qiaobo Li, Zixiang Chen, Yihe Deng, Yiwen Kou, Yuan Cao, Quanquan Gu
ICLR 2025 ProteinBench: A Holistic Evaluation of Protein Foundation Models Fei Ye, Zaixiang Zheng, Dongyu Xue, Yuning Shen, Lihao Wang, Yiming Ma, Yan Wang, Xinyou Wang, Xiangxin Zhou, Quanquan Gu
ICML 2025 Ranking with Multiple Oracles: From Weak to Strong Stochastic Transitivity Tao Jin, Yue Wu, Quanquan Gu, Farzad Farnoud
TMLR 2025 Reinforcement Learning from Human Feedback with Active Queries Kaixuan Ji, Jiafan He, Quanquan Gu
ICLR 2025 Self-Play Preference Optimization for Language Model Alignment Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu
NeurIPS 2025 Sharp Analysis for KL-Regularized Contextual Bandits and RLHF Heyang Zhao, Chenlu Ye, Quanquan Gu, Tong Zhang
NeurIPS 2025 Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression Yuning Shen, Lihao Wang, Huizhuo Yuan, Yan Wang, Bangji Yang, Quanquan Gu
NeurIPS 2025 Tensor Product Attention Is All You Need Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Zhen Qin, Yang Yuan, Quanquan Gu, Andrew C Yao
ICLR 2025 Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers Runjia Li, Qiwei Di, Quanquan Gu
NeurIPS 2025 Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits Xuheng Li, Quanquan Gu
NeurIPS 2024 A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation Heyang Zhao, Jiafan He, Quanquan Gu
NeurIPSW 2024 Accelerated Preference Optimization for Large Language Model Alignment Jiafan He, Huizhuo Yuan, Quanquan Gu
NeurIPS 2024 Achieving Constant Regret in Linear Markov Decision Processes Weitong Zhang, Zhiyuan Fan, Jiafan He, Quanquan Gu
NeurIPS 2024 Antigen-Specific Antibody Design via Direct Energy-Based Preference Optimization Xiangxin Zhou, Dongyu Xue, Ruizhe Chen, Zaixiang Zheng, Liang Wang, Quanquan Gu
ICMLW 2024 Antigen-Specific Antibody Design via Direct Energy-Based Preference Optimization Xiangxin Zhou, Dongyu Xue, Ruizhe Chen, Zaixiang Zheng, Liang Wang, Quanquan Gu
ICML 2024 Borda Regret Minimization for Generalized Linear Dueling Bandits Yue Wu, Tao Jin, Qiwei Di, Hao Lou, Farzad Farnoud, Quanquan Gu
NeurIPSW 2024 CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing Chen Yang, Chenyang Zhao, Quanquan Gu, Dongruo Zhou
ICLR 2024 DecompOpt: Controllable and Decomposed Diffusion Models for Structure-Based Molecular Optimization Xiangxin Zhou, Xiwei Cheng, Yuwei Yang, Yu Bao, Liang Wang, Quanquan Gu
ICML 2024 Diffusion Language Models Are Versatile Protein Learners Xinyou Wang, Zaixiang Zheng, Fei Ye, Dongyu Xue, Shujian Huang, Quanquan Gu
NeurIPS 2024 Enhancing Large Vision Language Models with Self-Training on Image Comprehension Yihe Deng, Pan Lu, Fan Yin, Ziniu Hu, Sheng Shen, Quanquan Gu, James Zou, Kai-Wei Chang, Wei Wang
NeurIPS 2024 Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu
ICML 2024 Feel-Good Thompson Sampling for Contextual Dueling Bandits Xuheng Li, Heyang Zhao, Quanquan Gu
ICLR 2024 Horizon-Free Reinforcement Learning in Adversarial Linear Mixture MDPs Kaixuan Ji, Qingyue Zhao, Jiafan He, Weitong Zhang, Quanquan Gu
ICLR 2024 How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression? Jingfeng Wu, Difan Zou, Zixiang Chen, Vladimir Braverman, Quanquan Gu, Peter Bartlett
NeurIPSW 2024 Imbalance-Regularized LoRA: A Plug-and-Play Method for Improving Fine-Tuning of Foundation Models Zhenyu Zhu, Yongtao Wu, Quanquan Gu, Volkan Cevher
NeurIPS 2024 Matching the Statistical Query Lower Bound for $k$-Sparse Parity Problems with Sign Stochastic Gradient Descent Yiwen Kou, Zixiang Chen, Quanquan Gu, Sham M. Kakade
NeurIPSW 2024 Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance Linxi Zhao, Yihe Deng, Weitong Zhang, Quanquan Gu
NeurIPSW 2024 Multi-Step Preference Optimization via Two-Player Markov Games Yongtao Wu, Luca Viano, Yihang Chen, Zhenyu Zhu, Quanquan Gu, Volkan Cevher
TMLR 2024 On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization Dongruo Zhou, Jinghui Chen, Yuan Cao, Ziyan Yang, Quanquan Gu
ICLR 2024 Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning Qiwei Di, Heyang Zhao, Jiafan He, Quanquan Gu
ICML 2024 Position: TrustLLM: Trustworthiness in Large Language Models Yue Huang, Lichao Sun, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Hanchi Sun, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric P. Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang, Huan Zhang, Huaxiu Yao, Manolis Kellis, Marinka Zitnik, Meng Jiang, Mohit Bansal, James Zou, Jian Pei, Jian Liu, Jianfeng Gao, Jiawei Han, Jieyu Zhao, Jiliang Tang, Jindong Wang, Joaquin Vanschoren, John Mitchell, Kai Shu, Kaidi Xu, Kai-Wei Chang, Lifang He, Lifu Huang, Michael Backes, Neil Zhenqiang Gong, Philip S. Yu, Pin-Yu Chen, Quanquan Gu, Ran Xu, Rex Ying, Shuiwang Ji, Suman Jana, Tianlong Chen, Tianming Liu, Tianyi Zhou, William Yang Wang, Xiang Li, Xiangliang Zhang, Xiao Wang, Xing Xie, Xun Chen, Xuyu Wang, Yan Liu, Yanfang Ye, Yinzhi Cao, Yong Chen, Yue Zhao
TMLR 2024 Pre-Trained Hypergraph Convolutional Neural Networks with Self-Supervised Learning Yihe Deng, Ruochi Zhang, Pan Xu, Jian Ma, Quanquan Gu
ICML 2024 Protein Conformation Generation via Force-Guided SE(3) Diffusion Models Yan Wang, Lihao Wang, Yuning Shen, Yiqun Wang, Huizhuo Yuan, Yue Wu, Quanquan Gu
UAI 2024 Pure Exploration in Asynchronous Federated Bandits Zichen Wang, Chuanhao Li, Chenyu Song, Lianghui Wang, Quanquan Gu, Huazheng Wang
ICLR 2024 Risk Bounds of Accelerated SGD for Overparameterized Linear Regression Xuheng Li, Yihe Deng, Jingfeng Wu, Dongruo Zhou, Quanquan Gu
ICML 2024 Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu
NeurIPS 2024 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu
ICMLW 2024 Self-Play Preference Optimization for Language Model Alignment Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu
NeurIPSW 2024 Self-Play Preference Optimization for Language Model Alignment Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu
NeurIPSW 2024 Sharp Analysis for KL-Regularized Contextual Bandits and RLHF Heyang Zhao, Chenlu Ye, Quanquan Gu, Tong Zhang
ICML 2024 Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption Chenlu Ye, Jiafan He, Quanquan Gu, Tong Zhang
ICML 2024 Uncertainty-Aware Reward-Free Exploration with General Function Approximation Junkai Zhang, Weitong Zhang, Dongruo Zhou, Quanquan Gu
ICLR 2024 Understanding Transferable Representation Learning and Zero-Shot Transfer in CLIP Zixiang Chen, Yihe Deng, Yuanzhi Li, Quanquan Gu
ICLR 2024 Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits Qiwei Di, Tao Jin, Yue Wu, Heyang Zhao, Farzad Farnoud, Quanquan Gu
ICLR 2023 A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning Zixiang Chen, Chris Junchi Li, Huizhuo Yuan, Quanquan Gu, Michael Jordan
UAI 2023 Benign Overfitting in Adversarially Robust Linear Classification Jinghui Chen, Yuan Cao, Quanquan Gu
ICML 2023 Benign Overfitting in Two-Layer ReLU Convolutional Neural Networks Yiwen Kou, Zixiang Chen, Yuanzhou Chen, Quanquan Gu
JMLR 2023 Benign Overfitting of Constant-Stepsize SGD for Linear Regression Difan Zou, Jingfeng Wu, Vladimir Braverman, Quanquan Gu, Sham M. Kakade
ICMLW 2023 Borda Regret Minimization for Generalized Linear Dueling Bandits Yue Wu, Tao Jin, Qiwei Di, Hao Lou, Farzad Farnoud, Quanquan Gu
NeurIPSW 2023 Causal Graph ODE: Continuous Treatment Effect Modeling in Multi-Agent Dynamical Systems Zijie Huang, Jeehyun Hwang, Junkai Zhang, Jinwoo Baik, Weitong Zhang, Dominik Wodarz, Yizhou Sun, Quanquan Gu, Wei Wang
ICML 2023 Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation Yifei Min, Jiafan He, Tianhao Wang, Quanquan Gu
ICML 2023 Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang
NeurIPS 2023 Corruption-Robust Offline Reinforcement Learning with General Function Approximation Chenlu Ye, Rui Yang, Quanquan Gu, Tong Zhang
NeurIPSW 2023 CryoSTAR: Cryo-EM Heterogeneous Reconstruction of Atomic Models with Structural Regularization Yi Zhou, Yilai Li, Jing Yuan, Fei Ye, Quanquan Gu
ICML 2023 DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design Jiaqi Guan, Xiangxin Zhou, Yuwei Yang, Yu Bao, Jian Peng, Jianzhu Ma, Qiang Liu, Liang Wang, Quanquan Gu
ICMLW 2023 DiffMol: 3D Structured Molecule Generation with Discrete Denoising Diffusion Probabilistic Models Weitong Zhang, Xiaoyun Wang, Justin Smith, Joe Eaton, Brad Rees, Quanquan Gu
UAI 2023 Efficient Privacy-Preserving Stochastic Nonconvex Optimization Lingxiao Wang, Bargav Jayaraman, David Evans, Quanquan Gu
ICML 2023 Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron Jingfeng Wu, Difan Zou, Zixiang Chen, Vladimir Braverman, Quanquan Gu, Sham M. Kakade
ICLR 2023 How Does Semi-Supervised Learning with Pseudo-Labelers Work? a Case Study Yiwen Kou, Zixiang Chen, Yuan Cao, Quanquan Gu
NeurIPS 2023 Implicit Bias of Gradient Descent for Two-Layer ReLU and Leaky ReLU Networks on Nearly-Orthogonal Data Yiwen Kou, Zixiang Chen, Quanquan Gu
NeurIPSW 2023 MoleculeGPT: Instruction Following Large Language Models for Molecular Property Prediction Weitong Zhang, Xiaoyun Wang, Weili Nie, Joe Eaton, Brad Rees, Quanquan Gu
ICML 2023 Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path Qiwei Di, Jiafan He, Dongruo Zhou, Quanquan Gu
ICML 2023 Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes Jiafan He, Heyang Zhao, Dongruo Zhou, Quanquan Gu
ICML 2023 Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization Chris Junchi Li, Huizhuo Yuan, Gauthier Gidel, Quanquan Gu, Michael Jordan
ICML 2023 On the Interplay Between Misspecification and Sub-Optimality Gap in Linear Contextual Bandits Weitong Zhang, Jiafan He, Zhiyuan Fan, Quanquan Gu
NeurIPS 2023 Optimal Extragradient-Based Algorithms for Stochastic Variational Inequalities with Separable Structure Angela Yuan, Chris Junchi Li, Gauthier Gidel, Michael I. Jordan, Quanquan Gu, Simon S Du
ICML 2023 Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs Junkai Zhang, Weitong Zhang, Quanquan Gu
ICML 2023 Optimal Online Generalized Linear Regression with Stochastic Noise and Its Application to Heteroscedastic Bandits Heyang Zhao, Dongruo Zhou, Jiafan He, Quanquan Gu
ICML 2023 Personalized Federated Learning Under Mixture of Distributions Yue Wu, Shuaicheng Zhang, Wenchao Yu, Yanchi Liu, Quanquan Gu, Dawei Zhou, Haifeng Chen, Wei Cheng
NeurIPSW 2023 Risk Bounds of Accelerated SGD for Overparameterized Linear Regression Xuheng Li, Yihe Deng, Jingfeng Wu, Dongruo Zhou, Quanquan Gu
NeurIPS 2023 Robust Learning with Progressive Data Expansion Against Spurious Correlation Yihe Deng, Yu Yang, Baharan Mirzasoleiman, Quanquan Gu
ICML 2023 Structure-Informed Language Models Are Protein Designers Zaixiang Zheng, Yifan Deng, Dongyu Xue, Yi Zhou, Fei Ye, Quanquan Gu
ICML 2023 The Benefits of Mixup for Feature Learning Difan Zou, Yuan Cao, Yuanzhi Li, Quanquan Gu
COLT 2023 The Implicit Bias of Batch Normalization in Linear Models and Two-Layer Linear Convolutional Neural Networks Yuan Cao, Difan Zou, Yuanzhi Li, Quanquan Gu
ICLR 2023 Understanding Train-Validation Split in Meta-Learning with Neural Networks Xinzhe Zuo, Zixiang Chen, Huaxiu Yao, Yuan Cao, Quanquan Gu
NeurIPSW 2023 Understanding Transferable Representation Learning and Zero-Shot Transfer in CLIP Zixiang Chen, Yihe Deng, Yuanzhi Li, Quanquan Gu
ICLR 2023 Understanding the Generalization of Adam in Learning Neural Networks with Proper Regularization Difan Zou, Yuan Cao, Yuanzhi Li, Quanquan Gu
UAI 2023 Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension Yue Wu, Jiafan He, Quanquan Gu
COLT 2023 Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency Heyang Zhao, Jiafan He, Dongruo Zhou, Tong Zhang, Quanquan Gu
NeurIPS 2023 Why Does Sharpness-Aware Minimization Generalize Better than SGD? Zixiang Chen, Junkai Zhang, Yiwen Kou, Xiangning Chen, Cho-Jui Hsieh, Quanquan Gu
AISTATS 2022 Adaptive Sampling for Heterogeneous Rank Aggregation from Noisy Pairwise Comparisons Yue Wu, Tao Jin, Hao Lou, Pan Xu, Farzad Farnoud, Quanquan Gu
AISTATS 2022 Near-Optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs Jiafan He, Dongruo Zhou, Quanquan Gu
AISTATS 2022 Nearly Minimax Optimal Regret for Learning Infinite-Horizon Average-Reward MDPs with Linear Function Approximation Yue Wu, Dongruo Zhou, Quanquan Gu
AISTATS 2022 Self-Training Converts Weak Learners to Strong Learners in Mixture Models Spencer Frei, Difan Zou, Zixiang Chen, Quanquan Gu
NeurIPSW 2022 A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning Zixiang Chen, Chris Junchi Li, Angela Yuan, Quanquan Gu, Michael Jordan
NeurIPS 2022 A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits Jiafan He, Tianhao Wang, Yifei Min, Quanquan Gu
NeurIPS 2022 Active Ranking Without Strong Stochastic Transitivity Hao Lou, Tao Jin, Yue Wu, Pan Xu, Quanquan Gu, Farzad Farnoud
ALT 2022 Almost Optimal Algorithms for Two-Player Zero-Sum Linear Mixture Markov Games Zixiang Chen, Dongruo Zhou, Quanquan Gu
NeurIPS 2022 Benign Overfitting in Two-Layer Convolutional Neural Networks Yuan Cao, Zixiang Chen, Misha Belkin, Quanquan Gu
NeurIPS 2022 Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs Dongruo Zhou, Quanquan Gu
ICML 2022 Dimension-Free Complexity Bounds for High-Order Nonconvex Finite-Sum Optimization Dongruo Zhou, Quanquan Gu
AAAI 2022 Efficient Robust Training via Backward Smoothing Jinghui Chen, Yu Cheng, Zhe Gan, Quanquan Gu, Jingjing Liu
ALT 2022 Faster Perturbed Stochastic Gradient Methods for Finding Local Minima Zixiang Chen, Dongruo Zhou, Quanquan Gu
ICML 2022 Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression Jingfeng Wu, Difan Zou, Vladimir Braverman, Quanquan Gu, Sham Kakade
ICLR 2022 Learning Neural Contextual Bandits Through Perturbed Rewards Yiling Jia, Weitong Zhang, Dongruo Zhou, Quanquan Gu, Hongning Wang
ICML 2022 Learning Stochastic Shortest Path with Linear Function Approximation Yifei Min, Jiafan He, Tianhao Wang, Quanquan Gu
NeurIPS 2022 Learning Two-Player Markov Games: Neural Function Approximation and Correlated Equilibrium Chris Junchi Li, Dongruo Zhou, Quanquan Gu, Michael I. Jordan
ACML 2022 Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes Chonghua Liao, Jiafan He, Quanquan Gu
NeurIPS 2022 Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions Jiafan He, Dongruo Zhou, Tong Zhang, Quanquan Gu
ICLR 2022 Neural Contextual Bandits with Deep Representation and Shallow Exploration Pan Xu, Zheng Wen, Handong Zhao, Quanquan Gu
ICLR 2022 On the Convergence of Certified Robust Training with Interval Bound Propagation Yihan Wang, Zhouxing Shi, Quanquan Gu, Cho-Jui Hsieh
ICML 2022 On the Sample Complexity of Learning Infinite-Horizon Discounted Linear Kernel MDPs Yuanzhou Chen, Jiafan He, Quanquan Gu
NeurIPS 2022 Risk Bounds of Multi-Pass SGD for Least Squares in the Interpolation Regime Difan Zou, Jingfeng Wu, Vladimir Braverman, Quanquan Gu, Sham Kakade
NeurIPS 2022 The Power and Limitation of Pretraining-Finetuning for Linear Regression Under Covariate Shift Jingfeng Wu, Difan Zou, Vladimir Braverman, Quanquan Gu, Sham Kakade
NeurIPS 2022 Towards Understanding the Mixture-of-Experts Layer in Deep Learning Zixiang Chen, Yihe Deng, Yue Wu, Quanquan Gu, Yuanzhi Li
ICML 2021 Agnostic Learning of Halfspaces with Gradient Descent via Soft Margins Spencer Frei, Yuan Cao, Quanquan Gu
ICML 2021 Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits Tianyuan Jin, Jing Tang, Pan Xu, Keke Huang, Xiaokui Xiao, Quanquan Gu
COLT 2021 Benign Overfitting of Constant-Stepsize SGD for Linear Regression Difan Zou, Jingfeng Wu, Vladimir Braverman, Quanquan Gu, Sham Kakade
ICLR 2021 Direction Matters: On the Implicit Bias of Stochastic Gradient Descent with Moderate Learning Rate Jingfeng Wu, Difan Zou, Vladimir Braverman, Quanquan Gu
NeurIPS 2021 Do Wider Neural Networks Really Help Adversarial Robustness? Boxi Wu, Jinghui Chen, Deng Cai, Xiaofei He, Quanquan Gu
COLT 2021 Double Explore-Then-Commit: Asymptotic Optimality and Beyond Tianyuan Jin, Pan Xu, Xiaokui Xiao, Quanquan Gu
NeurIPS 2021 Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks Hanxun Huang, Yisen Wang, Sarah Erfani, Quanquan Gu, James Bailey, Xingjun Ma
UAI 2021 Faster Convergence of Stochastic Gradient Langevin Dynamics for Non-Log-Concave Sampling Difan Zou, Pan Xu, Quanquan Gu
ICLR 2021 How Much Over-Parameterization Is Sufficient to Learn Deep ReLU Networks? Zixiang Chen, Yuan Cao, Difan Zou, Quanquan Gu
NeurIPS 2021 Iterative Teacher-Aware Learning Luyao Yuan, Dongruo Zhou, Junhong Shen, Jingdong Gao, Jeffrey L Chen, Quanquan Gu, Ying Nian Wu, Song-Chun Zhu
NeurIPSW 2021 Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium Chris Junchi Li, Dongruo Zhou, Quanquan Gu, Michael Jordan
ICML 2021 Logarithmic Regret for Reinforcement Learning with Linear Function Approximation Jiafan He, Dongruo Zhou, Quanquan Gu
ICML 2021 MOTS: Minimax Optimal Thompson Sampling Tianyuan Jin, Pan Xu, Jieming Shi, Xiaokui Xiao, Quanquan Gu
NeurIPS 2021 Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs Jiafan He, Dongruo Zhou, Quanquan Gu
COLT 2021 Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes Dongruo Zhou, Quanquan Gu, Csaba Szepesvari
ICLR 2021 Neural Thompson Sampling Weitong Zhang, Dongruo Zhou, Lihong Li, Quanquan Gu
ICML 2021 On the Convergence of Hamiltonian Monte Carlo with Stochastic Gradients Difan Zou, Quanquan Gu
ICML 2021 Provable Generalization of SGD-Trained Neural Networks of Any Width in the Presence of Adversarial Label Noise Spencer Frei, Yuan Cao, Quanquan Gu
ICML 2021 Provable Robustness of Adversarial Training for Learning Halfspaces with Noise Difan Zou, Spencer Frei, Quanquan Gu
ICML 2021 Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping Dongruo Zhou, Jiafan He, Quanquan Gu
NeurIPS 2021 Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints Tianhao Wang, Dongruo Zhou, Quanquan Gu
NeurIPS 2021 Proxy Convexity: A Unified Framework for the Analysis of Neural Networks Trained by Gradient Descent Spencer Frei, Quanquan Gu
NeurIPS 2021 Pure Exploration in Kernel and Neural Bandits Yinglun Zhu, Dongruo Zhou, Ruoxi Jiang, Quanquan Gu, Rebecca Willett, Robert Nowak
NeurIPS 2021 Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation Weitong Zhang, Dongruo Zhou, Quanquan Gu
NeurIPS 2021 Risk Bounds for Over-Parameterized Maximum Margin Classification on Sub-Gaussian Mixtures Yuan Cao, Quanquan Gu, Mikhail Belkin
NeurIPS 2021 The Benefits of Implicit Regularization from SGD in Least Squares Problems Difan Zou, Jingfeng Wu, Vladimir Braverman, Quanquan Gu, Dean P. Foster, Sham Kakade
IJCAI 2021 Towards Understanding the Spectral Bias of Deep Learning Yuan Cao, Zhiying Fang, Yue Wu, Ding-Xuan Zhou, Quanquan Gu
NeurIPS 2021 Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation Jiafan He, Dongruo Zhou, Quanquan Gu
NeurIPS 2021 Variance-Aware Off-Policy Evaluation with Linear Function Approximation Yifei Min, Tianhao Wang, Dongruo Zhou, Quanquan Gu
ICML 2020 A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation Pan Xu, Quanquan Gu
NeurIPS 2020 A Finite-Time Analysis of Two Time-Scale Actor-Critic Methods Yue Frank Wu, Weitong Zhang, Pan Xu, Quanquan Gu
AAAI 2020 A Frank-Wolfe Framework for Efficient and Effective Adversarial Attacks Jinghui Chen, Dongruo Zhou, Jinfeng Yi, Quanquan Gu
NeurIPS 2020 A Generalized Neural Tangent Kernel Analysis for Two-Layer Neural Networks Zixiang Chen, Yuan Cao, Quanquan Gu, Tong Zhang
AAAI 2020 A Knowledge Transfer Framework for Differentially Private Sparse Learning Lingxiao Wang, Quanquan Gu
AISTATS 2020 Accelerated Factored Gradient Descent for Low-Rank Matrix Factorization Dongruo Zhou, Yuan Cao, Quanquan Gu
NeurIPS 2020 Agnostic Learning of a Single Neuron with Gradient Descent Spencer Frei, Yuan Cao, Quanquan Gu
IJCAI 2020 Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks Jinghui Chen, Dongruo Zhou, Yiqi Tang, Ziyan Yang, Yuan Cao, Quanquan Gu
AAAI 2020 Generalization Error Bounds of Gradient Descent for Learning Over-Parameterized Deep ReLU Networks Yuan Cao, Quanquan Gu
MLJ 2020 Gradient Descent Optimizes Over-Parameterized Deep ReLU Networks Difan Zou, Yuan Cao, Dongruo Zhou, Quanquan Gu
ICLR 2020 Improving Adversarial Robustness Requires Revisiting Misclassified Examples Yisen Wang, Difan Zou, Jinfeng Yi, James Bailey, Xingjun Ma, Quanquan Gu
ICLR 2020 Improving Neural Language Generation with Spectrum Control Lingxiao Wang, Jing Huang, Kevin Huang, Ziniu Hu, Guangtao Wang, Quanquan Gu
ICML 2020 Neural Contextual Bandits with UCB-Based Exploration Dongruo Zhou, Lihong Li, Quanquan Gu
ICLR 2020 On the Global Convergence of Training Deep Linear ResNets Difan Zou, Philip M. Long, Quanquan Gu
ICML 2020 Optimization Theory for ReLU Neural Networks Trained with Normalization Layers Yonatan Dukler, Quanquan Gu, Guido Montufar
AAAI 2020 Rank Aggregation via Heterogeneous Thurstone Preference Models Tao Jin, Pan Xu, Quanquan Gu, Farzad Farnoud
ICLR 2020 Sample Efficient Policy Gradient Methods with Recursive Variance Reduction Pan Xu, Felicia Gao, Quanquan Gu
JMLR 2020 Stochastic Nested Variance Reduction for Nonconvex Optimization Dongruo Zhou, Pan Xu, Quanquan Gu
AISTATS 2020 Stochastic Recursive Variance-Reduced Cubic Regularization Methods Dongruo Zhou, Quanquan Gu
AISTATS 2020 Understanding the Intrinsic Robustness of Image Distributions Using Conditional Generative Models Xiao Zhang, Jinghui Chen, Quanquan Gu, David Evans
NeurIPS 2019 Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks Spencer Frei, Yuan Cao, Quanquan Gu
NeurIPS 2019 An Improved Analysis of Training Over-Parameterized Deep Neural Networks Difan Zou, Quanquan Gu
UAI 2019 An Improved Convergence Analysis of Stochastic Variance-Reduced Policy Gradient Pan Xu, Felicia Gao, Quanquan Gu
IJCAI 2019 Differentially Private Iterative Gradient Hard Thresholding for Sparse Learning Lingxiao Wang, Quanquan Gu
NeurIPS 2019 Generalization Bounds of Stochastic Gradient Descent for Wide and Deep Neural Networks Yuan Cao, Quanquan Gu
NeurIPS 2019 Layer-Dependent Importance Sampling for Training Deep and Large Graph Convolutional Networks Difan Zou, Ziniu Hu, Yewen Wang, Song Jiang, Yizhou Sun, Quanquan Gu
AISTATS 2019 Learning One-Hidden-Layer ReLU Networks via Gradient Descent Xiao Zhang, Yaodong Yu, Lingxiao Wang, Quanquan Gu
ICML 2019 Lower Bounds for Smooth Nonconvex Finite-Sum Optimization Dongruo Zhou, Quanquan Gu
ICML 2019 On the Convergence and Robustness of Adversarial Training Yisen Wang, Xingjun Ma, James Bailey, Jinfeng Yi, Bowen Zhou, Quanquan Gu
AISTATS 2019 Sampling from Non-Log-Concave Distributions via Variance-Reduced Gradient Langevin Dynamics Difan Zou, Pan Xu, Quanquan Gu
NeurIPS 2019 Stochastic Gradient Hamiltonian Monte Carlo Methods with Recursive Variance Reduction Difan Zou, Pan Xu, Quanquan Gu
JMLR 2019 Stochastic Variance-Reduced Cubic Regularization Methods Dongruo Zhou, Pan Xu, Quanquan Gu
NeurIPS 2019 Tight Sample Complexity of Learning One-Hidden-Layer Convolutional Neural Networks Yuan Cao, Quanquan Gu
ICML 2018 A Primal-Dual Analysis of Global Optimality in Nonconvex Low-Rank Matrix Recovery Xiao Zhang, Lingxiao Wang, Yaodong Yu, Quanquan Gu
AISTATS 2018 A Unified Framework for Nonconvex Low-Rank Plus Sparse Matrix Recovery Xiao Zhang, Lingxiao Wang, Quanquan Gu
AISTATS 2018 Accelerated Stochastic Mirror Descent: From Continuous-Time Dynamics to Discrete-Time Algorithms Pan Xu, Tianhao Wang, Quanquan Gu
ICML 2018 Continuous and Discrete-Time Accelerated Stochastic Mirror Descent for Strongly Convex Functions Pan Xu, Tianhao Wang, Quanquan Gu
ICML 2018 Covariate Adjusted Precision Matrix Estimation via Nonconvex Optimization Jinghui Chen, Pan Xu, Lingxiao Wang, Jian Ma, Quanquan Gu
ECML-PKDD 2018 Differentially Private Hypothesis Transfer Learning Yang Wang, Quanquan Gu, Donald E. Brown
NeurIPS 2018 Distributed Learning Without Distress: Privacy-Preserving Empirical Risk Minimization Bargav Jayaraman, Lingxiao Wang, David Evans, Quanquan Gu
ICML 2018 Fast and Sample Efficient Inductive Matrix Completion via Multi-Phase Procrustes Flow Xiao Zhang, Simon Du, Quanquan Gu
NeurIPS 2018 Global Convergence of Langevin Dynamics Based Algorithms for Nonconvex Optimization Pan Xu, Jinghui Chen, Difan Zou, Quanquan Gu
NeurIPS 2018 Stochastic Nested Variance Reduction for Nonconvex Optimization Dongruo Zhou, Pan Xu, Quanquan Gu
ICML 2018 Stochastic Variance-Reduced Cubic Regularized Newton Methods Dongruo Zhou, Pan Xu, Quanquan Gu
ICML 2018 Stochastic Variance-Reduced Hamilton Monte Carlo Methods Difan Zou, Pan Xu, Quanquan Gu
UAI 2018 Subsampled Stochastic Variance-Reduced Gradient Langevin Dynamics Difan Zou, Pan Xu, Quanquan Gu
NeurIPS 2018 Third-Order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima Yaodong Yu, Pan Xu, Quanquan Gu
AISTATS 2017 A Unified Computational and Statistical Framework for Nonconvex Low-Rank Matrix Estimation Lingxiao Wang, Xiao Zhang, Quanquan Gu
ICML 2017 A Unified Variance Reduction-Based Framework for Nonconvex Low-Rank Matrix Recovery Lingxiao Wang, Xiao Zhang, Quanquan Gu
AISTATS 2017 Communication-Efficient Distributed Sparse Linear Discriminant Analysis Lu Tian, Quanquan Gu
AISTATS 2017 Efficient Algorithm for Sparse Tensor-Variate Gaussian Graphical Models via Gradient Descent Pan Xu, Tingting Zhang, Quanquan Gu
AISTATS 2017 High-Dimensional Time Series Clustering via Cross-Predictability Dezhi Hong, Quanquan Gu, Kamin Whitehouse
ICML 2017 High-Dimensional Variance-Reduced Stochastic Gradient Expectation-Maximization Algorithm Rongda Zhu, Lingxiao Wang, Chengxiang Zhai, Quanquan Gu
ICML 2017 Robust Gaussian Graphical Model Estimation with Arbitrary Corruption Lingxiao Wang, Quanquan Gu
NeurIPS 2017 Speeding up Latent Variable Gaussian Graphical Model Estimation via Nonconvex Optimization Pan Xu, Jian Ma, Quanquan Gu
ICML 2017 Uncertainty Assessment and False Discovery Rate Control in High-Dimensional Granger Causal Inference Aditya Chaudhry, Pan Xu, Quanquan Gu
UAI 2016 Accelerated Stochastic Block Coordinate Gradient Descent for Sparsity Constrained Nonconvex Optimization Jinghui Chen, Quanquan Gu
UAI 2016 Forward Backward Greedy Algorithms for Multi-Task Learning with Faster Rates Lu Tian, Pan Xu, Quanquan Gu
AISTATS 2016 Low-Rank and Sparse Structure Pursuit via Alternating Minimization Quanquan Gu, Zhaoran Wang, Han Liu
ICML 2016 On the Statistical Limits of Convex Relaxations Zhaoran Wang, Quanquan Gu, Han Liu
AISTATS 2016 Optimal Statistical and Computational Rates for One Bit Matrix Completion Renkun Ni, Quanquan Gu
AISTATS 2016 Precision Matrix Estimation in High Dimensional Gaussian Graphical Models with Faster Rates Lingxiao Wang, Xiang Ren, Quanquan Gu
NeurIPS 2016 Semiparametric Differential Graph Models Pan Xu, Quanquan Gu
ICML 2016 Towards Faster Rates and Oracle Property for Low-Rank Matrix Estimation Huan Gui, Jiawei Han, Quanquan Gu
NeurIPS 2015 High Dimensional EM Algorithm: Statistical Optimization and Asymptotic Normality Zhaoran Wang, Quanquan Gu, Yang Ning, Han Liu
ECML-PKDD 2015 Robust Classification of Information Networks by Consistent Graph Learning Shi Zhi, Jiawei Han, Quanquan Gu
ICML 2015 Towards a Lower Sample Complexity for Robust One-Bit Compressed Sensing Rongda Zhu, Quanquan Gu
UAI 2014 Batch-Mode Active Learning via Error Bound Minimization Quanquan Gu, Tong Zhang, Jiawei Han
NeurIPS 2014 Robust Tensor Decomposition with Gross Corruption Quanquan Gu, Huan Gui, Jiawei Han
NeurIPS 2014 Sparse PCA with Oracle Property Quanquan Gu, Zhaoran Wang, Han Liu
AISTATS 2013 Clustered Support Vector Machines Quanquan Gu, Jiawei Han
AISTATS 2013 Unsupervised Link Selection in Networks Quanquan Gu, Charu C. Aggarwal, Jiawei Han
AISTATS 2012 Locality Preserving Feature Learning Quanquan Gu, Marina Danilevsky, Zhenhui Li, Jiawei Han
NeurIPS 2012 Selective Labeling via Error Bound Minimization Quanquan Gu, Tong Zhang, Jiawei Han, Chris H. Ding
UAI 2011 Generalized Fisher Score for Feature Selection Quanquan Gu, Zhenhui Li, Jiawei Han
IJCAI 2011 Joint Feature Selection and Subspace Learning Quanquan Gu, Zhenhui Li, Jiawei Han
AAAI 2011 Learning a Kernel for Multi-Task Clustering Quanquan Gu, Zhenhui Li, Jiawei Han
ECML-PKDD 2011 Linear Discriminant Dimensionality Reduction Quanquan Gu, Zhenhui Li, Jiawei Han
IJCAI 2011 On Trivial Solution and Scale Transfer Problems in Graph Regularized NMF Quanquan Gu, Chris H. Q. Ding, Jiawei Han
IJCAI 2009 Local Learning Regularized Nonnegative Matrix Factorization Quanquan Gu, Jie Zhou
ECML-PKDD 2009 Transductive Classification via Dual Regularization Quanquan Gu, Jie Zhou