Zhou, Zhengyuan

44 publications

ICML 2025 Concurrent Reinforcement Learning with Aggregated States via Randomized Least Squares Value Iteration Yan Chen, Qinxun Bai, Yiteng Zhang, Maria Dimakopoulou, Shi Dong, Qi Sun, Zhengyuan Zhou
JAIR 2025 DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning Xiaoteng Ma, Junyao Chen, Li Xia, Jun Yang, Qianchuan Zhao, Zhengyuan Zhou
ICML 2025 Distributionally Robust Policy Learning Under Concept Drifts Jingyuan Wang, Zhimei Ren, Ruohan Zhan, Zhengyuan Zhou
NeurIPS 2025 Improved Confidence Regions and Optimal Algorithms for Online and Offline Linear MNL Bandits Yuxuan Han, Jose Blanchet, Zhengyuan Zhou
ICML 2025 Improved Last-Iterate Convergence of Shuffling Gradient Methods for Nonsmooth Convex Optimization Zijian Liu, Zhengyuan Zhou
ICLR 2025 Nonconvex Stochastic Optimization Under Heavy-Tailed Noises: Optimal Convergence Without Gradient Clipping Zijian Liu, Zhengyuan Zhou
NeurIPS 2025 Precise Asymptotics and Refined Regret of Variance-Aware UCB Yingying Fan, Yuxuan Han, Jinchi Lv, Xiaocong Xu, Zhengyuan Zhou
AISTATS 2025 Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou
ICML 2024 Adaptively Learning to Select-Rank in Online Platforms Jingyuan Wang, Perry Dong, Ying Jin, Ruohan Zhan, Zhengyuan Zhou
AISTATS 2024 Feasible $q$-Learning for Average Reward Reinforcement Learning Ying Jin, Ramki Gummadi, Zhengyuan Zhou, Jose Blanchet
ICML 2024 On the Convergence of Projected Bures-Wasserstein Gradient Descent Under Euclidean Strong Convexity Junyi Fan, Yuxuan Han, Zijian Liu, Jian-Feng Cai, Yang Wang, Zhengyuan Zhou
ICML 2024 On the Last-Iterate Convergence of Shuffling Gradient Methods Zijian Liu, Zhengyuan Zhou
ICLR 2024 Revisiting the Last-Iterate Convergence of Stochastic Gradient Methods Zijian Liu, Zhengyuan Zhou
JMLR 2024 Sample Complexity of Variance-Reduced Distributionally Robust Q-Learning Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou
ICML 2024 Single-Trajectory Distributionally Robust Reinforcement Learning Zhipeng Liang, Xiaoteng Ma, Jose Blanchet, Jun Yang, Jiheng Zhang, Zhengyuan Zhou
NeurIPS 2024 Stochastic Contextual Bandits with Graph Feedback: From Independence Number to MAS Number Yuxiao Wen, Yanjun Han, Zhengyuan Zhou
AISTATS 2023 A Finite Sample Complexity Bound for Distributionally Robust Q-Learning Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou
JAIR 2023 A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg Zhaonan Qu, Kaixiang Lin, Zhaojian Li, Jiayu Zhou, Zhengyuan Zhou
COLT 2023 Breaking the Lower Bound with (Little) Structure: Acceleration in Non-Convex Stochastic Optimization with Heavy-Tailed Noise Zijian Liu, Jiawei Zhang, Zhengyuan Zhou
JAIR 2022 Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning Yuexiang Zhai, Christina Baek, Zhengyuan Zhou, Jiantao Jiao, Yi Ma
ICML 2022 Distributionally Robust $q$-Learning Zijian Liu, Qinxun Bai, Jose Blanchet, Perry Dong, Wei Xu, Zhengqing Zhou, Zhengyuan Zhou
ICML 2022 Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning Nathan Kallus, Xiaojie Mao, Kaiwen Wang, Zhengyuan Zhou
NeurIPS 2022 Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions Wei Zhang, Yanjun Han, Zhengyuan Zhou, Aaron Flores, Tsachy Weissman
JMLR 2022 No Weighted-Regret Learning in Adversarial Bandits with Delays Ilai Bistritz, Zhengyuan Zhou, Xi Chen, Nicholas Bambos, Jose Blanchet
JMLR 2022 Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States Shi Dong, Benjamin Van Roy, Zhengyuan Zhou
NeurIPS 2022 Society of Agents: Regret Bounds of Concurrent Thompson Sampling Yan Chen, Perry Dong, Qinxun Bai, Maria Dimakopoulou, Wei Xu, Zhengyuan Zhou
AISTATS 2021 Finite-Sample Regret Bound for Distributionally Robust Offline Tabular Reinforcement Learning Zhengqing Zhou, Zhengyuan Zhou, Qinxun Bai, Linhai Qiu, Jose Blanchet, Peter Glynn
NeurIPS 2021 Online Multi-Armed Bandits with Adaptive Inference Maria Dimakopoulou, Zhimei Ren, Zhengyuan Zhou
L4DC 2021 Provably Sample Efficient Reinforcement Learning in Competitive Linear Quadratic Systems Jingwei Zhang, Zhuoran Yang, Zhengyuan Zhou, Zhaoran Wang
AAAI 2020 Delay-Adaptive Distributed Stochastic Optimization Zhaolin Ren, Zhengyuan Zhou, Linhai Qiu, Ajay Deshpande, Jayant Kalagnanam
ICML 2020 Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet
ICML 2020 Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games Tianyi Lin, Zhengyuan Zhou, Panayotis Mertikopoulos, Michael Jordan
NeurIPS 2020 Optimistic Dual Extrapolation for Coherent Non-Monotone Variational Inequalities Chaobing Song, Zhengyuan Zhou, Yichao Zhou, Yong Jiang, Yi Ma
ICLR 2020 Understanding L4-Based Dictionary Learning: Interpretation, Stability, and Robustness Yuexiang Zhai, Hermish Mehta, Zhengyuan Zhou, Yi Ma
AAAI 2019 Balanced Linear Contextual Bandits Maria Dimakopoulou, Zhengyuan Zhou, Susan Athey, Guido Imbens
NeurIPS 2019 Learning in Generalized Linear Contextual Bandits with Stochastic Delays Zhengyuan Zhou, Renyuan Xu, Jose Blanchet
NeurIPS 2019 Online EXP3 Learning in Adversarial Bandits with Delayed Feedback Ilai Bistritz, Zhengyuan Zhou, Xi Chen, Nicholas Bambos, Jose Blanchet
ICML 2018 Distributed Asynchronous Optimization with Unbounded Delays: How Slow Can You Go? Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Peter Glynn, Yinyu Ye, Li-Jia Li, Li Fei-Fei
NeurIPS 2018 Learning in Games with Lossy Feedback Zhengyuan Zhou, Panayotis Mertikopoulos, Susan Athey, Nicholas Bambos, Peter W. Glynn, Yinyu Ye
ICML 2018 MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels Lu Jiang, Zhengyuan Zhou, Thomas Leung, Li-Jia Li, Li Fei-Fei
NeurIPS 2017 Countering Feedback Delays in Multi-Agent Learning Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Peter W. Glynn, Claire Tomlin
NeurIPS 2017 Stochastic Mirror Descent in Variationally Coherent Optimization Problems Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Stephen Boyd, Peter W. Glynn
AAAI 2014 Hybrid Singular Value Thresholding for Tensor Completion Xiaoqin Zhang, Zhengyuan Zhou, Di Wang, Yi Ma
NeurIPS 2013 Simultaneous Rectification and Alignment via Robust Recovery of Low-Rank Tensors Xiaoqin Zhang, Di Wang, Zhengyuan Zhou, Yi Ma