Zou, Shaofeng

32 publications

TMLR 2025 Adaptive Gradient Normalization and Independent Sampling for (Stochastic) Generalized-Smooth Optimization Yufeng Yang, Erin E. Tripp, Yifan Sun, Shaofeng Zou, Yi Zhou
TMLR 2025 Convergence Guarantees for RMSProp and Adam in Generalized-Smooth Non-Convex Optimization with Affine Noise Variance Qi Zhang, Yi Zhou, Shaofeng Zou
ICLR 2025 MGDA Converges Under Generalized Smoothness, Provably Qi Zhang, Peiyao Xiao, Shaofeng Zou, Kaiyi Ji
AISTATS 2025 Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model Zilong Deng, Simon Khan, Shaofeng Zou
ICLR 2025 Revisiting Large-Scale Non-Convex Distributionally Robust Optimization Qi Zhang, Yi Zhou, Simon Khan, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou
NeurIPS 2024 A Unified Principle of Pessimism for Offline Reinforcement Learning Under Model Mismatch Yue Wang, Zhongchang Sun, Shaofeng Zou
TMLR 2024 Achieving the Asymptotically Minimax Optimal Sample Complexity of Offline Reinforcement Learning: A DRO-Based Approach Yue Wang, Jinjun Xiong, Shaofeng Zou
ICML 2024 Constrained Reinforcement Learning Under Model Mismatch Zhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou
MLJ 2024 Finite-Time Error Bounds for Greedy-GQ Yue Wang, Yi Zhou, Shaofeng Zou
AAAI 2024 Large-Scale Non-Convex Stochastic Constrained Distributionally Robust Optimization Qi Zhang, Yi Zhou, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou
UAI 2024 Model-Free Robust Reinforcement Learning with Sample Complexity Analysis Yudan Wang, Shaofeng Zou, Yue Wang
ICML 2024 Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation Yudan Wang, Yue Wang, Yi Zhou, Shaofeng Zou
NeurIPS 2024 Policy Optimization for Robust Average Reward MDPs Zhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou
JAIR 2024 Robust Average-Reward Reinforcement Learning Yue Wang, Alvaro Velasquez, George K. Atia, Ashley Prater-Bennette, Shaofeng Zou
AISTATS 2024 Sample Complexity Characterization for Linear Contextual MDPs Junze Deng, Yuan Cheng, Shaofeng Zou, Yingbin Liang
TMLR 2024 What Is the Solution for State-Adversarial Multi-Agent Reinforcement Learning? Songyang Han, Sanbao Su, Sihong He, Shuo Han, Haizhao Yang, Shaofeng Zou, Fei Miao
JMLR 2023 Decentralized Robust V-Learning for Solving Markov Games with Model Uncertainty Shaocong Ma, Ziyi Chen, Shaofeng Zou, Yi Zhou
NeurIPSW 2023 Large-Scale Non-Convex Stochastic Constrained Distributionally Robust Optimization Qi Zhang, Yi Zhou, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou
ICML 2023 Model-Free Robust Average-Reward Reinforcement Learning Yue Wang, Alvaro Velasquez, George K. Atia, Ashley Prater-Bennette, Shaofeng Zou
AAAI 2023 Robust Average-Reward Markov Decision Processes Yue Wang, Alvaro Velasquez, George K. Atia, Ashley Prater-Bennette, Shaofeng Zou
TMLR 2023 Robust Multi-Agent Reinforcement Learning with State Uncertainty Sihong He, Songyang Han, Sanbao Su, Shuo Han, Shaofeng Zou, Fei Miao
ICML 2022 Policy Gradient Method for Robust Reinforcement Learning Yue Wang, Shaofeng Zou
ICML 2022 Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis Ziyi Chen, Yi Zhou, Rong-Rong Chen, Shaofeng Zou
ICLR 2021 Greedy-GQ with Variance Reduction: Finite-Time Analysis and Improved Complexity Shaocong Ma, Ziyi Chen, Yi Zhou, Shaofeng Zou
AAAI 2021 Learning Graph Neural Networks with Approximate Gradient Descent Qunwei Li, Shaofeng Zou, Wenliang Zhong
NeurIPS 2021 Non-Asymptotic Analysis for Two Time-Scale TDC with General Smooth Function Approximation Yue Wang, Shaofeng Zou, Yi Zhou
NeurIPS 2021 Online Robust Reinforcement Learning with Model Uncertainty Yue Wang, Shaofeng Zou
UAI 2020 Finite-Sample Analysis of Greedy-GQ with Linear Function Approximation Under Markovian Noise Yue Wang, Shaofeng Zou
AAAI 2020 Information-Theoretic Understanding of Population Risk Improvement with Model Compression Yuheng Bu, Weihao Gao, Shaofeng Zou, Venugopal V. Veeravalli
NeurIPS 2020 Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis Shaocong Ma, Yi Zhou, Shaofeng Zou
NeurIPS 2019 Finite-Sample Analysis for SARSA with Linear Function Approximation Shaofeng Zou, Tengyu Xu, Yingbin Liang
NeurIPS 2019 Two Time-Scale Off-Policy TD Learning: Non-Asymptotic Analysis over Markovian Samples Tengyu Xu, Shaofeng Zou, Yingbin Liang