Zou, Shaofeng

32 publications

TMLR 2025 Adaptive Gradient Normalization and Independent Sampling for (Stochastic) Generalized-Smooth Optimization Yufeng Yang, Erin E. Tripp, Yifan Sun, Shaofeng Zou, Yi Zhou

TMLR 2025 Convergence Guarantees for RMSProp and Adam in Generalized-Smooth Non-Convex Optimization with Affine Noise Variance Qi Zhang, Yi Zhou, Shaofeng Zou

ICLR 2025 MGDA Converges Under Generalized Smoothness, Provably Qi Zhang, Peiyao Xiao, Shaofeng Zou, Kaiyi Ji

AISTATS 2025 Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model Zilong Deng, Simon Khan, Shaofeng Zou

ICLR 2025 Revisiting Large-Scale Non-Convex Distributionally Robust Optimization Qi Zhang, Yi Zhou, Simon Khan, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou

NeurIPS 2024 A Unified Principle of Pessimism for Offline Reinforcement Learning Under Model Mismatch Yue Wang, Zhongchang Sun, Shaofeng Zou

TMLR 2024 Achieving the Asymptotically Minimax Optimal Sample Complexity of Offline Reinforcement Learning: A DRO-Based Approach Yue Wang, Jinjun Xiong, Shaofeng Zou

ICML 2024 Constrained Reinforcement Learning Under Model Mismatch Zhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou

MLJ 2024 Finite-Time Error Bounds for Greedy-GQ Yue Wang, Yi Zhou, Shaofeng Zou

AAAI 2024 Large-Scale Non-Convex Stochastic Constrained Distributionally Robust Optimization Qi Zhang, Yi Zhou, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou

UAI 2024 Model-Free Robust Reinforcement Learning with Sample Complexity Analysis Yudan Wang, Shaofeng Zou, Yue Wang

ICML 2024 Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation Yudan Wang, Yue Wang, Yi Zhou, Shaofeng Zou

NeurIPS 2024 Policy Optimization for Robust Average Reward MDPs Zhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou

JAIR 2024 Robust Average-Reward Reinforcement Learning Yue Wang, Alvaro Velasquez, George K. Atia, Ashley Prater-Bennette, Shaofeng Zou

AISTATS 2024 Sample Complexity Characterization for Linear Contextual MDPs Junze Deng, Yuan Cheng, Shaofeng Zou, Yingbin Liang

TMLR 2024 What Is the Solution for State-Adversarial Multi-Agent Reinforcement Learning? Songyang Han, Sanbao Su, Sihong He, Shuo Han, Haizhao Yang, Shaofeng Zou, Fei Miao

JMLR 2023 Decentralized Robust V-Learning for Solving Markov Games with Model Uncertainty Shaocong Ma, Ziyi Chen, Shaofeng Zou, Yi Zhou

NeurIPSW 2023 Large-Scale Non-Convex Stochastic Constrained Distributionally Robust Optimization Qi Zhang, Yi Zhou, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou

ICML 2023 Model-Free Robust Average-Reward Reinforcement Learning Yue Wang, Alvaro Velasquez, George K. Atia, Ashley Prater-Bennette, Shaofeng Zou

AAAI 2023 Robust Average-Reward Markov Decision Processes Yue Wang, Alvaro Velasquez, George K. Atia, Ashley Prater-Bennette, Shaofeng Zou

TMLR 2023 Robust Multi-Agent Reinforcement Learning with State Uncertainty Sihong He, Songyang Han, Sanbao Su, Shuo Han, Shaofeng Zou, Fei Miao

ICML 2022 Policy Gradient Method for Robust Reinforcement Learning Yue Wang, Shaofeng Zou

ICML 2022 Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis Ziyi Chen, Yi Zhou, Rong-Rong Chen, Shaofeng Zou

ICLR 2021 Greedy-GQ with Variance Reduction: Finite-Time Analysis and Improved Complexity Shaocong Ma, Ziyi Chen, Yi Zhou, Shaofeng Zou

AAAI 2021 Learning Graph Neural Networks with Approximate Gradient Descent Qunwei Li, Shaofeng Zou, Wenliang Zhong

NeurIPS 2021 Non-Asymptotic Analysis for Two Time-Scale TDC with General Smooth Function Approximation Yue Wang, Shaofeng Zou, Yi Zhou

NeurIPS 2021 Online Robust Reinforcement Learning with Model Uncertainty Yue Wang, Shaofeng Zou

UAI 2020 Finite-Sample Analysis of Greedy-GQ with Linear Function Approximation Under Markovian Noise Yue Wang, Shaofeng Zou

AAAI 2020 Information-Theoretic Understanding of Population Risk Improvement with Model Compression Yuheng Bu, Weihao Gao, Shaofeng Zou, Venugopal V. Veeravalli

NeurIPS 2020 Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis Shaocong Ma, Yi Zhou, Shaofeng Zou

NeurIPS 2019 Finite-Sample Analysis for SARSA with Linear Function Approximation Shaofeng Zou, Tengyu Xu, Yingbin Liang

NeurIPS 2019 Two Time-Scale Off-Policy TD Learning: Non-Asymptotic Analysis over Markovian Samples Tengyu Xu, Shaofeng Zou, Yingbin Liang