Zou, Shaofeng
32 publications
AISTATS
2025
Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model
NeurIPS
2024
A Unified Principle of Pessimism for Offline Reinforcement Learning Under Model Mismatch
ICML
2022
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis
NeurIPS
2021
Non-Asymptotic Analysis for Two Time-Scale TDC with General Smooth Function Approximation