Zhong, Han
37 publications
AISTATS
2024
Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation
ICML
2024
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret
NeurIPS
2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
NeurIPS
2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
ICML
2022
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
ICLRW
2022
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets