Zhao, Peng
70 publications
NeurIPS
2025
Gradient-Variation Online Adaptivity for Accelerated Optimization with Hölder Smoothness
JMLR
2024
Adaptivity and Non-Stationarity: Problem-Dependent Dynamic Regret for Online Convex Optimization
AAAI
2024
Dynamic Regret of Adversarial MDPs with Unknown Transition and Linear Function Approximation
AISTATS
2024
Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition
JMLR
2024
Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization
NeurIPS
2024
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
ICML
2023
Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization
NeurIPS
2023
Universal Online Learning with Gradient Variations: A Multi-Layer Online Ensemble Approach
ICCV
2023
Weakly-Supervised Action Localization by Hierarchically-Structured Latent Attention Modeling