Zhao, Canzhe

8 publications

ICML 2025 Learning Imperfect Information Extensive-Form Games with Last-Iterate Convergence Under Bandit Feedback Canzhe Zhao, Yutian Cheng, Jing Dong, Baoxiang Wang, Shuai Li
AAAI 2025 Logarithmic Regret for Linear Markov Decision Processes with Adversarial Corruptions Canzhe Zhao, Xiangcheng Zhang, Baoxiang Wang, Shuai Li
UAI 2025 Towards Provably Efficient Learning of Imperfect Information Extensive-Form Games with Linear Function Approximation Canzhe Zhao, Shuze Chen, Weiming Liu, Haobo Fu, Qiang Fu, Shuai Li
COLT 2023 Best-of-Three-Worlds Analysis for Linear Bandits with Follow-the-Regularized-Leader Algorithm Fang Kong, Canzhe Zhao, Shuai Li
IJCAI 2023 DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li
ICLR 2023 Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition Canzhe Zhao, Ruofeng Yang, Baoxiang Wang, Shuai Li
NeurIPS 2023 Learning Adversarial Low-Rank Markov Decision Processes with Unknown Transition and Full-Information Feedback Canzhe Zhao, Ruofeng Yang, Baoxiang Wang, Xuezhou Zhang, Shuai Li
AAAI 2022 Simultaneously Learning Stochastic and Adversarial Bandits Under the Position-Based Model Cheng Chen, Canzhe Zhao, Shuai Li