Ni, Chengzhuo

7 publications

JMLR 2023 Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition Chengzhuo Ni, Yaqi Duan, Munther Dahleh, Mengdi Wang, Anru R. Zhang
ICLR 2023 Representation Learning for Low-Rank General-Sum Markov Games Chengzhuo Ni, Yuda Song, Xuezhou Zhang, Zihan Ding, Chi Jin, Mengdi Wang
NeurIPS 2023 Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Minshuo Chen, Mengdi Wang
NeurIPS 2022 Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization Hui Yuan, Chengzhuo Ni, Huazheng Wang, Xuezhou Zhang, Le Cong, Csaba Szepesvari, Mengdi Wang
ICML 2022 Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory Ruiqi Zhang, Xuezhou Zhang, Chengzhuo Ni, Mengdi Wang
ICML 2022 Optimal Estimation of Policy Gradient via Double Fitted Iteration Chengzhuo Ni, Ruiqi Zhang, Xiang Ji, Xuezhou Zhang, Mengdi Wang
NeurIPS 2021 On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method Junyu Zhang, Chengzhuo Ni, Zheng Yu, Csaba Szepesvari, Mengdi Wang