ML Anthology
Authors
Search
About
Ye, Chenlu
11 publications
ICML
2025
Catoni Contextual Bandits Are Robust to Heavy-Tailed Rewards
Chenlu Ye
,
Yujia Jin
,
Alekh Agarwal
,
Tong Zhang
ICML
2025
Logarithmic Regret for Online KL-Regularized Reinforcement Learning
Heyang Zhao
,
Chenlu Ye
,
Wei Xiong
,
Quanquan Gu
,
Tong Zhang
JMLR
2025
Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning
Yong Lin
,
Chen Liu
,
Chenlu Ye
,
Qing Lian
,
Yuan Yao
,
Tong Zhang
NeurIPS
2025
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
,
Chenlu Ye
,
Quanquan Gu
,
Tong Zhang
ICML
2025
Understanding Overadaptation in Supervised Fine-Tuning: The Role of Ensemble Methods
Yifan Hao
,
Xingyuan Pan
,
Hanning Zhang
,
Chenlu Ye
,
Rui Pan
,
Tong Zhang
ICML
2024
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF Under KL-Constraint
Wei Xiong
,
Hanze Dong
,
Chenlu Ye
,
Ziqi Wang
,
Han Zhong
,
Heng Ji
,
Nan Jiang
,
Tong Zhang
NeurIPS
2024
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
Chenlu Ye
,
Wei Xiong
,
Yuheng Zhang
,
Hanze Dong
,
Nan Jiang
,
Tong Zhang
NeurIPSW
2024
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
,
Chenlu Ye
,
Quanquan Gu
,
Tong Zhang
ICML
2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
Chenlu Ye
,
Jiafan He
,
Quanquan Gu
,
Tong Zhang
ICML
2023
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
Chenlu Ye
,
Wei Xiong
,
Quanquan Gu
,
Tong Zhang
NeurIPS
2023
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
Chenlu Ye
,
Rui Yang
,
Quanquan Gu
,
Tong Zhang