Chen, Yudong
58 publications
ICLRW
2025
Navigating Solution Spaces in Large Language Models Through Controlled Embedding Exploration
NeurIPS
2025
Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL
NeurIPS
2025
The $\varphi$ Curve: The Shape of Generalization Through the Lens of Norm-Based Capacity Control
ALT
2025
The Plug-in Approach for Average-Reward and Discounted MDPs: Optimal Sample Complexity Analysis
NeurIPS
2024
Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs
NeurIPS
2024
The Collusion of Memory and Nonlinearity in Stochastic Approximation with Constant Stepsize
NeurIPSW
2022
Matrix Estimation for Offline Evaluation in Reinforcement Learning with Low-Rank Structure
NeurIPS
2021
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
JMLR
2015
Iterative and Active Graph Clustering Using Trace Norm Minimization Without Cluster Size Constraints