Li, Yuanzhi
86 publications
ICLR
2025
Physics of Language Models: Part 2.2, How to Learn from Mistakes on Grade-School Math Problems
NeurIPSW
2024
Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data
ICLR
2023
Sampling Is as Easy as Learning the Score: Theory for Diffusion Models with Minimal Data Assumptions
ICLR
2023
Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning
ICLR
2023
Understanding the Generalization of Adam in Learning Neural Networks with Proper Regularization
NeurIPSW
2022
Sampling Is as Easy as Learning the Score: Theory for Diffusion Models with Minimal Data Assumptions
ICML
2021
Sample Efficient Reinforcement Learning in Continuous State Spaces: A Perspective Beyond Linearity
ICLR
2019
Algorithmic Framework for Model-Based Deep Reinforcement Learning with Theoretical Guarantees
NeurIPS
2019
Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers
NeurIPS
2019
Towards Explaining the Regularization Effect of Initial Large Learning Rate in Training Neural Networks
NeurIPS
2018
Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data
ICML
2017
Provable Alternating Gradient Descent for Non-Negative Matrix Factorization with Strong Correlations