Ahn, Kwangjun
25 publications
NeurIPS
2025
Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training
NeurIPS
2023
Transformers Learn to Implement Preconditioned Gradient Descent for In-Context Learning
25 publications