Pesme, Scott
9 publications
NeurIPS
2025
A Theoretical Framework for Grokking: Interpolation Followed by Riemannian Norm Minimisation
AISTATS
2024
Leveraging Continuous Time to Understand Momentum When Training Diagonal Linear Networks