Mousavi-Hosseini, Alireza
12 publications
NeurIPS
2025
From Information to Generative Exponent: Learning Rate Induces Phase Transitions in SGD
NeurIPS
2025
When Do Transformers Outperform Feedforward and Recurrent Networks? A Statistical Perspective