Milligan, Alan

4 publications

NeurIPS 2025. Understanding Adam Requires Better Rotation Dependent Assumptions. Tianyue H. Zhang, Lucas Maes, Alan Milligan, Alexia Jolicoeur-Martineau, Ioannis Mitliagkas, Damien Scieur, Simon Lacoste-Julien, Charles Guille-Escuret
NeurIPS 2024. Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models. Frederik Kunstner, Alan Milligan, Robin Yadav, Mark Schmidt, Alberto Bietti
NeurIPSW 2024. Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models. Frederik Kunstner, Alan Milligan, Robin Yadav, Mark Schmidt, Alberto Bietti
NeurIPSW 2024. Normalization Matters for Optimization Performance on Graph Neural Networks. Alan Milligan, Frederik Kunstner, Hamed Shirzad, Mark Schmidt, Danica J. Sutherland