Mannelli, Stefano Sarao

17 publications

ICLR 2025 A Theory of Initialisation's Impact on Specialisation Devon Jarvis, Sebastian Lee, Clémentine Carla Juliette Dominé, Andrew M Saxe, Stefano Sarao Mannelli
ICLR 2025 Optimal Protocols for Continual Learning via Statistical Physics and Control Theory Francesco Mori, Stefano Sarao Mannelli, Francesca Mignacco
NeurIPSW 2024 A Theory of Initialisation's Impact on Specialisation Devon Jarvis, Sebastian Lee, Clémentine Carla Juliette Dominé, Andrew M Saxe, Stefano Sarao Mannelli
NeurIPS 2024 Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Training Anchit Jain, Rozhin Nobahari, Aristide Baratin, Stefano Sarao Mannelli
NeurIPSW 2024 Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Training Anchit Jain, Rozhin Nobahari, Aristide Baratin, Stefano Sarao Mannelli
ICMLW 2024 Bias-Inducing Geometries: Exactly Solvable Data Model with Fairness Implications Stefano Sarao Mannelli, Federica Gerace, Negar Rostamzadeh, Luca Saglietti
TMLR 2024 How to Choose the Right Transfer Learning Protocol? a Qualitative Analysis in a Controlled Set-up Federica Gerace, Diego Doimo, Stefano Sarao Mannelli, Luca Saglietti, Alessandro Laio
NeurIPSW 2024 Optimal Protocols for Continual Learning via Statistical Physics and Control Theory Francesco Mori, Stefano Sarao Mannelli, Francesca Mignacco
ICML 2024 Tilting the Odds at the Lottery: The Interplay of Overparameterisation and Curricula in Neural Networks Stefano Sarao Mannelli, Yaraslau Ivashynka, Andrew M Saxe, Luca Saglietti
ICML 2024 Why Do Animals Need Shaping? a Theory of Task Composition and Curriculum Learning Jin Hwa Lee, Stefano Sarao Mannelli, Andrew M Saxe
ICLRW 2023 The Rl Perceptron: Dynamics of Policy Learning in High Dimensions Nishil Patel, Sebastian Lee, Stefano Sarao Mannelli, Sebastian Goldt, Andrew M Saxe
ICML 2022 Maslow’s Hammer in Catastrophic Forgetting: Node Re-Use vs. Node Activation Sebastian Lee, Stefano Sarao Mannelli, Claudia Clopath, Sebastian Goldt, Andrew Saxe
NeurIPS 2021 Analytical Study of Momentum-Based Acceleration Methods in Paradigmatic High-Dimensional Non-Convex Problems Stefano Sarao Mannelli, Pierfrancesco Urbani
NeurIPS 2020 Complex Dynamics in Simple Neural Networks: Understanding Gradient Flow in Phase Retrieval Stefano Sarao Mannelli, Giulio Biroli, Chiara Cammarota, Florent Krzakala, Pierfrancesco Urbani, Lenka Zdeborová
NeurIPS 2020 Optimization and Generalization of Shallow Neural Networks with Quadratic Activation Functions Stefano Sarao Mannelli, Eric Vanden-Eijnden, Lenka Zdeborová
ICML 2019 Passed & Spurious: Descent Algorithms and Local Minima in Spiked Matrix-Tensor Models Stefano Sarao Mannelli, Florent Krzakala, Pierfrancesco Urbani, Lenka Zdeborova
NeurIPS 2019 Who Is Afraid of Big Bad Minima? Analysis of Gradient-Flow in Spiked Matrix-Tensor Models Stefano Sarao Mannelli, Giulio Biroli, Chiara Cammarota, Florent Krzakala, Lenka Zdeborová