Agarwala, Atish

17 publications

ICML 2025 Avoiding Spurious Sharpness Minimization Broadens Applicability of SAM Sidak Pal Singh, Hossein Mobahi, Atish Agarwala, Yann Dauphin
ICML 2025 Exact Risk Curves of signSGD in High-Dimensions: Quantifying Preconditioning and Noise-Compression Effects Ke Liang Xiao, Noah Marshall, Atish Agarwala, Elliot Paquette
TMLR 2025 How Far Away Are Truly Hyperparameter-Free Learning Algorithms? Priya Kasimbeg, Vincent Roulet, Naman Agarwal, Sourabh Medapati, Fabian Pedregosa, Atish Agarwala, George E. Dahl
ICML 2025 Scaling Collapse Reveals Universal Dynamics in Compute-Optimally Trained Neural Networks Shikai Qiu, Lechao Xiao, Andrew Gordon Wilson, Jeffrey Pennington, Atish Agarwala
ICLR 2025 To CLIP or Not to CLIP: The Dynamics of SGD with Gradient Clipping in High-Dimensions Noah Marshall, Ke Liang Xiao, Atish Agarwala, Elliot Paquette
TMLR 2024 Feature Learning as Alignment: A Structural Property of Gradient Descent in Non-Linear Neural Networks Daniel Beaglehole, Ioannis Mitliagkas, Atish Agarwala
ICMLW 2024 Gradient Descent Induces Alignment Between Weights and the Pre-Activation Tangents for Deep Non-Linear Networks Daniel Beaglehole, Ioannis Mitliagkas, Atish Agarwala
NeurIPS 2024 Neglected Hessian Component Explains Mysteries in Sharpness Regularization Yann N. Dauphin, Atish Agarwala, Hossein Mobahi
NeurIPSW 2024 Scaling Collapse Reveals Universal Dynamics in Compute-Optimally Trained Neural Networks Shikai Qiu, Atish Agarwala, Jeffrey Pennington, Lechao Xiao
NeurIPS 2024 Stepping on the Edge: Curvature Aware Learning Rate Tuners Vincent Roulet, Atish Agarwala, Jean-Bastien Grill, Grzegorz Swirszcz, Mathieu Blondel, Fabian Pedregosa
NeurIPSW 2023 On the Interplay Between Stepsize Tuning and Progressive Sharpening Vincent Roulet, Atish Agarwala, Fabian Pedregosa
ICML 2023 SAM Operates Far from Home: Eigenvalue Regularization as a Dynamical Phenomenon Atish Agarwala, Yann Dauphin
ICML 2023 Second-Order Regression Models Exhibit Progressive Sharpening to the Edge of Stability Atish Agarwala, Fabian Pedregosa, Jeffrey Pennington
TMLR 2023 Temperature Check: Theory and Practice for Training Models with SoftMax-Cross-Entropy Losses Atish Agarwala, Samuel Stern Schoenholz, Jeffrey Pennington, Yann Dauphin
NeurIPSW 2022 A Second-Order Regression Model Shows Edge of Stability Behavior Fabian Pedregosa, Atish Agarwala, Jeffrey Pennington
ICML 2022 Deep Equilibrium Networks Are Sensitive to Initialization Statistics Atish Agarwala, Samuel S Schoenholz
ICLR 2021 One Network Fits All? Modular Versus Monolithic Task Formulations in Neural Networks Atish Agarwala, Abhimanyu Das, Brendan Juba, Rina Panigrahy, Vatsal Sharan, Xin Wang, Qiuyi Zhang