Ganguli, Surya
61 publications
NeurIPS
2025
Rethinking Fine-Tuning When Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning
NeurIPS
2024
Get Rich Quick: Exact Solutions Reveal How Unbalanced Initializations Promote Rapid Feature Learning
ICMLW
2024
Get Rich Quick: Exact Solutions Reveal How Unbalanced Initializations Promote Rapid Feature Learning
NeurIPS
2023
Pretraining Task Diversity and the Emergence of Non-Bayesian In-Context Learning for Regression
NeurIPS
2023
Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks
ICLR
2022
How Many Degrees of Freedom Do We Need to Train Deep Networks: A Loss Landscape Perspective
NeurIPSW
2022
Unmasking the Lottery Ticket Hypothesis: Efficient Adaptive Pruning for Finding Winning Tickets
ICLR
2019
An Analytic Theory of Generalization Dynamics and Transfer Learning in Deep Linear Networks
NeurIPS
2019
From Deep Learning to Mechanistic Understanding in Neuroscience: The Structure of Retinal Prediction
NeurIPS
2019
Reverse Engineering Recurrent Networks for Sentiment Classification Reveals Line Attractor Dynamics
NeurIPS
2019
Universality and Individuality in Neural Dynamics Across Large Populations of Recurrent Networks
NeurIPS
2018
The Emergence of Multiple Retinal Cell Types Through Efficient Coding of Natural Movies
NeurIPS
2017
Resurrecting the Sigmoid in Deep Learning Through Dynamical Isometry: Theory and Practice