Yamamura, Atsushi

2 publications

NeurIPS 2023 Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks Feng Chen, Daniel Kunin, Atsushi Yamamura, Surya Ganguli
ICLR 2023 The Asymmetric Maximum Margin Bias of Quasi-Homogeneous Neural Networks Daniel Kunin, Atsushi Yamamura, Chao Ma, Surya Ganguli