From High-Dimensional & Mean-Field Dynamics to Dimensionless ODEs: A Unifying Approach to SGD in Two-Layers Networks

Abstract

This manuscript investigates the one-pass stochastic gradient descent (SGD) dynamics of a two-layer neural network trained on Gaussian data and labels generated by a similar, though not necessarily identical, target function. We rigorously analyse the limiting dynamics via a deterministic and low-dimensional description in terms of the sufficient statistics for the population risk. Our unifying analysis bridges different regimes of interest, such as the classical gradient-flow regime of vanishing learning rate, the high-dimensional regime of large input dimension, and the overparameterised “mean-field” regime of large network width, covering as well the intermediate regimes where the limiting dynamics is determined by the interplay between these behaviours. In particular, in the high-dimensional limit, the infinite-width dynamics is found to remain close to a low-dimensional subspace spanned by the target principal directions. Our results therefore provide a unifying picture of the limiting SGD dynamics with synthetic data.

Cite

Text

Arnaboldi et al. "From High-Dimensional & Mean-Field Dynamics to Dimensionless ODEs: A Unifying Approach to SGD in Two-Layers Networks." Conference on Learning Theory, 2023.

Markdown

[Arnaboldi et al. "From High-Dimensional & Mean-Field Dynamics to Dimensionless ODEs: A Unifying Approach to SGD in Two-Layers Networks." Conference on Learning Theory, 2023.](https://mlanthology.org/colt/2023/arnaboldi2023colt-highdimensional/)

BibTeX

@inproceedings{arnaboldi2023colt-highdimensional,
  title     = {{From High-Dimensional & Mean-Field Dynamics to Dimensionless ODEs: A Unifying Approach to SGD in Two-Layers Networks}},
  author    = {Arnaboldi, Luca and Stephan, Ludovic and Krzakala, Florent and Loureiro, Bruno},
  booktitle = {Conference on Learning Theory},
  year      = {2023},
  pages     = {1199-1227},
  volume    = {195},
  url       = {https://mlanthology.org/colt/2023/arnaboldi2023colt-highdimensional/}
}