Small ReLU Networks Are Powerful Memorizers: A Tight Analysis of Memorization Capacity

Abstract

We study finite sample expressivity, i.e., memorization power of ReLU networks. Recent results require $N$ hidden nodes to memorize/interpolate arbitrary $N$ data points. In contrast, by exploiting depth, we show that 3-layer ReLU networks with $\Omega(\sqrt{N})$ hidden nodes can perfectly memorize most datasets with $N$ points. We also prove that width $\Theta(\sqrt{N})$ is necessary and sufficient for memorizing $N$ data points, proving tight bounds on memorization capacity. The sufficiency result can be extended to deeper networks; we show that an $L$-layer network with $W$ parameters in the hidden layers can memorize $N$ data points if $W = \Omega(N)$. Combined with a recent upper bound $O(WL\log W)$ on VC dimension, our construction is nearly tight for any fixed $L$. Subsequently, we analyze memorization capacity of residual networks under a general position assumption; we prove results that substantially reduce the known requirement of $N$ hidden nodes. Finally, we study the dynamics of stochastic gradient descent (SGD), and show that when initialized near a memorizing global minimum of the empirical risk, SGD quickly finds a nearby point with much smaller empirical risk.

Cite

Text

Yun et al. "Small ReLU Networks Are Powerful Memorizers: A Tight Analysis of Memorization Capacity." Neural Information Processing Systems, 2019.

Markdown

[Yun et al. "Small ReLU Networks Are Powerful Memorizers: A Tight Analysis of Memorization Capacity." Neural Information Processing Systems, 2019.](https://mlanthology.org/neurips/2019/yun2019neurips-small/)

BibTeX

@inproceedings{yun2019neurips-small,
  title     = {{Small ReLU Networks Are Powerful Memorizers: A Tight Analysis of Memorization Capacity}},
  author    = {Yun, Chulhee and Sra, Suvrit and Jadbabaie, Ali},
  booktitle = {Neural Information Processing Systems},
  year      = {2019},
  pages     = {15558-15569},
  url       = {https://mlanthology.org/neurips/2019/yun2019neurips-small/}
}