A Universal Compression Theory for Lottery Ticket Hypothesis and Neural Scaling Laws

Wang, Hong-Yi; Luo, Di; Poggio, Tomaso; Chuang, Isaac L.; Ziyin, Liu

A Universal Compression Theory for Lottery Ticket Hypothesis and Neural Scaling Laws

Hong-Yi Wang, Di Luo, Tomaso Poggio, Isaac L. Chuang, Liu Ziyin

ICLR 2026

/iclr/2026/wang2026iclr-universal/

Abstract

When training large-scale models, the performance typically scales with the number of parameters and the dataset size according to a slow power law. A fundamental theoretical and practical question is whether comparable performance can be achieved with significantly smaller models and substantially less data. In this work, we provide a positive and constructive answer. We prove that a generic permutation-invariant function of $d$ objects can be asymptotically compressed into a function of $\operatorname{polylog} d$ objects with vanishing error, which is proved to be the optimal compression rate. This theorem yields two key implications: (Ia) a large neural network can be compressed to polylogarithmic width while preserving its learning dynamics; (Ib) a large dataset can be compressed to polylogarithmic size while leaving the loss landscape of the corresponding model unchanged. Implication (Ia) directly establishes a proof of the dynamical lottery ticket hypothesis, which states that any ordinary network can be strongly compressed such that the learning dynamics and result remain unchanged. (Ib) shows that a neural scaling law of the form $L\sim d^{-\alpha}$ can be boosted to an arbitrarily fast power law decay, and ultimately to $\exp(-\alpha' \sqrt[m]{d})$.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Wang et al. "A Universal Compression Theory for Lottery Ticket Hypothesis and Neural Scaling Laws." International Conference on Learning Representations, 2026.

Markdown

[Wang et al. "A Universal Compression Theory for Lottery Ticket Hypothesis and Neural Scaling Laws." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wang2026iclr-universal/)

BibTeX

@inproceedings{wang2026iclr-universal,
  title     = {{A Universal Compression Theory for Lottery Ticket Hypothesis and Neural Scaling Laws}},
  author    = {Wang, Hong-Yi and Luo, Di and Poggio, Tomaso and Chuang, Isaac L. and Ziyin, Liu},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/wang2026iclr-universal/}
}