On the Benefits of Weight Normalization for Overparameterized Matrix Sensing

Wei, Yudong; Zhang, Liang; Li, Bingcong; He, Niao

On the Benefits of Weight Normalization for Overparameterized Matrix Sensing

Yudong Wei, Liang Zhang, Bingcong Li, Niao He

ICLR 2026

/iclr/2026/wei2026iclr-benefits/

Abstract

While normalization techniques are widely used in deep learning, their theoretical understanding remains relatively limited. In this work, we establish the benefits of (generalized) weight normalization (WN) applied to the overparameterized matrix sensing problem. We prove that WN with Riemannian optimization achieves linear convergence, yielding an $\textit{exponential}$ speedup over standard methods that do not use WN. Our analysis further demonstrates that both iteration and sample complexity improve polynomially as the level of overparameterization increases. To the best of our knowledge, this work provides the first characterization of how WN leverages overparameterization for faster convergence in matrix sensing.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Wei et al. "On the Benefits of Weight Normalization for Overparameterized Matrix Sensing." International Conference on Learning Representations, 2026.

Markdown

[Wei et al. "On the Benefits of Weight Normalization for Overparameterized Matrix Sensing." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wei2026iclr-benefits/)

BibTeX

@inproceedings{wei2026iclr-benefits,
  title     = {{On the Benefits of Weight Normalization for Overparameterized Matrix Sensing}},
  author    = {Wei, Yudong and Zhang, Liang and Li, Bingcong and He, Niao},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/wei2026iclr-benefits/}
}