Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

Abstract

Deep neural networks tend to exhibit a bias toward low-rank solutions during training, implicitly learning low-dimensional feature representations. This paper investigates how deep multilayer perceptrons (MLPs) encode these feature manifolds and connects this behavior to the Information Bottleneck (IB) theory. We introduce the concept of \emph{local rank} as a measure of feature manifold dimensionality and demonstrate, both theoretically and empirically, that this rank decreases during the final phase of training. We argue that networks which reduce the rank of their learned representations also compress the mutual information between inputs and intermediate layers. This work bridges the gap between feature manifold rank and information compression, offering new insights into the interplay between information bottlenecks and representation learning.
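
The abstract does not reproduce the paper's formal definition of local rank. As an illustrative sketch only, assuming local rank is estimated as the numerical rank of the Jacobian of a layer's mapping evaluated at individual inputs (a common proxy for the local dimensionality of a feature manifold), a PyTorch-style computation might look like the following; the network, the local_rank helper, and the tolerance are hypothetical and not taken from the paper.

import torch
import torch.nn as nn

# Small MLP whose hidden representations we probe (illustrative architecture).
mlp = nn.Sequential(
    nn.Linear(10, 64), nn.ReLU(),
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, 3),
)

def local_rank(layer_fn, x, tol=1e-3):
    # Numerical rank of the Jacobian of layer_fn at input x: the number of
    # singular values above tol times the largest one, read here as the
    # local dimensionality of the feature manifold around x.
    jac = torch.autograd.functional.jacobian(layer_fn, x)  # shape (out_dim, in_dim)
    s = torch.linalg.svdvals(jac)                          # descending order
    return int((s > tol * s[0]).sum())

# Average local rank of the second hidden layer over a batch of inputs.
hidden = nn.Sequential(*list(mlp.children())[:4])  # up to and including the second ReLU
xs = torch.randn(32, 10)
ranks = [local_rank(hidden, x) for x in xs]
print(sum(ranks) / len(ranks))

Tracking this batch-averaged rank across training epochs, layer by layer, would make the late-training decrease described in the abstract visible.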

Cite

Text

Patel and Shwartz-Ziv. "Learning to Compress: Local Rank and Information Compression in Deep Neural Networks." NeurIPS 2024 Workshops: Compression, 2024.

Markdown

[Patel and Shwartz-Ziv. "Learning to Compress: Local Rank and Information Compression in Deep Neural Networks." NeurIPS 2024 Workshops: Compression, 2024.](https://mlanthology.org/neuripsw/2024/patel2024neuripsw-learning/)

BibTeX

@inproceedings{patel2024neuripsw-learning,
  title     = {{Learning to Compress: Local Rank and Information Compression in Deep Neural Networks}},
  author    = {Patel, Niket Nikul and Shwartz-Ziv, Ravid},
  booktitle = {NeurIPS 2024 Workshops: Compression},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/patel2024neuripsw-learning/}
}