Cut Less, Fold More: Model Compression Through the Lens of Projection Geometry

Saukh, Olga; Wang, Dong; Šikić, Haris; Cheng, Yun; Thiele, Lothar

Cut Less, Fold More: Model Compression Through the Lens of Projection Geometry

Olga Saukh, Dong Wang, Haris Šikić, Yun Cheng, Lothar Thiele

ICLR 2026

/iclr/2026/saukh2026iclr-cut/

Abstract

Compressing neural networks without retraining is vital for deployment at scale. We study calibration-free compression through the lens of projection geometry: structured pruning is an axis-aligned projection, whereas model folding performs a low-rank projection via weight clustering. We formalize both as orthogonal operators and show that, within a rank distance of one, folding provably yields smaller parameter reconstruction error, and under mild smoothness assumptions, smaller functional perturbations than pruning. At scale, we evaluate >1'000 checkpoints spanning ResNet18, PreActResNet18, ViT-B/32, and CLIP ViT-B/32 on CIFAR-10 and ImageNet-1K, covering diverse training hyperparameters (optimizers, learning rates, augmentations, regularization, sharpness-aware training). We show that folding typically achieves higher post-compression accuracy, with the largest gains at moderate–high compression. The gap narrows and occasionally reverses at specific training setups. Our results position folding as a geometry-aware, calibration-free alternative to pruning that is often superior in practice and principled in theory.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Saukh et al. "Cut Less, Fold More: Model Compression Through the Lens of Projection Geometry." International Conference on Learning Representations, 2026.

Markdown

[Saukh et al. "Cut Less, Fold More: Model Compression Through the Lens of Projection Geometry." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/saukh2026iclr-cut/)

BibTeX

@inproceedings{saukh2026iclr-cut,
  title     = {{Cut Less, Fold More: Model Compression Through the Lens of Projection Geometry}},
  author    = {Saukh, Olga and Wang, Dong and Šikić, Haris and Cheng, Yun and Thiele, Lothar},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/saukh2026iclr-cut/}
}