SVD-LLM: Truncation-Aware Singular Value Decomposition for Large Language Model Compression
Abstract
Advancements in Large Language Models (LLMs) have been hindered by their substantial sizes, which necessitate LLM compression methods for practical deployment. Singular Value Decomposition (SVD) offers a promising solution for LLM compression. However, state-of-the-art SVD-based LLM compression methods have two key limitations: (1) truncating smaller singular values may lead to higher compression loss, and (2) the compressed weights are not updated after SVD truncation. In this work, we propose SVD-LLM, an SVD-based post-training LLM compression method that addresses the limitations of existing methods. SVD-LLM incorporates a truncation-aware data whitening technique to ensure a direct mapping between singular values and compression loss. Moreover, SVD-LLM adopts a parameter update with sequential low-rank approximation to compensate for the accuracy degradation after SVD compression. We evaluate SVD-LLM on ten datasets and seven models from three different LLM families at three different scales. Our results demonstrate the superiority of SVD-LLM over state-of-the-art methods, especially at high model compression ratios.
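To make the whitening idea concrete, below is a minimal NumPy sketch of truncation-aware SVD compression of a single weight matrix: the weight is whitened by the Cholesky factor of the input-activation Gram matrix, truncated via SVD, and mapped back as two low-rank factors. The function name svd_compress_whitened, the numerical jitter, and the factor layout are our own illustrative choices, not the paper's reference implementation, and the sketch covers only the whitening step, not the subsequent parameter update.

import numpy as np

def svd_compress_whitened(W, X, rank):
    """Compress weight W (d_out x d_in) to the given rank, whitening by
    input activations X (d_in x n) so that truncated singular values
    correspond directly to compression loss ||W X - W' X||_F."""
    # Whitening matrix S: Cholesky factor of the input Gram matrix,
    # with a small jitter on the diagonal for numerical stability.
    gram = X @ X.T + 1e-6 * np.eye(X.shape[0])
    S = np.linalg.cholesky(gram)
    # SVD of the whitened weight; keep only the largest singular values.
    U, sigma, Vt = np.linalg.svd(W @ S, full_matrices=False)
    U_k, sigma_k, Vt_k = U[:, :rank], sigma[:rank], Vt[:rank]
    # Map back out of the whitened space: W' = U_k diag(sigma_k) Vt_k S^{-1}.
    A = U_k * sigma_k              # d_out x k
    B = Vt_k @ np.linalg.inv(S)    # k x d_in
    return A, B                    # W' = A @ B, stored as low-rank factors

Storing A and B in place of W reduces the parameter count from d_out * d_in to (d_out + d_in) * k, which is where the compression comes from; at inference the two factors are applied in sequence instead of the original dense weight.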
Cite
Text
Wang et al. "SVD-LLM: Truncation-Aware Singular Value Decomposition for Large Language Model Compression." International Conference on Learning Representations, 2025.
Markdown
[Wang et al. "SVD-LLM: Truncation-Aware Singular Value Decomposition for Large Language Model Compression." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/wang2025iclr-svdllm/)
BibTeX
@inproceedings{wang2025iclr-svdllm,
title = {{SVD-LLM: Truncation-Aware Singular Value Decomposition for Large Language Model Compression}},
author = {Wang, Xin and Zheng, Yu and Wan, Zhongwei and Zhang, Mi},
booktitle = {International Conference on Learning Representations},
year = {2025},
url = {https://mlanthology.org/iclr/2025/wang2025iclr-svdllm/}
}