Group and Shuffle: Efficient Structured Orthogonal Parametrization

Abstract

The increasing size of neural networks has led to a growing demand for methods of efficient finetuning. Recently, an orthogonal finetuning paradigm was introduced that uses orthogonal matrices for adapting the weights of a pretrained model. In this paper, we introduce a new class of structured matrices, which unifies and generalizes structured classes from previous works. We examine properties of this class and build a structured orthogonal parametrization upon it. We then use this parametrization to modify the orthogonal finetuning framework, improving parameter efficiency. We empirically validate our method on different domains, including adapting of text-to-image diffusion models and downstream task finetuning in language modeling. Additionally, we adapt our construction for orthogonal convolutions and conduct experiments with 1-Lipschitz neural networks.

Cite

Text

Gorbunov et al. "Group and Shuffle: Efficient Structured Orthogonal Parametrization." Neural Information Processing Systems, 2024. doi:10.52202/079017-2195

Markdown

[Gorbunov et al. "Group and Shuffle: Efficient Structured Orthogonal Parametrization." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/gorbunov2024neurips-group/) doi:10.52202/079017-2195

BibTeX

@inproceedings{gorbunov2024neurips-group,
  title     = {{Group and Shuffle: Efficient Structured Orthogonal Parametrization}},
  author    = {Gorbunov, Mikhail and Yudin, Nikolay and Soboleva, Vera and Alanov, Aibek and Naumov, Alexey and Rakhuba, Maxim},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-2195},
  url       = {https://mlanthology.org/neurips/2024/gorbunov2024neurips-group/}
}