MoRe Fine-Tuning with 10x Fewer Parameters

Tan, Wenxuan; Roberts, Nicholas; Huang, Tzu-Heng; Zhao, Jitian; Cooper, John; Guo, Samuel; Duan, Chengyu; Sala, Frederic

MoRe Fine-Tuning with 10x Fewer Parameters

Wenxuan Tan, Nicholas Roberts, Tzu-Heng Huang, Jitian Zhao, John Cooper, Samuel Guo, Chengyu Duan, Frederic Sala

ICMLW 2024

/icmlw/2024/tan2024icmlw-more/

Abstract

Parameter-efficient fine-tuning (PEFT) techniques have unlocked the potential to cheaply and easily specialize large pretrained models. However, the most prominent approaches, like low-rank adapters (LoRA) depend on heuristics or rules-of-thumb for their architectural choices---potentially limiting their performance for new models and architectures. This limitation suggests that techniques from neural architecture search could be used to obtain optimal adapter architectures, but these are often expensive and difficult to implement. We address this challenge with Monarch Rectangular Fine-tuning (MoRe), a simple framework to search over adapter architectures that relies on the Monarch matrix class. Theoretically, we show that MoRe is more expressive than LoRA. Empirically, our approach is more parameter-efficient and performant than state-of-the-art PEFTs on a range of tasks and models, with as few as 5% of LoRA's parameters.

PDF ICMLW OpenReview Semantic Scholar

Cite

Text

Tan et al. "MoRe Fine-Tuning with 10x Fewer Parameters." ICML 2024 Workshops: ES-FoMo-II, 2024.

Markdown

[Tan et al. "MoRe Fine-Tuning with 10x Fewer Parameters." ICML 2024 Workshops: ES-FoMo-II, 2024.](https://mlanthology.org/icmlw/2024/tan2024icmlw-more/)

BibTeX

@inproceedings{tan2024icmlw-more,
  title     = {{MoRe Fine-Tuning with 10x Fewer Parameters}},
  author    = {Tan, Wenxuan and Roberts, Nicholas and Huang, Tzu-Heng and Zhao, Jitian and Cooper, John and Guo, Samuel and Duan, Chengyu and Sala, Frederic},
  booktitle = {ICML 2024 Workshops: ES-FoMo-II},
  year      = {2024},
  url       = {https://mlanthology.org/icmlw/2024/tan2024icmlw-more/}
}