Parameter-Efficient Tuning of Large-Scale Multimodal Foundation Model
Abstract
Driven by the progress of large-scale pre-training, parameter-efficient transfer learning has gained immense popularity across different subfields of Artificial Intelligence. The core is to adapt the model to downstream tasks with only a small set of parameters. Recently, researchers have leveraged such proven techniques in multimodal tasks and achieve promising results. However, two critical issues remain unresolved: how to further reduce the complexity with lightweight design and how to boost alignment between modalities under extremely low parameters. In this paper, we propose A gracefUl pRompt framewOrk for cRoss-modal trAnsfer (AURORA) to overcome these challenges. Considering the redundancy in existing architectures, we first utilize the mode approximation to generate 0.1M trainable parameters to implement the multimodal parameter-efficient tuning, which explores the low intrinsic dimension with only 0.04% parameters of the pre-trained model. Then, for better modality alignment, we propose the Informative Context Enhancement and Gated Query Transformation module under extremely few parameters scenes. A thorough evaluation on six cross-modal benchmarks shows that it not only outperforms the state-of-the-art but even outperforms the full fine-tuning approach. Our code is available at: https://github.com/WillDreamer/Aurora.
Cite
Text
Wang et al. "Parameter-Efficient Tuning of Large-Scale Multimodal Foundation Model." Neural Information Processing Systems, 2023.Markdown
[Wang et al. "Parameter-Efficient Tuning of Large-Scale Multimodal Foundation Model." Neural Information Processing Systems, 2023.](https://mlanthology.org/neurips/2023/wang2023neurips-parameterefficient/)BibTeX
@inproceedings{wang2023neurips-parameterefficient,
title = {{Parameter-Efficient Tuning of Large-Scale Multimodal Foundation Model}},
author = {Wang, Haixin and Yang, Xinlong and Chang, Jianlong and Jin, Dian and Sun, Jinan and Zhang, Shikun and Luo, Xiao and Tian, Qi},
booktitle = {Neural Information Processing Systems},
year = {2023},
url = {https://mlanthology.org/neurips/2023/wang2023neurips-parameterefficient/}
}