Tuning Pre-Trained Model via Moment Probing
Abstract
Recently, efficient fine-tuning of large-scale pre-trained models has attracted increasing research interests, where linear probing (LP) as a fundamental module is involved in exploiting the final representations for task-dependent classification. However, most of the existing methods focus on how to effectively introduce a few of learnable parameters, and little work pays attention to the commonly used LP module. In this paper, we propose a novel Moment Probing (MP) method to further explore the potential of LP. Distinguished from LP which builds a linear classification head based on the mean of final features (e.g., word tokens for ViT) or classification tokens, our MP performs a linear classifier on feature distribution, which provides a stronger representation ability by exploiting richer statistical information inherent in features. Specifically, we represent feature distribution by its characteristic function, which is efficiently approximated by using first- and second-order moments of features. Furthermore, we propose a multi-head convolutional cross-covariance to compute second-order moments in an efficient and effective manner. By considering that MP could affect feature learning, we introduce a partially shared module to learn two recalibrating parameters (PSRP) for backbones based on MP, namely MP+. Extensive experiments on ten benchmarks using various models show that our MP significantly outperforms LP and is competitive with counterparts at less training cost, while our MP+ achieves state-of-the-art performance.
Cite
Text
Gao et al. "Tuning Pre-Trained Model via Moment Probing." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01084Markdown
[Gao et al. "Tuning Pre-Trained Model via Moment Probing." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/gao2023iccv-tuning/) doi:10.1109/ICCV51070.2023.01084BibTeX
@inproceedings{gao2023iccv-tuning,
title = {{Tuning Pre-Trained Model via Moment Probing}},
author = {Gao, Mingze and Wang, Qilong and Lin, Zhenyi and Zhu, Pengfei and Hu, Qinghua and Zhou, Jingbo},
booktitle = {International Conference on Computer Vision},
year = {2023},
pages = {11803-11813},
doi = {10.1109/ICCV51070.2023.01084},
url = {https://mlanthology.org/iccv/2023/gao2023iccv-tuning/}
}