Voting from Nearest Tasks: Meta-Vote Pruning of Pre-Trained Models for Downstream Tasks

Abstract

As large-scale pre-trained models have become the default choice in many applications, new challenges arise for model pruning: can we avoid pruning the same model from scratch for every downstream task, and how can we reuse the pruning results of previous tasks to accelerate pruning for new tasks? To address these challenges, we create a small model for a new task from the pruned models of similar tasks. We show that a few fine-tuning steps on this model suffice to produce a promising pruned model for the new task. We study this “meta-pruning” from nearest tasks on two major classes of pre-trained models, convolutional neural networks and vision transformers, under a limited budget of pruning iterations. Our study begins by investigating the overlap of pruned models for similar tasks and how the overlap changes over different layers and blocks. Inspired by these discoveries, we develop a simple but effective “Meta-Vote Pruning” (MVP) method that significantly reduces the pruning iterations for a new task by initializing a sub-network from the pruned models of its nearest tasks. We demonstrate MVP’s accuracy, efficiency, and generalization advantages through extensive empirical studies and comparisons with popular pruning methods on several datasets.
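The abstract does not spell out how nearest tasks are found or how their pruned models are combined, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes each task has a hypothetical embedding vector, that "nearest" means smallest Euclidean distance, that each pruned model is summarized by a binary keep-mask over the units of one layer (filters or heads), and that the initial sub-network keeps the units receiving the most votes. The names `nearest_tasks`, `meta_vote_mask`, `task_embs`, and `pruned_masks` are illustrative.

```python
import numpy as np


def nearest_tasks(new_task_emb, task_embs, k):
    """Return indices of the k tasks closest to the new task.

    Euclidean distance between task embeddings is an assumption made
    for this sketch; the abstract does not name a similarity measure.
    """
    dists = np.linalg.norm(task_embs - new_task_emb, axis=1)
    return np.argsort(dists, kind="stable")[:k]


def meta_vote_mask(neighbor_masks, keep):
    """Keep the `keep` units that occur most often in the neighbours' masks.

    neighbor_masks: array of shape (num_neighbors, num_units) with 0/1
    entries, where 1 means the unit survived pruning for that task.
    """
    votes = np.sum(neighbor_masks, axis=0)
    # Stable sort on negated votes: ties go to the lowest unit index.
    order = np.argsort(-votes, kind="stable")
    mask = np.zeros(votes.shape[0], dtype=bool)
    mask[order[:keep]] = True
    return mask


# Toy data: four previously pruned tasks, one layer with six units.
task_embs = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [5.0, 5.0]])
pruned_masks = np.array([
    [1, 1, 0, 0, 1, 0],
    [1, 0, 1, 0, 1, 0],
    [1, 1, 0, 0, 0, 1],
    [0, 0, 1, 1, 0, 1],
])

new_task_emb = np.array([0.1, 0.1])
neighbors = nearest_tasks(new_task_emb, task_embs, k=3)
init_mask = meta_vote_mask(pruned_masks[neighbors], keep=3)
print(neighbors, init_mask)
```

In this toy setup the distant fourth task is excluded from the vote, and the resulting mask would serve only as the starting sub-network, which the abstract says is then fine-tuned for a few steps on the new task.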

Cite

Text

Zhao et al. "Voting from Nearest Tasks: Meta-Vote Pruning of Pre-Trained Models for Downstream Tasks." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023. doi:10.1007/978-3-031-43415-0_4

Markdown

[Zhao et al. "Voting from Nearest Tasks: Meta-Vote Pruning of Pre-Trained Models for Downstream Tasks." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023.](https://mlanthology.org/ecmlpkdd/2023/zhao2023ecmlpkdd-voting/) doi:10.1007/978-3-031-43415-0_4

BibTeX

@inproceedings{zhao2023ecmlpkdd-voting,
  title     = {{Voting from Nearest Tasks: Meta-Vote Pruning of Pre-Trained Models for Downstream Tasks}},
  author    = {Zhao, Haiyan and Zhou, Tianyi and Long, Guodong and Jiang, Jing and Zhang, Chengqi},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2023},
  pages     = {52--68},
  doi       = {10.1007/978-3-031-43415-0_4},
  url       = {https://mlanthology.org/ecmlpkdd/2023/zhao2023ecmlpkdd-voting/}
}