UPDP: A Unified Progressive Depth Pruner for CNN and Vision Transformer

Abstract

Traditional channel-wise pruning methods by reducing network channels struggle to effectively prune efficient CNN models with depth-wise convolutional layers and certain efficient modules, such as popular inverted residual blocks. Prior depth pruning methods by reducing network depths are not suitable for pruning some efficient models due to the existence of some normalization layers. Moreover, finetuning subnet with directly removing activation layers would corrupt the original model weights, hindering the pruned model from achieving high performance. To address these issues, we propose a novel depth pruning method for efficient models. Our approach proposes a novel block pruning strategy and progressive training method for the subnet. Additionally, we extend our pruning method to vision transformer models. Experimental results demonstrate that our method consistently outperforms existing depth pruning methods across various pruning configurations. We obtained three pruned ConvNeXtV1 models with our method applying on ConvNeXtV1, which surpass most SOTA efficient models with comparable inference performance. Our method also achieves state-of-the-art pruning performance on the vision transformer model.

Cite

Text

Liu et al. "UPDP: A Unified Progressive Depth Pruner for CNN and Vision Transformer." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I12.29296

Markdown

[Liu et al. "UPDP: A Unified Progressive Depth Pruner for CNN and Vision Transformer." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/liu2024aaai-updp/) doi:10.1609/AAAI.V38I12.29296

BibTeX

@inproceedings{liu2024aaai-updp,
  title     = {{UPDP: A Unified Progressive Depth Pruner for CNN and Vision Transformer}},
  author    = {Liu, Ji and Tang, Dehua and Huang, Yuanxian and Zhang, Li and Zeng, Xiaocheng and Li, Dong and Lu, Mingjie and Peng, Jinzhang and Wang, Yu and Jiang, Fan and Tian, Lu and Sirasao, Ashish},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {13891-13899},
  doi       = {10.1609/AAAI.V38I12.29296},
  url       = {https://mlanthology.org/aaai/2024/liu2024aaai-updp/}
}