Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso Under Information Bottleneck Principle

Abstract

Most existing neural network pruning methods hand-crafted their importance criteria and structures to prune. This constructs heavy and unintended dependencies on heuristics and expert experience for both the objective and the parameters of the pruning approach. In this paper, we try to solve this problem by introducing a principled and unified framework based on Information Bottleneck (IB) theory, which further guides us to an automatic pruning approach. Specifically, we first formulate the channel pruning problem from an IB perspective, and then implement the IB principle by solving a Hilbert-Schmidt Independence Criterion (HSIC) Lasso problem under certain conditions. Based on the theoretical guidance, we then provide an automatic pruning scheme by searching for global penalty coefficients. Verified by extensive experiments, our method yields state-of-the-art performance on various benchmark networks and datasets. For example, with VGG-16, we achieve a 60%-FLOPs reduction by removing 76% of the parameters, with an improvement of 0.40% in top-1 accuracy on CIFAR-10. With ResNet-50, we achieve a 56%-FLOPs reduction by removing 50% of the parameters, with a small loss of 0.08% in the top-1 accuracy on ImageNet. The code is available at https://github.com/sunggo/APIB.

Cite

Text

Guo et al. "Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso Under Information Bottleneck Principle." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01601

Markdown

[Guo et al. "Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso Under Information Bottleneck Principle." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/guo2023iccv-automatic/) doi:10.1109/ICCV51070.2023.01601

BibTeX

@inproceedings{guo2023iccv-automatic,
  title     = {{Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso Under Information Bottleneck Principle}},
  author    = {Guo, Song and Zhang, Lei and Zheng, Xiawu and Wang, Yan and Li, Yuchao and Chao, Fei and Wu, Chenglin and Zhang, Shengchuan and Ji, Rongrong},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {17458-17469},
  doi       = {10.1109/ICCV51070.2023.01601},
  url       = {https://mlanthology.org/iccv/2023/guo2023iccv-automatic/}
}