Compacting Binary Neural Networks by Sparse Kernel Selection

Abstract

Binary Neural Network (BNN) represents convolution weights with 1-bit values, which enhances the efficiency of storage and computation. This paper is motivated by a previously revealed phenomenon that the binary kernels in successful BNNs are nearly power-law distributed: their values are mostly clustered into a small number of codewords. This phenomenon encourages us to compact typical BNNs and obtain further close performance through learning non-repetitive kernels within a binary kernel subspace. Specifically, we regard the binarization process as kernel grouping in terms of a binary codebook, and our task lies in learning to select a smaller subset of codewords from the full codebook. We then leverage the Gumbel-Sinkhorn technique to approximate the codeword selection process, and develop the Permutation Straight-Through Estimator (PSTE) that is able to not only optimize the selection process end-to-end but also maintain the non-repetitive occupancy of selected codewords. Experiments verify that our method reduces both the model size and bit-wise computational costs, and achieves accuracy improvements compared with state-of-the-art BNNs under comparable budgets.

Cite

Text

Wang et al. "Compacting Binary Neural Networks by Sparse Kernel Selection." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.02335

Markdown

[Wang et al. "Compacting Binary Neural Networks by Sparse Kernel Selection." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/wang2023cvpr-compacting/) doi:10.1109/CVPR52729.2023.02335

BibTeX

@inproceedings{wang2023cvpr-compacting,
  title     = {{Compacting Binary Neural Networks by Sparse Kernel Selection}},
  author    = {Wang, Yikai and Huang, Wenbing and Dong, Yinpeng and Sun, Fuchun and Yao, Anbang},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {24374-24383},
  doi       = {10.1109/CVPR52729.2023.02335},
  url       = {https://mlanthology.org/cvpr/2023/wang2023cvpr-compacting/}
}