Accelerable Lottery Tickets with the Mixed-Precision Quantization

Abstract

In recent years, the lottery tickets hypothesis has gained widespread popularity as a means of network compression. However, the practical application of lottery tickets for hardware acceleration is difficult due to their element-wise unstructured sparsity nature. In this paper, we argue that network pruning can be seen as a special case of network quantization, and relax the hard network pruning with mixed-precision quantization in an unstructured manner, which makes it possible for real hardware acceleration. We successfully validate the wide existence of quantized lottery tickets, namely MPQ-tickets, that can match or even surpass the performance of corresponding full-precision dense networks on various representative benchmarks. Also, we demonstrate that MPQ-tickets have much higher flexibility than vanilla lottery tickets, and largely benefit from pruning when compared to QNNs. Moreover, the MPQ-tickets achieve up to 8× hardware acceleration of inference speed and 14× less memory consumption than full-precision models.

Cite

Text

Li et al. "Accelerable Lottery Tickets with the Mixed-Precision Quantization." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023. doi:10.1109/CVPRW59228.2023.00485

Markdown

[Li et al. "Accelerable Lottery Tickets with the Mixed-Precision Quantization." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023.](https://mlanthology.org/cvprw/2023/li2023cvprw-accelerable/) doi:10.1109/CVPRW59228.2023.00485

BibTeX

@inproceedings{li2023cvprw-accelerable,
  title     = {{Accelerable Lottery Tickets with the Mixed-Precision Quantization}},
  author    = {Li, Zhangheng and Gong, Yu and Zhang, Zhenyu and Xue, Xingyun and Chen, Tianlong and Liang, Yi and Yuan, Bo and Wang, Zhangyang},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2023},
  pages     = {4604-4612},
  doi       = {10.1109/CVPRW59228.2023.00485},
  url       = {https://mlanthology.org/cvprw/2023/li2023cvprw-accelerable/}
}