Rethinking Differentiable Search for Mixed-Precision Neural Networks
Abstract
Low-precision networks, with weights and activations quantized to low bit-width, are widely used to accelerate inference on edge devices. However, current solutions are uniform, using identical bit-width for all filters. This fails to account for the different sensitivities of different filters and is suboptimal. Mixed-precision networks address this problem, by tuning the bit-width to individual filter requirements. In this work, the problem of optimal mixed-precision network search (MPS) is considered. To circumvent its difficulties of discrete search space and combinatorial optimization, a new differentiable search architecture is proposed, with several novel contributions to advance the efficiency by leveraging the unique properties of the MPS problem. The resulting Efficient differentiable MIxed-Precision network Search (EdMIPS) method is effective at finding the optimal bit allocation for multiple popular networks, and can search a large model, e.g. Inception-V3, directly on ImageNet without proxy task in a reasonable amount of time. The learned mixed-precision networks significantly outperform their uniform counterparts.
Cite
Text
Cai and Vasconcelos. "Rethinking Differentiable Search for Mixed-Precision Neural Networks." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00242Markdown
[Cai and Vasconcelos. "Rethinking Differentiable Search for Mixed-Precision Neural Networks." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/cai2020cvpr-rethinking/) doi:10.1109/CVPR42600.2020.00242BibTeX
@inproceedings{cai2020cvpr-rethinking,
title = {{Rethinking Differentiable Search for Mixed-Precision Neural Networks}},
author = {Cai, Zhaowei and Vasconcelos, Nuno},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2020},
doi = {10.1109/CVPR42600.2020.00242},
url = {https://mlanthology.org/cvpr/2020/cai2020cvpr-rethinking/}
}