ProxyBNN: Learning Binarized Neural Networks via Proxy Matrices
Abstract
Training Binarized Neural Networks (BNNs) is challenging due to the discreteness. In order to efficiently optimize BNNs through backward propagations, real-valued auxiliary variables are commonly used to accumulate gradient updates. Those auxiliary variables are then directly quantized to binary weights in the forward pass, which brings about large quantization errors. In this paper, by introducing an appropriate proxy matrix, we reduce the weights quantization error while circumventing explicit binary regularizations on the full-precision auxiliary variables. Specifically, we regard pre-binarization weights as a linear combination of the basis vectors. The matrix composed of basis vectors is referred to as the proxy matrix, and auxiliary variables serve as the coefficients of this linear combination. We are the first to empirically identify and study the effectiveness of learning both basis and coefficients to construct the pre-binarization weights. This new proxy learning contributes to new leading performances on benchmark datasets.
Cite
Text
He et al. "ProxyBNN: Learning Binarized Neural Networks via Proxy Matrices." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58580-8_14Markdown
[He et al. "ProxyBNN: Learning Binarized Neural Networks via Proxy Matrices." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/he2020eccv-proxybnn/) doi:10.1007/978-3-030-58580-8_14BibTeX
@inproceedings{he2020eccv-proxybnn,
title = {{ProxyBNN: Learning Binarized Neural Networks via Proxy Matrices}},
author = {He, Xiangyu and Mo, Zitao and Cheng, Ke and Xu, Weixiang and Hu, Qinghao and Wang, Peisong and Liu, Qingshan and Cheng, Jian},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2020},
doi = {10.1007/978-3-030-58580-8_14},
url = {https://mlanthology.org/eccv/2020/he2020eccv-proxybnn/}
}