High-Capacity Expert Binary Networks

Abstract

Network binarization is a promising hardware-aware direction for creating efficient deep models. Despite its memory and computational advantages, reducing the accuracy gap between binary models and their real-valued counterparts remains an unsolved challenging research problem. To this end, we make the following 3 contributions: (a) To increase model capacity, we propose Expert Binary Convolution, which, for the first time, tailors conditional computing to binary networks by learning to select one data-specific expert binary filter at a time conditioned on input features. (b) To increase representation capacity, we propose to address the inherent information bottleneck in binary networks by introducing an efficient width expansion mechanism which keeps the binary operations within the same budget. (c) To improve network design, we propose a principled binary network growth mechanism that unveils a set of network topologies of favorable properties. Overall, our method improves upon prior work, with no increase in computational cost, by $\sim6 \%$, reaching a groundbreaking $\sim 71\%$ on ImageNet classification. Code will be made available $\href{https://www.adrianbulat.com/binary-networks}{here}$.

Cite

Text

Bulat et al. "High-Capacity Expert Binary Networks." International Conference on Learning Representations, 2021.

Markdown

[Bulat et al. "High-Capacity Expert Binary Networks." International Conference on Learning Representations, 2021.](https://mlanthology.org/iclr/2021/bulat2021iclr-highcapacity/)

BibTeX

@inproceedings{bulat2021iclr-highcapacity,
  title     = {{High-Capacity Expert Binary Networks}},
  author    = {Bulat, Adrian and Martinez, Brais and Tzimiropoulos, Georgios},
  booktitle = {International Conference on Learning Representations},
  year      = {2021},
  url       = {https://mlanthology.org/iclr/2021/bulat2021iclr-highcapacity/}
}