Building Efficient Deep Neural Networks with Unitary Group Convolutions

Abstract

We propose unitary group convolutions (UGConvs), a building block for CNNs that composes a group convolution with unitary transforms in feature space to learn a richer set of representations than group convolution alone. UGConvs generalize two disparate ideas in CNN architecture, channel shuffling (i.e., ShuffleNet) and block-circulant networks (i.e., CirCNN), and provide unifying insights that lead to a deeper understanding of each technique. We experimentally demonstrate that dense unitary transforms can outperform channel shuffling in DNN accuracy. On the other hand, different dense transforms exhibit comparable accuracy. Based on these observations, we propose HadaNet, a UGConv network using Hadamard transforms. HadaNets achieve similar accuracy to circulant networks with lower computational complexity, and better accuracy than ShuffleNets with the same number of parameters and floating-point multiplies.
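The composition described in the abstract can be sketched in NumPy for the simplest case, a 1x1 (pointwise) group convolution acting on a single channel vector. This is an illustrative sketch, not the authors' implementation: the function names (`hadamard`, `shuffle_matrix`, `group_pointwise`, `ugconv_pointwise`) are hypothetical, and placing one transform before and one after the group convolution is an assumption about how the composition is arranged. The Hadamard matrix is built densely for clarity; a fast Walsh-Hadamard transform would compute the same result using only additions and subtractions.

```python
import numpy as np

def hadamard(n):
    """Normalized n x n Hadamard matrix (Sylvester construction); n must be a power of two."""
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = np.array([[1.0]])
    while h.shape[0] < n:
        h = np.block([[h, h], [h, -h]])
    return h / np.sqrt(n)  # scaling makes the matrix orthogonal (real unitary)

def shuffle_matrix(channels, groups):
    """Permutation matrix equivalent to a ShuffleNet-style channel shuffle."""
    perm = np.arange(channels).reshape(groups, -1).T.reshape(-1)
    return np.eye(channels)[perm]

def group_pointwise(x, weights):
    """1x1 group convolution on a channel vector x.

    weights has shape (groups, out_per_group, in_per_group), i.e. one small
    dense matrix per group, which together form a block-diagonal matrix.
    """
    groups, _, in_per_group = weights.shape
    xg = x.reshape(groups, in_per_group)
    return np.einsum("goi,gi->go", weights, xg).reshape(-1)

def ugconv_pointwise(x, weights, transform=hadamard):
    """Assumed arrangement: unitary transform, group convolution, unitary transform."""
    y = group_pointwise(transform(x.shape[0]) @ x, weights)
    return transform(y.shape[0]) @ y

rng = np.random.default_rng(0)
w = rng.standard_normal((2, 4, 4))   # 2 groups, 4 channels in and out per group
x = rng.standard_normal(8)           # 8 input channels
print(ugconv_pointwise(x, w).shape)  # (8,)
```

A channel shuffle fits the same template because a permutation matrix is also unitary: `shuffle_matrix(8, 2) @ x` reorders channels without mixing them, whereas the Hadamard transform mixes every channel into every output.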

Cite

Text

Zhao et al. "Building Efficient Deep Neural Networks with Unitary Group Convolutions." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.01156

Markdown

[Zhao et al. "Building Efficient Deep Neural Networks with Unitary Group Convolutions." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/zhao2019cvpr-building/) doi:10.1109/CVPR.2019.01156

BibTeX

@inproceedings{zhao2019cvpr-building,
  title     = {{Building Efficient Deep Neural Networks with Unitary Group Convolutions}},
  author    = {Zhao, Ritchie and Hu, Yuwei and Dotzel, Jordan and De Sa, Christopher and Zhang, Zhiru},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2019},
  doi       = {10.1109/CVPR.2019.01156},
  url       = {https://mlanthology.org/cvpr/2019/zhao2019cvpr-building/}
}