Neural Network Compression Using Higher-Order Statistics and Auxiliary Reconstruction Losses
Abstract
In this paper, the problem of pruning and compressing the weights of various layers of deep neural networks is investigated. The proposed method aims to remove redundant filters from the network to reduce computational complexity and storage requirements, while improving the performance of the original network. More specifically, a novel filter selection criterion is introduced based on the fact that filters whose weights follow a Gaussian distribution correspond to hidden units that do not capture important aspects of data. To this end, Higher Order Statistics (HOS) are used and filters with low cumulant values that do not deviate significantly from Gaussian distribution are identified and removed from the network. In addition, a novel pruning strategy is proposed aiming to decide on the pruning ratio of each layer using the Shapiro-Wilk normality test. The use of auxiliary MSE losses (intermediate and after the softmax layer) during the fine-tuning phase further improves the overall performance of the compressed network. Extensive experiments with different network architectures and comparison with state-of-the-art approaches on well-known public datasets, such as CIFAR-10, CIFAR-100 and ILSCVR-12, demonstrate the great potential of the proposed approach.
Cite
Text
Chatzikonstantinou et al. "Neural Network Compression Using Higher-Order Statistics and Auxiliary Reconstruction Losses." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020. doi:10.1109/CVPRW50498.2020.00366Markdown
[Chatzikonstantinou et al. "Neural Network Compression Using Higher-Order Statistics and Auxiliary Reconstruction Losses." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020.](https://mlanthology.org/cvprw/2020/chatzikonstantinou2020cvprw-neural/) doi:10.1109/CVPRW50498.2020.00366BibTeX
@inproceedings{chatzikonstantinou2020cvprw-neural,
title = {{Neural Network Compression Using Higher-Order Statistics and Auxiliary Reconstruction Losses}},
author = {Chatzikonstantinou, Christos and Papadopoulos, Georgios Th. and Dimitropoulos, Kosmas and Daras, Petros},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2020},
pages = {3077-3086},
doi = {10.1109/CVPRW50498.2020.00366},
url = {https://mlanthology.org/cvprw/2020/chatzikonstantinou2020cvprw-neural/}
}