Spectral Leakage and Rethinking the Kernel Size in CNNs

Abstract

Convolutional layers in CNNs implement linear filters which decompose the input into different frequency bands. However, most modern architectures neglect standard principles of filter design when optimizing their model choices regarding the size and shape of the convolutional kernel. In this work, we consider the well-known problem of spectral leakage caused by windowing artifacts in filtering operations in the context of CNNs. We show that the small size of CNN kernels make them susceptible to spectral leakage, which may induce performance-degrading artifacts. To address this issue, we propose the use of larger kernel sizes along with the Hamming window function to alleviate leakage in CNN architectures. We demonstrate improved classification accuracy on multiple benchmark datasets including Fashion-MNIST, CIFAR-10, CIFAR-100 and ImageNet with the simple use of a standard window function in convolutional layers. Finally, we show that CNNs employing the Hamming window display increased robustness against various adversarial attacks.

Cite

Text

Tomen and van Gemert. "Spectral Leakage and Rethinking the Kernel Size in CNNs." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00509

Markdown

[Tomen and van Gemert. "Spectral Leakage and Rethinking the Kernel Size in CNNs." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/tomen2021iccv-spectral/) doi:10.1109/ICCV48922.2021.00509

BibTeX

@inproceedings{tomen2021iccv-spectral,
  title     = {{Spectral Leakage and Rethinking the Kernel Size in CNNs}},
  author    = {Tomen, Nergis and van Gemert, Jan C.},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {5138-5147},
  doi       = {10.1109/ICCV48922.2021.00509},
  url       = {https://mlanthology.org/iccv/2021/tomen2021iccv-spectral/}
}