Revisiting Batch Norm Initialization

Davis, Jim; Frank, Logan

doi:10.1007/978-3-031-19803-8_13

Revisiting Batch Norm Initialization

Jim Davis, Logan Frank

ECCV 2022

doi:10.1007/978-3-031-19803-8_13 /eccv/2022/davis2022eccv-revisiting/

Abstract

Batch normalization (BN) is comprised of a normalization component followed by an affine transformation and has become essential for training deep neural networks. Standard initialization of each BN in a network sets the affine transformation scale and shift to 1 and 0, respectively. However, after training we have observed that these parameters do not alter much from their initialization. Furthermore, we have noticed that the normalization process can still yield overly large values, which is undesirable for training. We revisit the BN formulation and present a new initialization method and update approach for BN to address the aforementioned issues. Experiments are designed to emphasize and demonstrate the positive influence of proper BN scale initialization on performance, and use rigorous statistical significance tests for evaluation. The approach can be used with existing implementations at no additional computational cost. Source code is available at https://github.com/osu-cvl/revisiting-bn-init.

PDF ECCV Semantic Scholar

Cite

Text

Davis and Frank. "Revisiting Batch Norm Initialization." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19803-8_13

Markdown

[Davis and Frank. "Revisiting Batch Norm Initialization." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/davis2022eccv-revisiting/) doi:10.1007/978-3-031-19803-8_13

BibTeX

@inproceedings{davis2022eccv-revisiting,
  title     = {{Revisiting Batch Norm Initialization}},
  author    = {Davis, Jim and Frank, Logan},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19803-8_13},
  url       = {https://mlanthology.org/eccv/2022/davis2022eccv-revisiting/}
}