Local Regularizer Improves Generalization

Abstract

Regularization plays an important role in generalization of deep learning. In this paper, we study the generalization power of an unbiased regularizor for training algorithms in deep learning. We focus on training methods called Locally Regularized Stochastic Gradient Descent (LRSGD). An LRSGD leverages a proximal type penalty in gradient descent steps to regularize SGD in training. We show that by carefully choosing relevant parameters, LRSGD generalizes better than SGD. Our thorough theoretical analysis is supported by experimental evidence. It advances our theoretical understanding of deep learning and provides new perspectives on designing training algorithms. The code is available at https://github.com/huiqu18/LRSGD.

Cite

Text

Zhang et al. "Local Regularizer Improves Generalization." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I04.6167

Markdown

[Zhang et al. "Local Regularizer Improves Generalization." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/zhang2020aaai-local/) doi:10.1609/AAAI.V34I04.6167

BibTeX

@inproceedings{zhang2020aaai-local,
  title     = {{Local Regularizer Improves Generalization}},
  author    = {Zhang, Yikai and Qu, Hui and Metaxas, Dimitris N. and Chen, Chao},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {6861-6868},
  doi       = {10.1609/AAAI.V34I04.6167},
  url       = {https://mlanthology.org/aaai/2020/zhang2020aaai-local/}
}