Why Regularized Auto-Encoders Learn Sparse Representation?

Abstract

Sparse distributed representation is the key to learning useful features in deep learning algorithms, because not only it is an efficient mode of data representation, but also – more importantly – it captures the generation process of most real world data. While a number of regularized auto-encoders (AE) enforce sparsity explicitly in their learned representation and others don’t, there has been little formal analysis on what encourages sparsity in these models in general. Our objective is to formally study this general problem for regularized auto-encoders. We provide sufficient conditions on both regularization and activation functions that encourage sparsity. We show that multiple popular models (de-noising and contractive auto encoders, e.g.) and activations (rectified linear and sigmoid, e.g.) satisfy these conditions; thus, our conditions help explain sparsity in their learned representation. Thus our theoretical and empirical analysis together shed light on the properties of regularization/activation that are conductive to sparsity and unify a number of existing auto-encoder models and activation functions under the same analytical framework.

Cite

Text

Arpit et al. "Why Regularized Auto-Encoders Learn Sparse Representation?." International Conference on Machine Learning, 2016.

Markdown

[Arpit et al. "Why Regularized Auto-Encoders Learn Sparse Representation?." International Conference on Machine Learning, 2016.](https://mlanthology.org/icml/2016/arpit2016icml-regularized/)

BibTeX

@inproceedings{arpit2016icml-regularized,
  title     = {{Why Regularized Auto-Encoders Learn Sparse Representation?}},
  author    = {Arpit, Devansh and Zhou, Yingbo and Ngo, Hung and Govindaraju, Venu},
  booktitle = {International Conference on Machine Learning},
  year      = {2016},
  pages     = {136-144},
  volume    = {48},
  url       = {https://mlanthology.org/icml/2016/arpit2016icml-regularized/}
}