Building High-Level Features Using Large Scale Unsupervised Learning

Abstract

We consider the problem of building high-level, class-specific feature detectors from only unlabeled data. For example, is it possible to learn a face detector using only unlabeled images? To answer this, we train a deep sparse autoencoder on a large dataset of images (the model has 1 billion connections, the dataset has 10 million 200×200 pixel images downloaded from the Internet). We train this network using model parallelism and asynchronous SGD on a cluster with 1,000 machines (16,000 cores) for three days. Contrary to what appears to be a widely-held intuition, our experimental results reveal that it is possible to train a face detector without having to label images as containing a face or not. Control experiments show that this feature detector is robust not only to translation but also to scaling and out-of-plane rotation. We also find that the same network is sensitive to other high-level concepts such as cat faces and human bodies. Starting from these learned features, we trained our network to recognize 22,000 object categories from ImageNet and achieve a leap of 70% relative improvement over the previous state-of-the-art.

Cite

Text

Le et al. "Building High-Level Features Using Large Scale Unsupervised Learning." International Conference on Machine Learning, 2012. doi:10.1109/ICASSP.2013.6639343

Markdown

[Le et al. "Building High-Level Features Using Large Scale Unsupervised Learning." International Conference on Machine Learning, 2012.](https://mlanthology.org/icml/2012/le2012icml-building/) doi:10.1109/ICASSP.2013.6639343

BibTeX

@inproceedings{le2012icml-building,
  title     = {{Building High-Level Features Using Large Scale Unsupervised Learning}},
  author    = {Le, Quoc V. and Ranzato, Marc'Aurelio and Monga, Rajat and Devin, Matthieu and Corrado, Greg and Chen, Kai and Dean, Jeffrey and Ng, Andrew Y.},
  booktitle = {International Conference on Machine Learning},
  year      = {2012},
  doi       = {10.1109/ICASSP.2013.6639343},
  url       = {https://mlanthology.org/icml/2012/le2012icml-building/}
}