Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles

Abstract

We propose a novel unsupervised learning approach to build features suitable for object detection and classification. The features are pre-trained on a large dataset without human annotation and later transferred via fine-tuning on a different, smaller and labeled dataset. The pre-training consists of solving jigsaw puzzles of natural images. To facilitate the transfer of features to other tasks, we introduce the context-free network (CFN), a siamese-ennead convolutional neural network. The features correspond to the columns of the CFN and they process image tiles independently (i.e., free of context). The later layers of the CFN then use the features to identify their geometric arrangement. Our experimental evaluations show that the learned features capture semantically relevant content. We pre-train the CFN on the training set of the ILSVRC2012 dataset and transfer the features on the combined training and validation set of Pascal VOC 2007 for object detection (via fast RCNN) and classification. These features outperform all current unsupervised features with \(51.8\,\%\) for detection and \(68.6\,\%\) for classification, and reduce the gap with supervised learning (\(56.5\,\%\) and \(78.2\,\%\) respectively).

Cite

Text

Noroozi and Favaro. "Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-46466-4_5

Markdown

[Noroozi and Favaro. "Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/noroozi2016eccv-unsupervised/) doi:10.1007/978-3-319-46466-4_5

BibTeX

@inproceedings{noroozi2016eccv-unsupervised,
  title     = {{Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles}},
  author    = {Noroozi, Mehdi and Favaro, Paolo},
  booktitle = {European Conference on Computer Vision},
  year      = {2016},
  pages     = {69-84},
  doi       = {10.1007/978-3-319-46466-4_5},
  url       = {https://mlanthology.org/eccv/2016/noroozi2016eccv-unsupervised/}
}