A Probabilistic Model Behind Self-Supervised Learning

Abstract

In self-supervised learning (SSL), representations are learned via an auxiliary task without annotated labels. A common task is to classify augmentations or different modalities of the data, which share semantic _content_ (e.g. an object in an image) but differ in _style_ (e.g. the object's location). Many approaches to self-supervised learning have been proposed, e.g. SimCLR, CLIP and VICReg, which have recently gained much attention because their representations achieve downstream performance comparable to supervised learning. However, a theoretical understanding of the mechanism behind self-supervised methods remains elusive. Addressing this, we present a generative latent variable model for self-supervised learning and show that several families of discriminative SSL, including contrastive methods, induce a comparable distribution over representations, providing a unifying theoretical framework for these methods. The proposed model also justifies connections drawn to mutual information and the use of a "projection head". Learning representations by fitting the model generatively (termed SimVAE) improves performance over discriminative and other VAE-based methods on simple image benchmarks and significantly narrows the gap between generative and discriminative representation learning in more complex settings. Importantly, as our analysis predicts, SimVAE outperforms self-supervised learning where style information is required, taking an important step toward understanding self-supervised methods and achieving task-agnostic representations.

Cite

Text

Bizeul et al. "A Probabilistic Model Behind Self-Supervised Learning." Transactions on Machine Learning Research, 2024.

Markdown

[Bizeul et al. "A Probabilistic Model Behind Self-Supervised Learning." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/bizeul2024tmlr-probabilistic/)

BibTeX

@article{bizeul2024tmlr-probabilistic,
  title     = {{A Probabilistic Model Behind Self-Supervised Learning}},
  author    = {Bizeul, Alice and Schölkopf, Bernhard and Allen, Carl},
  journal   = {Transactions on Machine Learning Research},
  year      = {2024},
  url       = {https://mlanthology.org/tmlr/2024/bizeul2024tmlr-probabilistic/}
}