A Probabilistic Model Behind Self- Supervised Learning
Abstract
In self-supervised learning (SSL), representations are learned via an auxiliary task without annotated labels. A common task is to classify augmentations or different modalities of the data, which share semantic _content_ (e.g. an object in an image) but differ in _style_ (e.g. the object's location). Many approaches to self-supervised learning have been proposed, e.g. SimCLR, CLIP and VicREG, which have recently gained much attention for their representations achieving downstream performance comparable to supervised learning. However, a theoretical understanding of the mechanism behind self-supervised methods eludes. Addressing this, we present a generative latent variable model for self-supervised learning and show that several families of discriminative SSL, including contrastive methods, induce a comparable distribution over representations, providing a unifying theoretical framework for these methods. The proposed model also justifies connections drawn to mutual information and the use of a ``projection head''. Learning representations by fitting the model generatively (termed SimVAE) improves performance over discriminative and other VAE-based methods on simple image benchmarks and significantly narrows the gap between generative and discriminative representation learning in more complex settings. Importantly, as our analysis predicts, SimVAE outperforms self-supervised learning where style information is required, taking an important step toward understanding self-supervised methods and achieving task-agnostic representations.
Cite
Text
Bizeul et al. "A Probabilistic Model Behind Self- Supervised Learning." Transactions on Machine Learning Research, 2024.Markdown
[Bizeul et al. "A Probabilistic Model Behind Self- Supervised Learning." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/bizeul2024tmlr-probabilistic/)BibTeX
@article{bizeul2024tmlr-probabilistic,
title = {{A Probabilistic Model Behind Self- Supervised Learning}},
author = {Bizeul, Alice and Schölkopf, Bernhard and Allen, Carl},
journal = {Transactions on Machine Learning Research},
year = {2024},
url = {https://mlanthology.org/tmlr/2024/bizeul2024tmlr-probabilistic/}
}