Max-Margin Contrastive Learning

Abstract

Standard contrastive learning approaches usually require a large number of negatives for effective unsupervised learning and often exhibit slow convergence. We suspect this behavior is due to the suboptimal selection of negatives used for offering contrast to the positives. We counter this difficulty by taking inspiration from support vector machines (SVMs) to present max-margin contrastive learning (MMCL). Our approach selects negatives as the sparse support vectors obtained via a quadratic optimization problem, and contrastiveness is enforced by maximizing the decision margin. As SVM optimization can be computationally demanding, especially in an end-to-end setting, we present simplifications that alleviate the computational burden. We validate our approach on standard vision benchmark datasets, demonstrating better performance in unsupervised representation learning over state-of-the-art, while having better empirical convergence properties.

Cite

Text

Shah et al. "Max-Margin Contrastive Learning." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I8.20796

Markdown

[Shah et al. "Max-Margin Contrastive Learning." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/shah2022aaai-max/) doi:10.1609/AAAI.V36I8.20796

BibTeX

@inproceedings{shah2022aaai-max,
  title     = {{Max-Margin Contrastive Learning}},
  author    = {Shah, Anshul and Sra, Suvrit and Chellappa, Rama and Cherian, Anoop},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {8220-8230},
  doi       = {10.1609/AAAI.V36I8.20796},
  url       = {https://mlanthology.org/aaai/2022/shah2022aaai-max/}
}