Max-Margin Contrastive Learning
Abstract
Standard contrastive learning approaches usually require a large number of negatives for effective unsupervised learning and often exhibit slow convergence. We suspect this behavior is due to the suboptimal selection of negatives used for offering contrast to the positives. We counter this difficulty by taking inspiration from support vector machines (SVMs) to present max-margin contrastive learning (MMCL). Our approach selects negatives as the sparse support vectors obtained via a quadratic optimization problem, and contrastiveness is enforced by maximizing the decision margin. As SVM optimization can be computationally demanding, especially in an end-to-end setting, we present simplifications that alleviate the computational burden. We validate our approach on standard vision benchmark datasets, demonstrating better performance in unsupervised representation learning over state-of-the-art, while having better empirical convergence properties.
Cite
Text
Shah et al. "Max-Margin Contrastive Learning." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I8.20796Markdown
[Shah et al. "Max-Margin Contrastive Learning." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/shah2022aaai-max/) doi:10.1609/AAAI.V36I8.20796BibTeX
@inproceedings{shah2022aaai-max,
title = {{Max-Margin Contrastive Learning}},
author = {Shah, Anshul and Sra, Suvrit and Chellappa, Rama and Cherian, Anoop},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2022},
pages = {8220-8230},
doi = {10.1609/AAAI.V36I8.20796},
url = {https://mlanthology.org/aaai/2022/shah2022aaai-max/}
}