Large-Margin Contrastive Learning with Distance Polarization Regularizer

Abstract

\emph{Contrastive learning} (CL) pretrains models in a pairwise manner, where given a data point, other data points are all regarded as dissimilar, including some that are \emph{semantically} similar. The issue has been addressed by properly weighting similar and dissimilar pairs as in \emph{positive-unlabeled learning}, so that the objective of CL is \emph{unbiased} and CL is \emph{consistent}. However, in this paper, we argue that this great solution is still not enough: its weighted objective \emph{hides} the issue where the semantically similar pairs are still pushed away; as CL is pretraining, this phenomenon is not our desideratum and might affect downstream tasks. To this end, we propose \emph{large-margin contrastive learning} (LMCL) with \emph{distance polarization regularizer}, motivated by the distribution characteristic of pairwise distances in \emph{metric learning}. In LMCL, we can distinguish between \emph{intra-cluster} and \emph{inter-cluster} pairs, and then only push away inter-cluster pairs, which \emph{solves} the above issue explicitly. Theoretically, we prove a tighter error bound for LMCL; empirically, the superiority of LMCL is demonstrated across multiple domains, \emph{i.e.}, image classification, sentence representation, and reinforcement learning.

Cite

Text

Chen et al. "Large-Margin Contrastive Learning with Distance Polarization Regularizer." International Conference on Machine Learning, 2021.

Markdown

[Chen et al. "Large-Margin Contrastive Learning with Distance Polarization Regularizer." International Conference on Machine Learning, 2021.](https://mlanthology.org/icml/2021/chen2021icml-largemargin/)

BibTeX

@inproceedings{chen2021icml-largemargin,
  title     = {{Large-Margin Contrastive Learning with Distance Polarization Regularizer}},
  author    = {Chen, Shuo and Niu, Gang and Gong, Chen and Li, Jun and Yang, Jian and Sugiyama, Masashi},
  booktitle = {International Conference on Machine Learning},
  year      = {2021},
  pages     = {1673-1683},
  volume    = {139},
  url       = {https://mlanthology.org/icml/2021/chen2021icml-largemargin/}
}