UniCon: Unified Framework for Efficient Contrastive Alignment via Kernels

Abstract

Contrastive objectives power state-of-the-art multimodal models, but their training remains slow, relying on long stochastic optimization. We propose a Unified Framework for Efficient Contrastive Alignment via Kernels (UniCon), which spans linear and nonlinear encoders as well as one-to-one and many-to-many alignments. At its core, UniCon introduces the contrastive similarity weight matrix $S(\gamma)$, which enables closed-form global solutions that provably replace minibatch back-propagation with exact updates. Through the lens of reproducing kernel Hilbert spaces (RKHS), UniCon provides a kernelized perspective that unifies contrastive alignment and reveals its connection to spectral methods. To validate the theory, we conduct experiments on synthetic, unimodal, multimodal, and zero-shot tasks, demonstrating that UniCon achieves substantial efficiency gains while preserving generality and strong empirical performance.

Cite

Text

Sui et al. "UniCon: Unified Framework for Efficient Contrastive Alignment via Kernels." International Conference on Learning Representations, 2026.

Markdown

[Sui et al. "UniCon: Unified Framework for Efficient Contrastive Alignment via Kernels." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/sui2026iclr-unicon/)

BibTeX

@inproceedings{sui2026iclr-unicon,
  title     = {{UniCon: Unified Framework for Efficient Contrastive Alignment via Kernels}},
  author    = {Sui, Hangke and Wang, Yuqing and Do, Minh N.},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/sui2026iclr-unicon/}
}