Mini-Batch Optimization of Contrastive Loss

Kartik Sreenivasan, Keon Lee, Jeong-Gwan Lee, Anna Lee, Jaewoong Cho, Jy-yong Sohn, Dimitris Papailiopoulos, Kangwook Lee

ICLRW 2023

/iclrw/2023/sreenivasan2023iclrw-minibatch/

Abstract

In this paper, we study the effect of mini-batch selection on contrastive loss and propose new mini-batch selection methods to improve efficiency. Theoretically, we show that both the full-batch and mini-batch settings share the same solution, the simplex Equiangular Tight Frame (ETF), if all $\binom{N}{B}$ mini-batches are seen during training. However, when not all possible batches are seen, mini-batch training can lead to suboptimal solutions. To address this issue, we propose efficient mini-batch selection methods that compare favorably with existing methods. Our experimental results demonstrate the effectiveness of our proposed methods in finding a near-optimal solution with a reduced number of gradient steps and outperforming existing mini-batch selection methods.

PDF ICLRW OpenReview Semantic Scholar

Cite

Text

Sreenivasan et al. "Mini-Batch Optimization of Contrastive Loss." ICLR 2023 Workshops: ME-FoMo, 2023.

Markdown

[Sreenivasan et al. "Mini-Batch Optimization of Contrastive Loss." ICLR 2023 Workshops: ME-FoMo, 2023.](https://mlanthology.org/iclrw/2023/sreenivasan2023iclrw-minibatch/)

BibTeX

@inproceedings{sreenivasan2023iclrw-minibatch,
  title     = {{Mini-Batch Optimization of Contrastive Loss}},
  author    = {Sreenivasan, Kartik and Lee, Keon and Lee, Jeong-Gwan and Lee, Anna and Cho, Jaewoong and Sohn, Jy-yong and Papailiopoulos, Dimitris and Lee, Kangwook},
  booktitle = {ICLR 2023 Workshops: ME-FoMo},
  year      = {2023},
  url       = {https://mlanthology.org/iclrw/2023/sreenivasan2023iclrw-minibatch/}
}