PCA++: How Uniformity Induces Robustness to Background Noise in Contrastive Learning

Abstract

High-dimensional data often conceal low-dimensional signals beneath structured background noise, limiting standard PCA. Motivated by contrastive learning, we address the problem of recovering shared signal subspaces from positive pairs--paired observations sharing the same signal but differing in background. Our baseline, PCA+, uses alignment-only contrastive learning and succeeds when background variation is mild, but fails under strong noise or high-dimensional regimes. To address this, we introduce PCA++, a hard uniformity-constrained contrastive PCA that enforces identity covariance on projected features. PCA++ has a closed-form solution via a generalized eigenproblem, remains stable in high dimensions, and provably regularizes against background interference. We provide exact high-dimensional asymptotics in both fixed-aspect-ratio and growing-spike regimes, showing uniformity’s role in robust signal recovery. Empirically, PCA++ outperforms standard PCA and alignment-only PCA+ on simulations, corrupted-MNIST, and single-cell transcriptomics, reliably recovering condition-invariant structure. More broadly, we clarify uniformity’s role in contrastive learning—showing that explicit feature dispersion defends against structured noise and enhances robustness.

Cite

Text

Wu et al. "PCA++: How Uniformity Induces Robustness to Background Noise in Contrastive Learning." Advances in Neural Information Processing Systems, 2025.

Markdown

[Wu et al. "PCA++: How Uniformity Induces Robustness to Background Noise in Contrastive Learning." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/wu2025neurips-pca/)

BibTeX

@inproceedings{wu2025neurips-pca,
  title     = {{PCA++: How Uniformity Induces Robustness to Background Noise in Contrastive Learning}},
  author    = {Wu, Mingqi and Sun, Qiang and Yang, Archer Y.},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/wu2025neurips-pca/}
}