A Probabilistic Graph Coupling View of Dimension Reduction

Abstract

Most popular dimension reduction (DR) methods like t-SNE and UMAP are based on minimizing a cost between input and latent pairwise similarities. Though widely used, these approaches lack clear probabilistic foundations to enable a full understanding of their properties and limitations. To that extent, we introduce a unifying statistical framework based on the coupling of hidden graphs using cross entropy. These graphs induce a Markov random field dependency structure among the observations in both input and latent spaces. We show that existing pairwise similarity DR methods can be retrieved from our framework with particular choices of priors for the graphs. Moreover this reveals that these methods relying on shift-invariant kernels suffer from a statistical degeneracy that explains poor performances in conserving coarse-grain dependencies. New links are drawn with PCA which appears as a non-degenerate graph coupling model.

Cite

Text

Van Assel et al. "A Probabilistic Graph Coupling View of Dimension Reduction." Neural Information Processing Systems, 2022.

Markdown

[Van Assel et al. "A Probabilistic Graph Coupling View of Dimension Reduction." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/assel2022neurips-probabilistic/)

BibTeX

@inproceedings{assel2022neurips-probabilistic,
  title     = {{A Probabilistic Graph Coupling View of Dimension Reduction}},
  author    = {Van Assel, Hugues and Espinasse, Thibault and Chiquet, Julien and Picard, Franck},
  booktitle = {Neural Information Processing Systems},
  year      = {2022},
  url       = {https://mlanthology.org/neurips/2022/assel2022neurips-probabilistic/}
}