Euclidean Embedding of Co-Occurrence Data

Abstract

Embedding algorithms search for low dimensional structure in complex data, but most algorithms only handle objects of a single type for which pairwise distances are specified. This paper describes a method for em- bedding objects of different types, such as images and text, into a single common Euclidean space based on their co-occurrence statistics. The joint distributions are modeled as exponentials of Euclidean distances in the low-dimensional embedding space, which links the problem to con- vex optimization over positive semidefinite matrices. The local struc- ture of our embedding corresponds to the statistical correlations via ran- dom walks in the Euclidean space. We quantify the performance of our method on two text datasets, and show that it consistently and signifi- cantly outperforms standard methods of statistical correspondence mod- eling, such as multidimensional scaling and correspondence analysis.

Cite

Text

Globerson et al. "Euclidean Embedding of Co-Occurrence Data." Neural Information Processing Systems, 2004.

Markdown

[Globerson et al. "Euclidean Embedding of Co-Occurrence Data." Neural Information Processing Systems, 2004.](https://mlanthology.org/neurips/2004/globerson2004neurips-euclidean/)

BibTeX

@inproceedings{globerson2004neurips-euclidean,
  title     = {{Euclidean Embedding of Co-Occurrence Data}},
  author    = {Globerson, Amir and Chechik, Gal and Pereira, Fernando and Tishby, Naftali},
  booktitle = {Neural Information Processing Systems},
  year      = {2004},
  pages     = {497-504},
  url       = {https://mlanthology.org/neurips/2004/globerson2004neurips-euclidean/}
}