Euclidean Embedding of Co-Occurrence Data
Abstract
Embedding algorithms search for low dimensional structure in complex data, but most algorithms only handle objects of a single type for which pairwise distances are specified. This paper describes a method for em- bedding objects of different types, such as images and text, into a single common Euclidean space based on their co-occurrence statistics. The joint distributions are modeled as exponentials of Euclidean distances in the low-dimensional embedding space, which links the problem to con- vex optimization over positive semidefinite matrices. The local struc- ture of our embedding corresponds to the statistical correlations via ran- dom walks in the Euclidean space. We quantify the performance of our method on two text datasets, and show that it consistently and signifi- cantly outperforms standard methods of statistical correspondence mod- eling, such as multidimensional scaling and correspondence analysis.
Cite
Text
Globerson et al. "Euclidean Embedding of Co-Occurrence Data." Neural Information Processing Systems, 2004.Markdown
[Globerson et al. "Euclidean Embedding of Co-Occurrence Data." Neural Information Processing Systems, 2004.](https://mlanthology.org/neurips/2004/globerson2004neurips-euclidean/)BibTeX
@inproceedings{globerson2004neurips-euclidean,
title = {{Euclidean Embedding of Co-Occurrence Data}},
author = {Globerson, Amir and Chechik, Gal and Pereira, Fernando and Tishby, Naftali},
booktitle = {Neural Information Processing Systems},
year = {2004},
pages = {497-504},
url = {https://mlanthology.org/neurips/2004/globerson2004neurips-euclidean/}
}