Geometric Structure of Graph Laplacian Embeddings
Abstract
We analyze the spectral clustering procedure for identifying coarse structure in a data set $\mathbf{x}_1, \dots, \mathbf{x}_n$, and in particular study the geometry of graph Laplacian embeddings which form the basis for spectral clustering algorithms. More precisely, we assume that the data are sampled from a mixture model supported on a manifold $\mathcal{M}$ embedded in $\mathbb{R}^d$, and pick a connectivity length-scale $\varepsilon>0$ to construct a kernelized graph Laplacian. We introduce a notion of a well-separated mixture model which only depends on the model itself, and prove that when the model is well separated, with high probability the embedded data set concentrates on cones that are centered around orthogonal vectors. Our results are meaningful in the regime where $\varepsilon = \varepsilon(n)$ is allowed to decay to zero at a slow enough rate as the number of data points grows. This rate depends on the intrinsic dimension of the manifold on which the data is supported.
Cite
Text
Trillos et al. "Geometric Structure of Graph Laplacian Embeddings." Journal of Machine Learning Research, 2021.Markdown
[Trillos et al. "Geometric Structure of Graph Laplacian Embeddings." Journal of Machine Learning Research, 2021.](https://mlanthology.org/jmlr/2021/trillos2021jmlr-geometric/)BibTeX
@article{trillos2021jmlr-geometric,
title = {{Geometric Structure of Graph Laplacian Embeddings}},
author = {Trillos, Nicolás García and Hoffmann, Franca and Hosseini, Bamdad},
journal = {Journal of Machine Learning Research},
year = {2021},
pages = {1-55},
volume = {22},
url = {https://mlanthology.org/jmlr/2021/trillos2021jmlr-geometric/}
}