A Statistical Mechanics Analysis of Gram Matrix Eigenvalue Spectra

Abstract

The Gram matrix plays a central role in many kernel methods. Knowledge about the distribution of eigenvalues of the Gram matrix is useful for developing appropriate model selection methods for kernel PCA. We use methods adapted from the statistical physics of classical fluids in order to study the averaged spectrum of the Gram matrix. We focus in particular on a variational mean-field theory and related diagrammatic approach. We show that the mean-field theory correctly reproduces previously obtained asymptotic results for standard PCA. Comparison with simulations for data distributed uniformly on the sphere shows that the method provides a good qualitative approximation to the averaged spectrum for kernel PCA with a Gaussian Radial Basis Function kernel. We also develop an analytical approximation to the spectral density that agrees closely with the numerical solution and provides insight into the number of samples required to resolve the corresponding process eigenvalues of a given order.

Cite

Text

Hoyle and Rattray. "A Statistical Mechanics Analysis of Gram Matrix Eigenvalue Spectra." Annual Conference on Computational Learning Theory, 2004. doi:10.1007/978-3-540-27819-1_40

Markdown

[Hoyle and Rattray. "A Statistical Mechanics Analysis of Gram Matrix Eigenvalue Spectra." Annual Conference on Computational Learning Theory, 2004.](https://mlanthology.org/colt/2004/hoyle2004colt-statistical/) doi:10.1007/978-3-540-27819-1_40

BibTeX

@inproceedings{hoyle2004colt-statistical,
  title     = {{A Statistical Mechanics Analysis of Gram Matrix Eigenvalue Spectra}},
  author    = {Hoyle, David C. and Rattray, Magnus},
  booktitle = {Annual Conference on Computational Learning Theory},
  year      = {2004},
  pages     = {579-593},
  doi       = {10.1007/978-3-540-27819-1_40},
  url       = {https://mlanthology.org/colt/2004/hoyle2004colt-statistical/}
}