Toward Learning Gaussian Mixtures with Arbitrary Separation
Abstract
In recent years analysis of complexity of learning Gaussian mixture models from sampled data has received significant attention in computational machine learning and theory communities. In this paper we present the first result showing that polynomial time learning of multidimensional Gaussian Mixture distributions is possible when the separation between the component means is arbitrarily small. Specifically, we present an algorithm for learning the parameters of a mixture of k identical spherical Gaussians in n-dimensional space with an arbitrarily small separation between the components, which is polynomial in dimension, inverse component separation and other input parameters for a fixed number of components k. The algorithm uses a projection to k dimensions and then a reduction to the 1-dimensional case. It relies on a theoretical analysis showing that two 1-dimensional mixtures whose densities are close in the L2 norm must have similar means and mixing coefficients. To produce the necessary lower bound for the L2 norm in terms of the distances between the corresponding means, we analyze the behavior of the Fourier transform of a mixture of Gaussians in one dimension around the origin, which turns out to be closely related to the properties of the Vandermonde matrix obtained from the component means. Analysis of minors of the Vandermonde matrix together with basic function approximation results allows us to provide a lower bound for the norm of the mixture in the Fourier domain and hence a bound in the original space. Additionally, we present a separate argument for reconstructing variance.
Cite
Text
Belkin and Sinha. "Toward Learning Gaussian Mixtures with Arbitrary Separation." Annual Conference on Computational Learning Theory, 2010.Markdown
[Belkin and Sinha. "Toward Learning Gaussian Mixtures with Arbitrary Separation." Annual Conference on Computational Learning Theory, 2010.](https://mlanthology.org/colt/2010/belkin2010colt-learning/)BibTeX
@inproceedings{belkin2010colt-learning,
title = {{Toward Learning Gaussian Mixtures with Arbitrary Separation}},
author = {Belkin, Mikhail and Sinha, Kaushik},
booktitle = {Annual Conference on Computational Learning Theory},
year = {2010},
pages = {407-419},
url = {https://mlanthology.org/colt/2010/belkin2010colt-learning/}
}