Learning General Gaussian Mixtures with Efficient Score Matching
Abstract
We study the problem of learning mixtures of $k$ Gaussians in $d$ dimensions. We make no separation assumptions on the underlying mixture components: we only require that the covariance matrices have bounded condition number and that the means and covariances lie in a ball of bounded radius. We give an algorithm that draws $d^{\textrm{poly}(k/\epsilon)}$ samples from the target mixture, runs in sample-polynomial time, and constructs a sampler whose output distribution is $\epsilon$-close from the unknown mixture in total variation. Prior works for this problem either (i) required exponential runtime in the dimension $d$, (ii) placed strong assumptions on the instance (e.g., spherical covariances or clusterability), or (iii) had doubly exponential dependence on the number of components $k$. Our approach departs from commonly used techniques for this problem like the method of moments. Instead, we leverage a recently developed reduction, based on diffusion models, from distribution learning to a supervised learning task called score matching. We give an algorithm for the latter by proving a structural result showing that the score function of a Gaussian mixture can be approximated by a piecewise-polynomial function, and there is an efficient algorithm for finding it. To our knowledge, this is the first example of diffusion models achieving a state-of-the-art theoretical guarantee for an unsupervised learning task.
Cite
Text
Chen et al. "Learning General Gaussian Mixtures with Efficient Score Matching." Proceedings of Thirty Eighth Conference on Learning Theory, 2025.Markdown
[Chen et al. "Learning General Gaussian Mixtures with Efficient Score Matching." Proceedings of Thirty Eighth Conference on Learning Theory, 2025.](https://mlanthology.org/colt/2025/chen2025colt-learning/)BibTeX
@inproceedings{chen2025colt-learning,
title = {{Learning General Gaussian Mixtures with Efficient Score Matching}},
author = {Chen, Sitan and Kontonis, Vasilis and Shah, Kulin},
booktitle = {Proceedings of Thirty Eighth Conference on Learning Theory},
year = {2025},
pages = {1029-1090},
volume = {291},
url = {https://mlanthology.org/colt/2025/chen2025colt-learning/}
}