Near-Optimal Quantum Coreset Construction Algorithms for Clustering

Abstract

$k$-Clustering in $\mathbb{R}^d$ (e.g., $k$-median and $k$-means) is a fundamental machine learning problem. While near-linear time approximation algorithms were known in the classical setting for a dataset with cardinality $n$, it remains open to find sublinear-time quantum algorithms. We give quantum algorithms that find coresets for $k$-clustering in $\mathbb{R}^d$ with $\tilde{O}(\sqrt{nk}d^{3/2})$ query complexity. Our coreset reduces the input size from $n$ to $\mathrm{poly}(k\epsilon^{-1}d)$, so that existing $\alpha$-approximation algorithms for clustering can run on top of it and yield $(1 + \epsilon)\alpha$-approximation. This eventually yields a quadratic speedup for various $k$-clustering approximation algorithms. We complement our algorithm with a nearly matching lower bound, that any quantum algorithm must make $\Omega(\sqrt{nk})$ queries in order to achieve even $O(1)$-approximation for $k$-clustering.

Cite

Text

Xue et al. "Near-Optimal Quantum Coreset Construction Algorithms for Clustering." International Conference on Machine Learning, 2023.

Markdown

[Xue et al. "Near-Optimal Quantum Coreset Construction Algorithms for Clustering." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/xue2023icml-nearoptimal/)

BibTeX

@inproceedings{xue2023icml-nearoptimal,
  title     = {{Near-Optimal Quantum Coreset Construction Algorithms for Clustering}},
  author    = {Xue, Yecheng and Chen, Xiaoyu and Li, Tongyang and Jiang, Shaofeng H.-C.},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {38881-38912},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/xue2023icml-nearoptimal/}
}