Dynamic Similarity Graph Construction with Kernel Density Estimation
Abstract
In the kernel density estimation (KDE) problem, we are given a set $X$ of data points in $\mathbb{R}^d$, a kernel function $k: \mathbb{R}^d \times \mathbb{R}^d \rightarrow \mathbb{R}$, and a query point $\mathbf{q} \in \mathbb{R}^d$, and the objective is to quickly output an estimate of $\sum_{\mathbf{x} \in X} k(\mathbf{q}, \mathbf{x})$. In this paper, we consider $\textsf{KDE}$ in the dynamic setting, and introduce a data structure that efficiently maintains the estimates for a set of query points as data points are added to $X$ over time. Based on this, we design a dynamic data structure that maintains a sparse approximation of the fully connected similarity graph on $X$, and develop a fast dynamic spectral clustering algorithm. We further evaluate the effectiveness of our algorithms on both synthetic and real-world datasets.
Cite
Text
Laenen et al. "Dynamic Similarity Graph Construction with Kernel Density Estimation." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Laenen et al. "Dynamic Similarity Graph Construction with Kernel Density Estimation." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/laenen2025icml-dynamic/)BibTeX
@inproceedings{laenen2025icml-dynamic,
title = {{Dynamic Similarity Graph Construction with Kernel Density Estimation}},
author = {Laenen, Steinar and Macgregor, Peter and Sun, He},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {32130-32163},
volume = {267},
url = {https://mlanthology.org/icml/2025/laenen2025icml-dynamic/}
}