Dynamic Similarity Graph Construction with Kernel Density Estimation

Abstract

In the kernel density estimation (KDE) problem, we are given a set $X$ of data points in $\mathbb{R}^d$, a kernel function $k: \mathbb{R}^d \times \mathbb{R}^d \rightarrow \mathbb{R}$, and a query point $\mathbf{q} \in \mathbb{R}^d$, and the objective is to quickly output an estimate of $\sum_{\mathbf{x} \in X} k(\mathbf{q}, \mathbf{x})$. In this paper, we consider $\textsf{KDE}$ in the dynamic setting, and introduce a data structure that efficiently maintains the estimates for a set of query points as data points are added to $X$ over time. Based on this, we design a dynamic data structure that maintains a sparse approximation of the fully connected similarity graph on $X$, and develop a fast dynamic spectral clustering algorithm. We further evaluate the effectiveness of our algorithms on both synthetic and real-world datasets.

Cite

Text

Laenen et al. "Dynamic Similarity Graph Construction with Kernel Density Estimation." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Laenen et al. "Dynamic Similarity Graph Construction with Kernel Density Estimation." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/laenen2025icml-dynamic/)

BibTeX

@inproceedings{laenen2025icml-dynamic,
  title     = {{Dynamic Similarity Graph Construction with Kernel Density Estimation}},
  author    = {Laenen, Steinar and Macgregor, Peter and Sun, He},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {32130-32163},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/laenen2025icml-dynamic/}
}