Density Level Set Estimation on Manifolds with DBSCAN
Abstract
We show that DBSCAN can estimate the connected components of the $\lambda$-density level set $\{ x : f(x) \ge \lambda\}$ given $n$ i.i.d. samples from an unknown density $f$. We characterize the regularity of the level set boundaries using parameter $\beta > 0$ and analyze the estimation error under the Hausdorff metric. When the data lies in $\mathbb{R}^D$ we obtain a rate of $\widetilde{O}(n^{-1/(2\beta + D)})$, which matches known lower bounds up to logarithmic factors. When the data lies on an embedded unknown $d$-dimensional manifold in $\mathbb{R}^D$, then we obtain a rate of $\widetilde{O}(n^{-1/(2\beta + d\cdot \max\{1, \beta \})})$. Finally, we provide adaptive parameter tuning in order to attain these rates with no a priori knowledge of the intrinsic dimension, density, or $\beta$.
Cite
Text
Jiang. "Density Level Set Estimation on Manifolds with DBSCAN." International Conference on Machine Learning, 2017.Markdown
[Jiang. "Density Level Set Estimation on Manifolds with DBSCAN." International Conference on Machine Learning, 2017.](https://mlanthology.org/icml/2017/jiang2017icml-density/)BibTeX
@inproceedings{jiang2017icml-density,
title = {{Density Level Set Estimation on Manifolds with DBSCAN}},
author = {Jiang, Heinrich},
booktitle = {International Conference on Machine Learning},
year = {2017},
pages = {1684-1693},
volume = {70},
url = {https://mlanthology.org/icml/2017/jiang2017icml-density/}
}