Enhanced Denesity Peak Clustering for High-Dimensional Data
Abstract
As a foundational clustering paradigm, Density Peak Clustering (DPC) partitions samples into clusters based on their density peaks, garnering widespread attention. However, traditional DPC methods usually focus on high-density regions, neglecting representative peaks in relatively low-density areas, particularly in datasets with varying densities and multiple peaks. Moreover, existing DPC variants struggle to identify clusters correctly in high-dimensional spaces due to the indistinct distance differences among samples and sparse data distributions. Additionally, existing methods typically adopt a one-step label assignment strategy, making them prone to cascading errors when initial misassignments occur. To address these challenges, we propose an Enhanced Density Peak Clustering (EDPC) method, which creatively incorporates multilayer perceptron (MLP)-based dimensionality reduction and a hierarchical label assignment strategy to significantly improve clustering performance in high-dimensional scenarios. Specifically, we introduce an effective selection condition that combines average densities and density-related distances to generate potential cluster centers, ensuring that peaks across different density regions are considered simultaneously. Furthermore, an MLP, guided by pseudo-labels from sub-clusters, is designed to learn low-dimensional embeddings for high-dimensional data, preserving data locality while enhancing clusterability. Extensive experiments demonstrate the effectiveness and superiority of EDPC against state-of-the-art DPC methods.
Cite
Text
Wang et al. "Enhanced Denesity Peak Clustering for High-Dimensional Data." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I20.35442Markdown
[Wang et al. "Enhanced Denesity Peak Clustering for High-Dimensional Data." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/wang2025aaai-enhanced/) doi:10.1609/AAAI.V39I20.35442BibTeX
@inproceedings{wang2025aaai-enhanced,
title = {{Enhanced Denesity Peak Clustering for High-Dimensional Data}},
author = {Wang, Zhongli and Yang, Jie and Guan, Junyi and Zhang, Chenglong and Liang, Xinyan and Jiang, Bingbing and Sheng, Weiguo},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {21411-21419},
doi = {10.1609/AAAI.V39I20.35442},
url = {https://mlanthology.org/aaai/2025/wang2025aaai-enhanced/}
}