DB-CSC: A Density-Based Approach for Subspace Clustering in Graphs with Feature Vectors

Abstract

Data sources representing attribute information in combination with network information are widely available in today’s applications. To realize the full potential for knowledge extraction, mining techniques like clustering should consider both information types simultaneously. Recent clustering approaches combine subspace clustering with dense subgraph mining to identify groups of objects that are similar in subsets of their attributes as well as densely connected within the network. While those approaches successfully circumvent the problem of full-space clustering, their limited cluster definitions are restricted to clusters of certain shapes. In this work, we introduce a density-based cluster definition taking the attribute similarity in subspaces and the graph density into account. This novel cluster model enables us to detect clusters of arbitrary shape and size. We avoid redundancy in the result by selecting only the most interesting non-redundant clusters. Based on this model, we introduce the clustering algorithm DB-CSC. In thorough experiments we demonstrate the strength of DB-CSC in comparison to related approaches.

Cite

Text

Günnemann et al. "DB-CSC: A Density-Based Approach for Subspace Clustering in Graphs with Feature Vectors." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011. doi:10.1007/978-3-642-23780-5_46

Markdown

[Günnemann et al. "DB-CSC: A Density-Based Approach for Subspace Clustering in Graphs with Feature Vectors." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011.](https://mlanthology.org/ecmlpkdd/2011/gunnemann2011ecmlpkdd-dbcsc/) doi:10.1007/978-3-642-23780-5_46

BibTeX

@inproceedings{gunnemann2011ecmlpkdd-dbcsc,
  title     = {{DB-CSC: A Density-Based Approach for Subspace Clustering in Graphs with Feature Vectors}},
  author    = {Günnemann, Stephan and Boden, Brigitte and Seidl, Thomas},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2011},
  pages     = {565-580},
  doi       = {10.1007/978-3-642-23780-5_46},
  url       = {https://mlanthology.org/ecmlpkdd/2011/gunnemann2011ecmlpkdd-dbcsc/}
}