Mining Clustering Dimensions

Dasgupta, Sajib; Ng, Vincent

ICML 2010 pp. 263-270

Abstract

Although it is common practice to produce only a single clustering of a dataset, in many cases text documents can be clustered along different dimensions. Unfortunately, not only do traditional clustering algorithms fail to produce multiple clusterings of a dataset, but the only clustering they produce may not be the one that the user desires. To address this major limitation, we propose a novel clustering algorithm for inducing multiple clusterings along the important dimensions of a dataset. Its ability to reveal the important clustering dimensions of a dataset in an unsupervised manner is particularly appealing for those users who have no idea of how a dataset can possibly be clustered. We demonstrate its viability on several challenging text classification tasks.

Cite

Text

Dasgupta and Ng. "Mining Clustering Dimensions." International Conference on Machine Learning, 2010.

Markdown

[Dasgupta and Ng. "Mining Clustering Dimensions." International Conference on Machine Learning, 2010.](https://mlanthology.org/icml/2010/dasgupta2010icml-mining/)

BibTeX

@inproceedings{dasgupta2010icml-mining,
  title     = {{Mining Clustering Dimensions}},
  author    = {Dasgupta, Sajib and Ng, Vincent},
  booktitle = {International Conference on Machine Learning},
  year      = {2010},
  pages     = {263--270},
  url       = {https://mlanthology.org/icml/2010/dasgupta2010icml-mining/}
}