Learning Taxonomies by Dependence Maximization

Abstract

We introduce a family of unsupervised algorithms, numerical taxonomy clustering, to simultaneously cluster data, and to learn a taxonomy that encodes the relationship between the clusters. The algorithms work by maximizing the dependence between the taxonomy and the original data. The resulting taxonomy is a more informative visualization of complex data than simple clustering; in addition, taking into account the relations between different clusters is shown to substantially improve the quality of the clustering, when compared with state-of-the-art algorithms in the literature (both spectral clustering and a previous dependence maximization approach). We demonstrate our algorithm on image and text data.

Cite

Text

Blaschko and Gretton. "Learning Taxonomies by Dependence Maximization." Neural Information Processing Systems, 2008.

Markdown

[Blaschko and Gretton. "Learning Taxonomies by Dependence Maximization." Neural Information Processing Systems, 2008.](https://mlanthology.org/neurips/2008/blaschko2008neurips-learning/)

BibTeX

@inproceedings{blaschko2008neurips-learning,
  title     = {{Learning Taxonomies by Dependence Maximization}},
  author    = {Blaschko, Matthew and Gretton, Arthur},
  booktitle = {Neural Information Processing Systems},
  year      = {2008},
  pages     = {153-160},
  url       = {https://mlanthology.org/neurips/2008/blaschko2008neurips-learning/}
}