Clustering with the Fisher Score

Abstract

Recently the Fisher score (or the Fisher kernel) is increasingly used as a feature extractor for classification problems. The Fisher score is a vector of parameter derivatives of loglikelihood of a probabilistic model. This paper gives a theoretical analysis about how class information is pre- served in the space of the Fisher score, which turns out that the Fisher score consists of a few important dimensions with class information and many nuisance dimensions. When we perform clustering with the Fisher score, K-Means type methods are obviously inappropriate because they make use of all dimensions. So we will develop a novel but simple clus- tering algorithm specialized for the Fisher score, which can exploit im- portant dimensions. This algorithm is successfully tested in experiments with artificial data and real data (amino acid sequences).

Cite

Text

Tsuda et al. "Clustering with the Fisher Score." Neural Information Processing Systems, 2002.

Markdown

[Tsuda et al. "Clustering with the Fisher Score." Neural Information Processing Systems, 2002.](https://mlanthology.org/neurips/2002/tsuda2002neurips-clustering/)

BibTeX

@inproceedings{tsuda2002neurips-clustering,
  title     = {{Clustering with the Fisher Score}},
  author    = {Tsuda, Koji and Kawanabe, Motoaki and Müller, Klaus-Robert},
  booktitle = {Neural Information Processing Systems},
  year      = {2002},
  pages     = {745-752},
  url       = {https://mlanthology.org/neurips/2002/tsuda2002neurips-clustering/}
}