Spectral Clustering of Biological Sequence Data

Abstract

In this paper, we apply spectral techniques to clustering biological sequence data that has proved more difficult to cluster effectively. For this purpose, we have to (1) extend spectral clustering algorithms to deal with asymmetric affinities, like the alignment scores used in the comparison of biological sequences, and (2) devise a hierarchical algorithm that can handle many clusters with imbalanced sizes robustly. We present an algorithm for clustering asymmetric affinity data, and demonstrate the performance of this algorithm at recovering the higher levels of the Structural Classification of Proteins (SCOP) on a data base of highly conserved subsequences.

Cite

Text

Pentney and Meila. "Spectral Clustering of Biological Sequence Data." AAAI Conference on Artificial Intelligence, 2005.

Markdown

[Pentney and Meila. "Spectral Clustering of Biological Sequence Data." AAAI Conference on Artificial Intelligence, 2005.](https://mlanthology.org/aaai/2005/pentney2005aaai-spectral/)

BibTeX

@inproceedings{pentney2005aaai-spectral,
  title     = {{Spectral Clustering of Biological Sequence Data}},
  author    = {Pentney, William and Meila, Marina},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2005},
  pages     = {845-850},
  url       = {https://mlanthology.org/aaai/2005/pentney2005aaai-spectral/}
}