Improved Smoothing for Probabilistic Suffix Trees Seen as Variable Order Markov Chains

Abstract

In this paper, we compare Probabilistic Suffix Trees (PST), recently proposed, to a specific smoothing of Markov chains and show that they both induce the same model, namely a variable order Markov chain. We show a weakness of PST in terms of smoothing and propose to use an enhanced smoothing. We show that the model based on enhanced smoothing outperform the PST while needing less parameters on a protein domain detection task on public databases.

Cite

Text

Kermorvant and Dupont. "Improved Smoothing for Probabilistic Suffix Trees Seen as Variable Order Markov Chains." European Conference on Machine Learning, 2002. doi:10.1007/3-540-36755-1_16

Markdown

[Kermorvant and Dupont. "Improved Smoothing for Probabilistic Suffix Trees Seen as Variable Order Markov Chains." European Conference on Machine Learning, 2002.](https://mlanthology.org/ecmlpkdd/2002/kermorvant2002ecml-improved/) doi:10.1007/3-540-36755-1_16

BibTeX

@inproceedings{kermorvant2002ecml-improved,
  title     = {{Improved Smoothing for Probabilistic Suffix Trees Seen as Variable Order Markov Chains}},
  author    = {Kermorvant, Christopher and Dupont, Pierre},
  booktitle = {European Conference on Machine Learning},
  year      = {2002},
  pages     = {185-194},
  doi       = {10.1007/3-540-36755-1_16},
  url       = {https://mlanthology.org/ecmlpkdd/2002/kermorvant2002ecml-improved/}
}