Improved Smoothing for Probabilistic Suffix Trees Seen as Variable Order Markov Chains
Abstract
In this paper, we compare Probabilistic Suffix Trees (PST), recently proposed, to a specific smoothing of Markov chains and show that they both induce the same model, namely a variable order Markov chain. We show a weakness of PST in terms of smoothing and propose to use an enhanced smoothing. We show that the model based on enhanced smoothing outperform the PST while needing less parameters on a protein domain detection task on public databases.
Cite
Text
Kermorvant and Dupont. "Improved Smoothing for Probabilistic Suffix Trees Seen as Variable Order Markov Chains." European Conference on Machine Learning, 2002. doi:10.1007/3-540-36755-1_16Markdown
[Kermorvant and Dupont. "Improved Smoothing for Probabilistic Suffix Trees Seen as Variable Order Markov Chains." European Conference on Machine Learning, 2002.](https://mlanthology.org/ecmlpkdd/2002/kermorvant2002ecml-improved/) doi:10.1007/3-540-36755-1_16BibTeX
@inproceedings{kermorvant2002ecml-improved,
title = {{Improved Smoothing for Probabilistic Suffix Trees Seen as Variable Order Markov Chains}},
author = {Kermorvant, Christopher and Dupont, Pierre},
booktitle = {European Conference on Machine Learning},
year = {2002},
pages = {185-194},
doi = {10.1007/3-540-36755-1_16},
url = {https://mlanthology.org/ecmlpkdd/2002/kermorvant2002ecml-improved/}
}