A Hierarchical Bayesian Markovian Model for Motifs in Biopolymer Sequences

Abstract

We propose a dynamic Bayesian model for motifs in biopolymer se- quences which captures rich biological prior knowledge and positional dependencies in motif structure in a principled way. Our model posits that the position-specific multinomial parameters for monomer distribu- tion are distributed as a latent Dirichlet-mixture random variable, and the position-specific Dirichlet component is determined by a hidden Markov process. Model parameters can be fit on training motifs using a vari- ational EM algorithm within an empirical Bayesian framework. Varia- tional inference is also used for detecting hidden motifs. Our model im- proves over previous models that ignore biological priors and positional dependence. It has much higher sensitivity to motifs during detection and a notable ability to distinguish genuine motifs from false recurring patterns.

Cite

Text

Xing et al. "A Hierarchical Bayesian Markovian Model for Motifs in Biopolymer Sequences." Neural Information Processing Systems, 2002.

Markdown

[Xing et al. "A Hierarchical Bayesian Markovian Model for Motifs in Biopolymer Sequences." Neural Information Processing Systems, 2002.](https://mlanthology.org/neurips/2002/xing2002neurips-hierarchical/)

BibTeX

@inproceedings{xing2002neurips-hierarchical,
  title     = {{A Hierarchical Bayesian Markovian Model for Motifs in Biopolymer Sequences}},
  author    = {Xing, Eric P. and Jordan, Michael I. and Karp, Richard M. and Russell, Stuart},
  booktitle = {Neural Information Processing Systems},
  year      = {2002},
  pages     = {1513-1520},
  url       = {https://mlanthology.org/neurips/2002/xing2002neurips-hierarchical/}
}