Markov Processes on Curves for Automatic Speech Recognition
Abstract
We investigate a probabilistic framework for automatic speech recognition based on the intrinsic geometric properties of curves. In particular, we analyze the setting in which two variables-one continuous (~), one discrete (s )-evolve jointly in time. We sup(cid:173) pose that the vector ~ traces out a smooth multidimensional curve and that the variable s evolves stochastically as a function of the arc length traversed along this curve. Since arc length does not depend on the rate at which a curve is traversed, this gives rise to a family of Markov processes whose predictions, Pr[sl~]' are invariant to nonlinear warpings of time. We describe the use of such models, known as Markov processes on curves (MPCs), for automatic speech recognition, where ~ are acoustic feature trajec(cid:173) tories and s are phonetic transcriptions. On two tasks-recognizing New Jersey town names and connected alpha-digits- we find that MPCs yield lower word error rates than comparably trained hidden Markov models.
Cite
Text
Saul and Rahim. "Markov Processes on Curves for Automatic Speech Recognition." Neural Information Processing Systems, 1998.Markdown
[Saul and Rahim. "Markov Processes on Curves for Automatic Speech Recognition." Neural Information Processing Systems, 1998.](https://mlanthology.org/neurips/1998/saul1998neurips-markov/)BibTeX
@inproceedings{saul1998neurips-markov,
title = {{Markov Processes on Curves for Automatic Speech Recognition}},
author = {Saul, Lawrence K. and Rahim, Mazin G.},
booktitle = {Neural Information Processing Systems},
year = {1998},
pages = {751-760},
url = {https://mlanthology.org/neurips/1998/saul1998neurips-markov/}
}