Speech Recognition Using Demi-Syllable Neural Prediction Model
Abstract
The Neural Prediction Model is the speech recognition model based on pattern prediction by multilayer perceptrons. Its effectiveness was con(cid:173) firmed by the speaker-independent digit recognition experiments. This paper presents an improvement in the model and its application to large vocabulary speech recognition, based on subword units. The improvement involves an introduction of "backward prediction," which further improves the prediction accuracy of the original model with only "forward predic(cid:173) tion". In application of the model to speaker-dependent large vocabulary speech recognition, the demi-syllable unit is used as a subword recognition unit. Experimental results indicated a 95.2% recognition accuracy for a 5000 word test set and the effectiveness was confirmed for the proposed model improvement and the demi-syllable subword units.
Cite
Text
Iso and Watanabe. "Speech Recognition Using Demi-Syllable Neural Prediction Model." Neural Information Processing Systems, 1990.Markdown
[Iso and Watanabe. "Speech Recognition Using Demi-Syllable Neural Prediction Model." Neural Information Processing Systems, 1990.](https://mlanthology.org/neurips/1990/iso1990neurips-speech/)BibTeX
@inproceedings{iso1990neurips-speech,
title = {{Speech Recognition Using Demi-Syllable Neural Prediction Model}},
author = {Iso, Ken-ichi and Watanabe, Takao},
booktitle = {Neural Information Processing Systems},
year = {1990},
pages = {227-233},
url = {https://mlanthology.org/neurips/1990/iso1990neurips-speech/}
}