A Recurrent Neural Network for Word Identification from Continuous Phoneme Strings

Abstract

A neural network architecture was designed for locating word boundaries and identifying words from phoneme sequences. This architecture was tested in three sets of studies. First, a highly redundant corpus with a restricted vocabulary was generated and the network was trained with a limited number of phonemic variations for the words in the corpus. Tests of network performance on a transfer set yielded a very low error rate. In a second study, a network was trained to identify words from expert transcriptions of speech. On a transfer test, error rate for correct simultaneous identification of words and word boundaries was 18%. The third study used the output of a phoneme classifier as the input to the word and word boundary identification network. The error rate on a transfer test set was 49% for this task. Overall, these studies provide a first step at identifying words in connected discourse with a neural network.

Cite

Text

Allen and Kamm. "A Recurrent Neural Network for Word Identification from Continuous Phoneme Strings." Neural Information Processing Systems, 1990.

Markdown

[Allen and Kamm. "A Recurrent Neural Network for Word Identification from Continuous Phoneme Strings." Neural Information Processing Systems, 1990.](https://mlanthology.org/neurips/1990/allen1990neurips-recurrent/)

BibTeX

@inproceedings{allen1990neurips-recurrent,
  title     = {{A Recurrent Neural Network for Word Identification from Continuous Phoneme Strings}},
  author    = {Allen, Robert B. and Kamm, Candace A.},
  booktitle = {Neural Information Processing Systems},
  year      = {1990},
  pages     = {206-212},
  url       = {https://mlanthology.org/neurips/1990/allen1990neurips-recurrent/}
}