A Segment-Based Automatic Language Identification System

Abstract

We have developed a four-language automatic language identification sys(cid:173) tem for high-quality speech. The system uses a neural network-based segmentation algorithm to segment speech into seven broad phonetic cat(cid:173) egories. Phonetic and prosodic features computed on these categories are then input to a second network that performs the language classification. The system was trained and tested on separate sets of speakers of Ameri(cid:173) can English, Japanese, Mandarin Chinese and Tamil. It currently performs with an accuracy of 89.5% on the utterances of the test set.

Cite

Text

Muthusamy and Cole. "A Segment-Based Automatic Language Identification System." Neural Information Processing Systems, 1991.

Markdown

[Muthusamy and Cole. "A Segment-Based Automatic Language Identification System." Neural Information Processing Systems, 1991.](https://mlanthology.org/neurips/1991/muthusamy1991neurips-segmentbased/)

BibTeX

@inproceedings{muthusamy1991neurips-segmentbased,
  title     = {{A Segment-Based Automatic Language Identification System}},
  author    = {Muthusamy, Yeshwant K. and Cole, Ronald A.},
  booktitle = {Neural Information Processing Systems},
  year      = {1991},
  pages     = {241-248},
  url       = {https://mlanthology.org/neurips/1991/muthusamy1991neurips-segmentbased/}
}