Toward a Better Understanding of Deep Neural Network Based Acoustic Modelling: An Empirical Investigation
Abstract
Recently, deep neural networks (DNNs) have outperformed traditional acoustic models on a variety of speech recognition benchmarks.However, due to system differences across research groups, although a tremendous breadth and depth of related work has been established, it is still not easy to assess the performance improvements of a particular architectural variant from examining the literature when building DNN acoustic models. Our work aims to uncover which variations among baseline systems are most relevant for automatic speech recognition (ASR) performance via a series of systematic tests on the limits of the major architectural choices.By holding all the other components fixed, we are able to explore the design and training decisions without being confounded by the other influencing factors. Our experiment results suggest that a relatively simple DNN architecture and optimization technique produces strong results.These findings, along with previous work, not only help build a better understanding towards why DNN acoustic models perform well or how they might be improved, but also help establish a set of best practices for new speech corpora and language understanding task variants.
Cite
Text
Wang et al. "Toward a Better Understanding of Deep Neural Network Based Acoustic Modelling: An Empirical Investigation." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.10256Markdown
[Wang et al. "Toward a Better Understanding of Deep Neural Network Based Acoustic Modelling: An Empirical Investigation." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/wang2016aaai-better/) doi:10.1609/AAAI.V30I1.10256BibTeX
@inproceedings{wang2016aaai-better,
title = {{Toward a Better Understanding of Deep Neural Network Based Acoustic Modelling: An Empirical Investigation}},
author = {Wang, Xingfu and Wang, Lin and Chen, Jing and Wu, Litao},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2016},
pages = {2173-2179},
doi = {10.1609/AAAI.V30I1.10256},
url = {https://mlanthology.org/aaai/2016/wang2016aaai-better/}
}