A Comparison of Image Processing Techniques for Visual Speech Recognition Applications
Abstract
We examine eight different techniques for developing visual rep(cid:173) resentations in machine vision tasks. In particular we compare different versions of principal component and independent com(cid:173) ponent analysis in combination with stepwise regression methods for variable selection. We found that local methods, based on the statistics of image patches, consistently outperformed global meth(cid:173) ods based on the statistics of entire images. This result is consistent with previous work on emotion and facial expression recognition. In addition, the use of a stepwise regression technique for selecting variables and regions of interest substantially boosted performance.
Cite
Text
Gray et al. "A Comparison of Image Processing Techniques for Visual Speech Recognition Applications." Neural Information Processing Systems, 2000.Markdown
[Gray et al. "A Comparison of Image Processing Techniques for Visual Speech Recognition Applications." Neural Information Processing Systems, 2000.](https://mlanthology.org/neurips/2000/gray2000neurips-comparison/)BibTeX
@inproceedings{gray2000neurips-comparison,
title = {{A Comparison of Image Processing Techniques for Visual Speech Recognition Applications}},
author = {Gray, Michael S. and Sejnowski, Terrence J. and Movellan, Javier R.},
booktitle = {Neural Information Processing Systems},
year = {2000},
pages = {939-945},
url = {https://mlanthology.org/neurips/2000/gray2000neurips-comparison/}
}