A Comparison of Image Processing Techniques for Visual Speech Recognition Applications

Michael S. Gray, Terrence J. Sejnowski, Javier R. Movellan

NeurIPS 2000 pp. 939-945

/neurips/2000/gray2000neurips-comparison/

Abstract

We examine eight different techniques for developing visual rep(cid:173) resentations in machine vision tasks. In particular we compare different versions of principal component and independent com(cid:173) ponent analysis in combination with stepwise regression methods for variable selection. We found that local methods, based on the statistics of image patches, consistently outperformed global meth(cid:173) ods based on the statistics of entire images. This result is consistent with previous work on emotion and facial expression recognition. In addition, the use of a stepwise regression technique for selecting variables and regions of interest substantially boosted performance.

PDF NeurIPS Semantic Scholar

Cite

Text

Gray et al. "A Comparison of Image Processing Techniques for Visual Speech Recognition Applications." Neural Information Processing Systems, 2000.

Markdown

[Gray et al. "A Comparison of Image Processing Techniques for Visual Speech Recognition Applications." Neural Information Processing Systems, 2000.](https://mlanthology.org/neurips/2000/gray2000neurips-comparison/)

BibTeX

@inproceedings{gray2000neurips-comparison,
  title     = {{A Comparison of Image Processing Techniques for Visual Speech Recognition Applications}},
  author    = {Gray, Michael S. and Sejnowski, Terrence J. and Movellan, Javier R.},
  booktitle = {Neural Information Processing Systems},
  year      = {2000},
  pages     = {939-945},
  url       = {https://mlanthology.org/neurips/2000/gray2000neurips-comparison/}
}