Planar Hidden Markov Modeling: From Speech to Optical Character Recognition
Abstract
We propose in this paper a statistical model (planar hidden Markov model - PHMM) describing statistical properties of images. The model generalizes the single-dimensional HMM, used for speech processing, to the planar case. For this model to be useful an efficient segmentation algorithm, similar to the Viterbi algorithm for HMM, must exist We present conditions in terms of the PHMM parameters that are sufficient to guarantee that the planar segmentation problem can be solved in polynomial time, and describe an algorithm for that. This algorithm aligns optimally the image with the model, and therefore is insensitive to elastic distortions of images. Using this algorithm a joint optima1 segmentation and recognition of the image can be performed, thus overcoming the weakness of traditional OCR systems where segmentation is performed independently before the recognition leading to unrecoverable recognition errors.
Cite
Text
Levin and Pieraccini. "Planar Hidden Markov Modeling: From Speech to Optical Character Recognition." Neural Information Processing Systems, 1992.Markdown
[Levin and Pieraccini. "Planar Hidden Markov Modeling: From Speech to Optical Character Recognition." Neural Information Processing Systems, 1992.](https://mlanthology.org/neurips/1992/levin1992neurips-planar/)BibTeX
@inproceedings{levin1992neurips-planar,
title = {{Planar Hidden Markov Modeling: From Speech to Optical Character Recognition}},
author = {Levin, Esther and Pieraccini, Roberto},
booktitle = {Neural Information Processing Systems},
year = {1992},
pages = {731-738},
url = {https://mlanthology.org/neurips/1992/levin1992neurips-planar/}
}