Scene Text Extraction and Translation for Handheld Devices

Abstract

We describe a scene text extraction system for handheld devices to provide enhanced information perception services to the user. It uses a color camera attached to a personal digital assistant as an input device to capture scene images from the real world and it employs image enhancement and segmentation methods to extract written information from the scene, convert them to text information and show them to the user so that he/she can see both the real world and information together. We implemented a prototype application: an automatic sign/text language translation for foreign travelers, where people can use the system whenever they want to see text or signs in their own language where they are originally written in a foreign language in the scene.

Cite

Text

Haritaoglu. "Scene Text Extraction and Translation for Handheld Devices." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2001. doi:10.1109/CVPR.2001.990990

Markdown

[Haritaoglu. "Scene Text Extraction and Translation for Handheld Devices." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2001.](https://mlanthology.org/cvpr/2001/haritaoglu2001cvpr-scene/) doi:10.1109/CVPR.2001.990990

BibTeX

@inproceedings{haritaoglu2001cvpr-scene,
  title     = {{Scene Text Extraction and Translation for Handheld Devices}},
  author    = {Haritaoglu, Ismail},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2001},
  pages     = {II:408-413},
  doi       = {10.1109/CVPR.2001.990990},
  url       = {https://mlanthology.org/cvpr/2001/haritaoglu2001cvpr-scene/}
}