Detecting Text in Natural Scenes with Stroke Width Transform

Abstract

We present a novel image operator that seeks to find the value of stroke width for each image pixel, and demonstrate its use on the task of text detection in natural images. The suggested operator is local and data dependent, which makes it fast and robust enough to eliminate the need for multi-scale computation or scanning windows. Extensive testing shows that the suggested scheme outperforms the latest published algorithms. Its simplicity allows the algorithm to detect texts in many fonts and languages.

Cite

Text

Epshtein et al. "Detecting Text in Natural Scenes with Stroke Width Transform." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010. doi:10.1109/CVPR.2010.5540041

Markdown

[Epshtein et al. "Detecting Text in Natural Scenes with Stroke Width Transform." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010.](https://mlanthology.org/cvpr/2010/epshtein2010cvpr-detecting/) doi:10.1109/CVPR.2010.5540041

BibTeX

@inproceedings{epshtein2010cvpr-detecting,
  title     = {{Detecting Text in Natural Scenes with Stroke Width Transform}},
  author    = {Epshtein, Boris and Ofek, Eyal and Wexler, Yonatan},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2010},
  pages     = {2963-2970},
  doi       = {10.1109/CVPR.2010.5540041},
  url       = {https://mlanthology.org/cvpr/2010/epshtein2010cvpr-detecting/}
}