Robust and Accurate Text Stroke Segmentation

Abstract

We propose a new technique for the accurate segmentation of text strokes from an image. The algorithm takes in a cropped image containing a word. It first performs a coarse segmentation using a Fully Convolutional Network (FCN). While not accurate, this initial segmentation can usually identify most of the text stroke content even in difficult situations, with uneven lighting and non-uniform background. The segmentation is then refined using a fully connected Conditional Random Field (CRF) with a novel kernel definition that includes stroke width information. In order to train the network, we created a new synthetic data set with 100K text images. Tested against standard benchmarks with pixellevel annotation (ICDAR 2003, ICDAR 2011, and SVT) our algorithm outperforms the state of the art by a noticeable margin.

Cite

Text

Qin et al. "Robust and Accurate Text Stroke Segmentation." IEEE/CVF Winter Conference on Applications of Computer Vision, 2018. doi:10.1109/WACV.2018.00033

Markdown

[Qin et al. "Robust and Accurate Text Stroke Segmentation." IEEE/CVF Winter Conference on Applications of Computer Vision, 2018.](https://mlanthology.org/wacv/2018/qin2018wacv-robust/) doi:10.1109/WACV.2018.00033

BibTeX

@inproceedings{qin2018wacv-robust,
  title     = {{Robust and Accurate Text Stroke Segmentation}},
  author    = {Qin, Siyang and Ren, Peng and Kim, Seongdo and Manduchi, Roberto},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2018},
  pages     = {242-250},
  doi       = {10.1109/WACV.2018.00033},
  url       = {https://mlanthology.org/wacv/2018/qin2018wacv-robust/}
}