TextBoxes: A Fast Text Detector with a Single Deep Neural Network

Abstract

This paper presents an end-to-end trainable fast scene text detector, named TextBoxes, which detects scene text with both high accuracy and efficiency in a single network forward pass, involving no post-process except for a standard non-maximum suppression. TextBoxes outperforms competing methods in terms of text localization accuracy and is much faster, taking only 0.09s per image in a fast implementation. Furthermore, combined with a text recognizer, TextBoxes significantly outperforms state-of-the-art approaches on word spotting and end-to-end text recognition tasks.

Cite

Text

Liao et al. "TextBoxes: A Fast Text Detector with a Single Deep Neural Network." AAAI Conference on Artificial Intelligence, 2017. doi:10.1609/AAAI.V31I1.11196

Markdown

[Liao et al. "TextBoxes: A Fast Text Detector with a Single Deep Neural Network." AAAI Conference on Artificial Intelligence, 2017.](https://mlanthology.org/aaai/2017/liao2017aaai-textboxes/) doi:10.1609/AAAI.V31I1.11196

BibTeX

@inproceedings{liao2017aaai-textboxes,
  title     = {{TextBoxes: A Fast Text Detector with a Single Deep Neural Network}},
  author    = {Liao, Minghui and Shi, Baoguang and Bai, Xiang and Wang, Xinggang and Liu, Wenyu},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2017},
  pages     = {4161-4167},
  doi       = {10.1609/AAAI.V31I1.11196},
  url       = {https://mlanthology.org/aaai/2017/liao2017aaai-textboxes/}
}