Recognizing Text-Based Traffic Guide Panels with Cascaded Localization Network

Abstract

In this paper, we introduce a new top-down framework for automatic localization and recognition of text-based traffic guide panels ( http://tinyurl.com/wiki-guide-signs ) captured by car-mounted cameras from natural scene images. The proposed framework involves two contributions. First, a novel Cascaded Localization Network (CLN) joining two customized convolutional nets is proposed to detect the guide panels and the scene text on them in a coarse-to-fine manner. In this network, the popular character-wise text saliency detection is replaced with string-wise text region detection, which avoids numerous bottom-up processing steps such as character clustering and segmentation. Text information contained within detected text regions is then interpreted by a deep recurrent model without character segmentation required. Second, a temporal fusion of text region proposals across consecutive frames is introduced to significantly reduce the redundant computation in neighboring frames. A new challenging Traffic Guide Panel dataset is collected to train and evaluate the proposed framework, instead of the unsuited symbol-based traffic sign datasets. Experimental results demonstrate that our proposed framework outperforms multiple recently published text spotting frameworks in real highway scenarios.

Cite

Text

Rong et al. "Recognizing Text-Based Traffic Guide Panels with Cascaded Localization Network." European Conference on Computer Vision Workshops, 2016. doi:10.1007/978-3-319-46604-0_8

Markdown

[Rong et al. "Recognizing Text-Based Traffic Guide Panels with Cascaded Localization Network." European Conference on Computer Vision Workshops, 2016.](https://mlanthology.org/eccvw/2016/rong2016eccvw-recognizing/) doi:10.1007/978-3-319-46604-0_8

BibTeX

@inproceedings{rong2016eccvw-recognizing,
  title     = {{Recognizing Text-Based Traffic Guide Panels with Cascaded Localization Network}},
  author    = {Rong, Xuejian and Yi, Chucai and Tian, Yingli},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2016},
  pages     = {109-121},
  doi       = {10.1007/978-3-319-46604-0_8},
  url       = {https://mlanthology.org/eccvw/2016/rong2016eccvw-recognizing/}
}