Learning to Read Irregular Text with Attention Mechanisms

Yang, Xiao; He, Dafang; Zhou, Zihan; Kifer, Daniel; Giles, C. Lee

doi:10.24963/IJCAI.2017/458

Learning to Read Irregular Text with Attention Mechanisms

Xiao Yang, Dafang He, Zihan Zhou, Daniel Kifer, C. Lee Giles

IJCAI 2017 pp. 3280-3286

doi:10.24963/IJCAI.2017/458 /ijcai/2017/yang2017ijcai-learning/

Abstract

We present a robust end-to-end neural-based model to attentively recognize text in natural images. Particularly, we focus on accurately identifying irregular (perspectively distorted or curved) text, which has not been well addressed in the previous literature. Previous research on text reading often works with regular (horizontal and frontal) text and does not adequately generalize to processing text with perspective distortion or curving effects. Our work proposes to overcome this difficulty by introducing two learning components: (1) an auxiliary dense character detection task that helps to learn text specific visual patterns, (2) an alignment loss that provides guidance to the training of an attention model. We show with experiments that these two components are crucial for achieving fast convergence and high classification accuracy for irregular text recognition. Our model outperforms previous work on two irregular-text datasets: SVT-Perspective and CUTE80, and is also highly-competitive on several regular-text datasets containing primarily horizontal and frontal text.

PDF IJCAI Semantic Scholar

Cite

Text

Yang et al. "Learning to Read Irregular Text with Attention Mechanisms." International Joint Conference on Artificial Intelligence, 2017. doi:10.24963/IJCAI.2017/458

Markdown

[Yang et al. "Learning to Read Irregular Text with Attention Mechanisms." International Joint Conference on Artificial Intelligence, 2017.](https://mlanthology.org/ijcai/2017/yang2017ijcai-learning/) doi:10.24963/IJCAI.2017/458

BibTeX

@inproceedings{yang2017ijcai-learning,
  title     = {{Learning to Read Irregular Text with Attention Mechanisms}},
  author    = {Yang, Xiao and He, Dafang and Zhou, Zihan and Kifer, Daniel and Giles, C. Lee},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2017},
  pages     = {3280-3286},
  doi       = {10.24963/IJCAI.2017/458},
  url       = {https://mlanthology.org/ijcai/2017/yang2017ijcai-learning/}
}