Learning to Draw Text in Natural Images with Conditional Adversarial Networks

Abstract

In this work, we propose an entirely learning-based method to automatically synthesize text sequence in natural images leveraging conditional adversarial networks. As vanilla GANs are clumsy to capture structural text patterns, directly employing GANs for text image synthesis typically results in illegible images. Therefore, we design a two-stage architecture to generate repeated characters in images. Firstly, a character generator attempts to synthesize local character appearance independently, so that the legible characters in sequence can be obtained. To achieve style consistency of characters, we propose a novel style loss based on variance-minimization. Secondly, we design a pixel-manipulation word generator constrained by self-regularization, which learns to convert local characters to plausible word image. Experiments on SVHN dataset and ICDAR, IIIT5K datasets demonstrate our method is able to synthesize visually appealing text images. Besides, we also show the high-quality images synthesized by our method can be used to boost the performance of a scene text recognition algorithm.

Cite

Text

Fang et al. "Learning to Draw Text in Natural Images with Conditional Adversarial Networks." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/101

Markdown

[Fang et al. "Learning to Draw Text in Natural Images with Conditional Adversarial Networks." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/fang2019ijcai-learning/) doi:10.24963/IJCAI.2019/101

BibTeX

@inproceedings{fang2019ijcai-learning,
  title     = {{Learning to Draw Text in Natural Images with Conditional Adversarial Networks}},
  author    = {Fang, Shancheng and Xie, Hongtao and Chen, Jianjun and Tan, Jianlong and Zhang, Yongdong},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {715-722},
  doi       = {10.24963/IJCAI.2019/101},
  url       = {https://mlanthology.org/ijcai/2019/fang2019ijcai-learning/}
}