Self-Supervised Attribute-Aware Refinement Network for Low-Quality Text Recognition

Abstract

Scene texts collected from unconstrained environments encompass various types of degradation, including low-resolution, cluttered backgrounds, and irregular shapes. Training a model for text recognition with such types of degradations is notoriously hard. In this work, we analyze this problem in terms of two attributes: semantic and a geometric attribute, which are crucial cues for describing low-quality text. To handle this issue, we propose a new Self-supervised Attribute-Aware Refinement Network (SAAR-Net) that addresses these attributes simultaneously. Specifically, a novel text refining mechanism is combined with self-supervised learning for multiple auxiliary tasks to solve this problem. In addition, it can extract semantic and geometric attributes important to text recognition by introducing mutual information constraint that explicitly preserves invariant and discriminative information across different tasks. Such learned representation encourages our method to evidently generate a clear image, thus leading to better recognition performance. Extensive results demonstrate the effectiveness in refinement and recognition simultaneously.

Cite

Text

Lee et al. "Self-Supervised Attribute-Aware Refinement Network for Low-Quality Text Recognition." European Conference on Computer Vision Workshops, 2020. doi:10.1007/978-3-030-68238-5_17

Markdown

[Lee et al. "Self-Supervised Attribute-Aware Refinement Network for Low-Quality Text Recognition." European Conference on Computer Vision Workshops, 2020.](https://mlanthology.org/eccvw/2020/lee2020eccvw-selfsupervised/) doi:10.1007/978-3-030-68238-5_17

BibTeX

@inproceedings{lee2020eccvw-selfsupervised,
  title     = {{Self-Supervised Attribute-Aware Refinement Network for Low-Quality Text Recognition}},
  author    = {Lee, Younkwan and Yoo, Heongjun and Kim, Yechan and Jeong, Jihun and Jeon, Moongu},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2020},
  pages     = {228-244},
  doi       = {10.1007/978-3-030-68238-5_17},
  url       = {https://mlanthology.org/eccvw/2020/lee2020eccvw-selfsupervised/}
}