PGNet: Real-Time Arbitrarily-Shaped Text Spotting with Point Gathering Network
Abstract
The reading of arbitrarily-shaped text has received increasing research attention. However, existing text spotters are mostly built on two-stage frameworks or character-based methods, which suffer from either Non-Maximum Suppression (NMS), Region-of-Interest (RoI) operations, or character-level annotations. In this paper, to address the above problems, we propose a novel fully convolutional Point Gathering Network (PGNet) for reading arbitrarily-shaped text in real-time. The PGNet is a single-shot text spotter, where the pixel-level character classification map is learned with proposed PG-CTC loss avoiding the usage of character-level annotations. With a PG-CTC decoder, we gather high-level character classification vectors from two-dimensional space and decode them into text symbols without NMS and RoI operations involved, which guarantees high efficiency. Additionally, reasoning the relations between each character and its neighbors, a graph refinement module (GRM) is proposed to optimize the coarse recognition and improve the end-to-end performance. Experiments prove that the proposed method achieves competitive accuracy, meanwhile significantly improving the running speed. In particular, in Total-Text, it runs at 46.7 FPS, surpassing the previous spotters with a large margin.
Cite
Text
Wang et al. "PGNet: Real-Time Arbitrarily-Shaped Text Spotting with Point Gathering Network." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I4.16383Markdown
[Wang et al. "PGNet: Real-Time Arbitrarily-Shaped Text Spotting with Point Gathering Network." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/wang2021aaai-pgnet/) doi:10.1609/AAAI.V35I4.16383BibTeX
@inproceedings{wang2021aaai-pgnet,
title = {{PGNet: Real-Time Arbitrarily-Shaped Text Spotting with Point Gathering Network}},
author = {Wang, Pengfei and Zhang, Chengquan and Qi, Fei and Liu, Shanshan and Zhang, Xiaoqiang and Lyu, Pengyuan and Han, Junyu and Liu, Jingtuo and Ding, Errui and Shi, Guangming},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2021},
pages = {2782-2790},
doi = {10.1609/AAAI.V35I4.16383},
url = {https://mlanthology.org/aaai/2021/wang2021aaai-pgnet/}
}