Open-Set Text Recognition via Character-Context Decoupling
Abstract
The open-set text recognition task is an emerging challenge that requires an extra capability to cognize novel characters during evaluation. We argue that a major cause of the limited performance for current methods is the confounding effect of contextual information over the visual information of individual characters. Under open-set scenarios, the intractable bias in contextual information can be passed down to visual information, consequently impairing the classification performance. In this paper, a Character-Context Decoupling framework is proposed to alleviate this problem by separating contextual information and character-visual information. Contextual information can be decomposed into temporal information and linguistic information. Here, temporal information that models character order and word length is isolated with a detached temporal attention module. Linguistic information that models n-gram and other linguistic statistics is separated with a decoupled context anchor mechanism. A variety of quantitative and qualitative experiments show that our method achieves promising performance on open-set, zero-shot, and close-set text recognition datasets.
Cite
Text
Liu et al. "Open-Set Text Recognition via Character-Context Decoupling." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00448Markdown
[Liu et al. "Open-Set Text Recognition via Character-Context Decoupling." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/liu2022cvpr-openset/) doi:10.1109/CVPR52688.2022.00448BibTeX
@inproceedings{liu2022cvpr-openset,
title = {{Open-Set Text Recognition via Character-Context Decoupling}},
author = {Liu, Chang and Yang, Chun and Yin, Xu-Cheng},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2022},
pages = {4523-4532},
doi = {10.1109/CVPR52688.2022.00448},
url = {https://mlanthology.org/cvpr/2022/liu2022cvpr-openset/}
}