Recognizing Text with Perspective Distortion in Natural Scenes
Abstract
This paper presents an approach to text recognition in natural scene images. Unlike most existing works which assume that texts are horizontal and frontal parallel to the image plane, our method is able to recognize perspective texts of arbitrary orientations. For individual character recognition, we adopt a bag-of-keypoints approach, in which Scale Invariant Feature Transform (SIFT) descriptors are extracted densely and quantized using a pre-trained vocabulary. Following [1, 2], the context information is utilized through lexicons. We formulate word recognition as finding the optimal alignment between the set of characters and the list of lexicon words. Furthermore, we introduce a new dataset called StreetViewText-Perspective, which contains texts in street images with a great variety of viewpoints. Experimental results on public datasets and the proposed dataset show that our method significantly outperforms the state-of-the-art on perspective texts of arbitrary orientations.
Cite
Text
Phan et al. "Recognizing Text with Perspective Distortion in Natural Scenes." International Conference on Computer Vision, 2013. doi:10.1109/ICCV.2013.76Markdown
[Phan et al. "Recognizing Text with Perspective Distortion in Natural Scenes." International Conference on Computer Vision, 2013.](https://mlanthology.org/iccv/2013/phan2013iccv-recognizing/) doi:10.1109/ICCV.2013.76BibTeX
@inproceedings{phan2013iccv-recognizing,
title = {{Recognizing Text with Perspective Distortion in Natural Scenes}},
author = {Phan, Trung Quy and Shivakumara, Palaiahnakote and Tian, Shangxuan and Tan, Chew Lim},
booktitle = {International Conference on Computer Vision},
year = {2013},
doi = {10.1109/ICCV.2013.76},
url = {https://mlanthology.org/iccv/2013/phan2013iccv-recognizing/}
}