Detecting Tampered Scene Text in the Wild
Abstract
Text manipulation technologies cause serious worries in recent years, however, corresponding tampering detection methods have not been well explored. In this paper, we introduce a new task, named Tampered Scene Text Detection (TSTD), to localize text instances and recognize the texture authenticity in an end-to-end manner. Different from the general scene text detection (STD) task, TSTD further introduces the fine-grained classification, i.e. the tampered and real-world texts share a semantic space (text position and geometric structure) but have different local textures. To this end, we propose a simple yet effective modification strategy to migrate existing STD methods to TSTD task, keeping the semantic invariance while explicitly guiding the class-specific texture feature learning. Furthermore, we discuss the potential of frequency information for distinguishing feature learning, and propose a parallel-branch feature extractor to enhance the feature representation capability. To evaluate the effectiveness of our method, a new TSTD dataset (Tampered-IC13) is proposed and released at https://github.com/wangyuxin87/Tampered-IC13.
Cite
Text
Wang et al. "Detecting Tampered Scene Text in the Wild." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19815-1_13Markdown
[Wang et al. "Detecting Tampered Scene Text in the Wild." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/wang2022eccv-detecting/) doi:10.1007/978-3-031-19815-1_13BibTeX
@inproceedings{wang2022eccv-detecting,
title = {{Detecting Tampered Scene Text in the Wild}},
author = {Wang, Yuxin and Xie, Hongtao and Xing, Mengting and Wang, Jing and Zhu, Shenggao and Zhang, Yongdong},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19815-1_13},
url = {https://mlanthology.org/eccv/2022/wang2022eccv-detecting/}
}