Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition

Chen, Zhongxi; Chen, Shen; Yao, Taiping; Sun, Ke; Ding, Shouhong; Lin, Xianming; Cao, Liujuan; Ji, Rongrong

doi:10.1007/978-3-031-73414-4_12

Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition

Zhongxi Chen, Shen Chen, Taiping Yao, Ke Sun, Shouhong Ding, Xianming Lin, Liujuan Cao, Rongrong Ji

ECCV 2024

doi:10.1007/978-3-031-73414-4_12 /eccv/2024/chen2024eccv-enhancing-a/

Abstract

Document image tampering poses a grave risk to the veracity of information, with potential consequences ranging from misinformation dissemination to financial and identity fraud. Current detection methods use frequency information to uncover tampering that is invisible to the naked eye. However, these methods often fail to integrate this information effectively, thereby compromising RGB detection capabilities and missing the high-frequency details necessary to detect subtle tampering. To address these gaps, we introduce a Feature Fusion and Decomposition Network (FFDN) that combines a Visual Enhancement Module (VEM) with a Wavelet-like Frequency Enhancement (WFE). Specifically, the VEM makes tampering traces visible while preserving the integrity of original RGB features using zero-initialized convolutions. Meanwhile, the WFE decomposes the features to explicitly retain high-frequency details that are often overlooked during downsampling, focusing on small but critical tampering clues. Rigorous testing on the DocTamper dataset confirms FFDN’s preeminence, significantly outperforming existing state-of-the-art methods in detecting tampering.

PDF ECCV Semantic Scholar

Cite

Text

Chen et al. "Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73414-4_12

Markdown

[Chen et al. "Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/chen2024eccv-enhancing-a/) doi:10.1007/978-3-031-73414-4_12

BibTeX

@inproceedings{chen2024eccv-enhancing-a,
  title     = {{Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition}},
  author    = {Chen, Zhongxi and Chen, Shen and Yao, Taiping and Sun, Ke and Ding, Shouhong and Lin, Xianming and Cao, Liujuan and Ji, Rongrong},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73414-4_12},
  url       = {https://mlanthology.org/eccv/2024/chen2024eccv-enhancing-a/}
}