IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning

Abstract

Using extensive training data from SA-1B, the Segment Anything Model (SAM) has demonstrated exceptional generalization and zero-shot capabilities, attracting widespread attention in areas such as medical image segmentation and remote sensing image segmentation. However, its performance in the field of image manipulation detection remains largely unexplored and unconfirmed. There are two main challenges in applying SAM to image manipulation detection: a) reliance on manual prompts, and b) the difficulty of single-view information in supporting cross-dataset generalization. To address these challenges, we develops a cross-view prompt learning paradigm called IMDPrompter based on SAM. Benefiting from the design of automated prompts, IMDPrompter no longer relies on manual guidance, enabling automated detection and localization. Additionally, we propose components such as Cross-view Feature Perception, Optimal Prompt Selection, and Cross-View Prompt Consistency, which facilitate cross-view perceptual learning and guide SAM to generate accurate masks. Extensive experimental results from five datasets (CASIA, Columbia, Coverage, IMD2020, and NIST16) validate the effectiveness of our proposed method.

Cite

Text

Zhang et al. "IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning." International Conference on Learning Representations, 2025.

Markdown

[Zhang et al. "IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/zhang2025iclr-imdprompter/)

BibTeX

@inproceedings{zhang2025iclr-imdprompter,
  title     = {{IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning}},
  author    = {Zhang, Quan and Qi, Yuxin and Tang, Xi and Fang, Jinwei and Lin, Xi and Zhang, Ke and Yuan, Chun},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/zhang2025iclr-imdprompter/}
}