Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics

Jia, Shan; Lyu, Reilin; Zhao, Kangran; Chen, Yize; Yan, Zhiyuan; Ju, Yan; Hu, Chuanbo; Li, Xin; Wu, Baoyuan; Lyu, Siwei

doi:10.1109/CVPRW63382.2024.00436

CVPRW 2024 pp. 4324-4333

Abstract

DeepFakes, which refer to AI-generated media content, have become an increasing concern due to their use as a means for disinformation. Detecting DeepFakes is currently solved with programmed machine learning algorithms. In this work, we investigate the capabilities of multimodal large language models (LLMs) in DeepFake detection. Through qualitative and quantitative experiments, we show that multimodal LLMs can expose AI-generated images given careful experimental design and prompt engineering. This is notable because LLMs are not inherently tailored for media forensic tasks and the process does not require programming. We discuss the limitations of multimodal LLMs for these tasks and suggest possible improvements.

Cite

Text

Jia et al. "Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00436

Markdown

[Jia et al. "Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/jia2024cvprw-chatgpt/) doi:10.1109/CVPRW63382.2024.00436

BibTeX

@inproceedings{jia2024cvprw-chatgpt,
  title     = {{Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics}},
  author    = {Jia, Shan and Lyu, Reilin and Zhao, Kangran and Chen, Yize and Yan, Zhiyuan and Ju, Yan and Hu, Chuanbo and Li, Xin and Wu, Baoyuan and Lyu, Siwei},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {4324--4333},
  doi       = {10.1109/CVPRW63382.2024.00436},
  url       = {https://mlanthology.org/cvprw/2024/jia2024cvprw-chatgpt/}
}