Can We Leave Deepfake Data Behind in Training Deepfake Detector?

Abstract

The generalization ability of deepfake detectors is vital for their application in real-world scenarios. One effective way to enhance this ability is to train models with manually blended data, which we term "blendfake", encouraging them to learn generic forgery artifacts such as the blending boundary. Interestingly, current SoTA methods utilize blendfake without incorporating any deepfake data in their training process, likely because previous empirical observations suggest that vanilla hybrid training (VHT), which combines deepfake and blendfake data, yields worse performance than training on blendfake alone (the so-called "1+1<2" effect). A critical question therefore arises: Can we leave deepfake behind and rely solely on blendfake data to train an effective deepfake detector? Since deepfakes contain additional informative forgery clues (e.g., deep generative artifacts), discarding all deepfake data when training deepfake detectors seems counter-intuitive. In this paper, we rethink the role of blendfake in deepfake detection and formulate the process from real to blendfake to deepfake as a progressive transition: blendfake and deepfake serve as explicitly delineated, oriented pivot anchors along the real-to-fake transition, and the accumulated forgery information should increase progressively along it. To this end, we propose an Oriented Progressive Regularizor (OPR), which constrains the distribution of anchors to be discretely arranged, and we further introduce feature bridging to facilitate smooth transitions between adjacent anchors. Extensive experiments confirm that our design leverages the forgery information from both blendfake and deepfake effectively and comprehensively. Code is available at https://github.com/beautyremain/ProDet.
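To make the progressive-anchor idea concrete, the PyTorch snippet below is a minimal, hypothetical sketch, not the paper's actual OPR implementation: the class name ProgressiveAnchorLoss, the learnable real-to-fake direction, the hinge-based ordering term, and the interpolation-based bridging helper are all our assumptions; the authors' real code is in the linked repository.

# Hypothetical sketch of the "oriented progressive" idea described above.
# All names and design choices here are illustrative assumptions, not the
# paper's API; see https://github.com/beautyremain/ProDet for the real code.
import torch
import torch.nn.functional as F

class ProgressiveAnchorLoss(torch.nn.Module):
    """Arrange real / blendfake / deepfake as ordered anchors in feature space."""
    def __init__(self, feat_dim: int, n_anchors: int = 3, margin: float = 1.0):
        super().__init__()
        # One learnable anchor per stage: 0 = real, 1 = blendfake, 2 = deepfake.
        self.anchors = torch.nn.Parameter(torch.randn(n_anchors, feat_dim))
        # Learnable "real-to-fake" direction onto which anchors are projected.
        self.direction = torch.nn.Parameter(torch.randn(feat_dim))
        self.margin = margin

    def forward(self, feats: torch.Tensor, stage: torch.Tensor) -> torch.Tensor:
        # (1) Pull each feature toward the anchor of its stage.
        pull = F.mse_loss(feats, self.anchors[stage])
        # (2) Order anchors along the direction so forgery information
        #     accumulates progressively: proj[0] < proj[1] < proj[2] by a margin.
        d = F.normalize(self.direction, dim=0)
        proj = self.anchors @ d                                 # (n_anchors,)
        order = F.relu(self.margin - (proj[1:] - proj[:-1])).sum()
        return pull + order

def bridge_features(feats, stage, anchors):
    # (3) "Feature bridging" sketch: interpolate each sample's feature toward
    #     the next anchor to fill the gap between adjacent stages.
    nxt = torch.clamp(stage + 1, max=anchors.size(0) - 1)
    alpha = torch.rand(feats.size(0), 1, device=feats.device)
    return alpha * feats + (1 - alpha) * anchors[nxt]

Under these assumptions, the hinge term in (2) is what makes the transition "oriented" (anchors are discretely arranged along one direction), while (3) supplies intermediate features so the transition between adjacent anchors stays smooth.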

Cite

Text

Cheng et al. "Can We Leave Deepfake Data Behind in Training Deepfake Detector?" Neural Information Processing Systems, 2024. doi:10.52202/079017-0691

Markdown

[Cheng et al. "Can We Leave Deepfake Data Behind in Training Deepfake Detector?" Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/cheng2024neurips-we/) doi:10.52202/079017-0691

BibTeX

@inproceedings{cheng2024neurips-we,
  title     = {{Can We Leave Deepfake Data Behind in Training Deepfake Detector?}},
  author    = {Cheng, Jikang and Yan, Zhiyuan and Zhang, Ying and Luo, Yuhao and Wang, Zhongyuan and Li, Chen},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-0691},
  url       = {https://mlanthology.org/neurips/2024/cheng2024neurips-we/}
}