Text-Guided Fine-Grained Counterfactual Inference for Short Video Fake News Detection
Abstract
Detecting fake news in short videos is crucial for combating misinformation. Existing methods utilize topic modeling and co-attention mechanism, overlooking the modality heterogeneity and resulting in suboptimal performance. To address this issue, we introduce Text-Guided Fine-grained Counterfactual Inference for Short Video Fake News detection (TGFC-SVFN). TGFC-SVFN leverages modality bias removal and teacher-model-enhanced inter-modal knowledge distillation to integrate the heterogeneous modalities in short videos. Specifically, we use causality-based reasoning prompts guided text as teacher model, which then transfers knowledge to the video and audio student models. Subsequently, a multi-head attention mechanism is employed to fuse information from different modalities. In each module, we utilize fine-grained counterfactual inference based on a diffusion model to eliminate modality bias. Experimental results on publicly available fake short video news datasets demonstrate that our method outperforms state-of-the-art techniques.
Cite
Text
Zong et al. "Text-Guided Fine-Grained Counterfactual Inference for Short Video Fake News Detection." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I1.32112Markdown
[Zong et al. "Text-Guided Fine-Grained Counterfactual Inference for Short Video Fake News Detection." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zong2025aaai-text/) doi:10.1609/AAAI.V39I1.32112BibTeX
@inproceedings{zong2025aaai-text,
title = {{Text-Guided Fine-Grained Counterfactual Inference for Short Video Fake News Detection}},
author = {Zong, Linlin and Lin, Wenmin and Zhou, Jiahui and Liu, Xinyue and Zhang, Xianchao and Xu, Bo and Wu, Shimin},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {1237-1245},
doi = {10.1609/AAAI.V39I1.32112},
url = {https://mlanthology.org/aaai/2025/zong2025aaai-text/}
}