Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model

Zhou, Hang; Cai, Jiale; Ye, Yuteng; Feng, Yonghui; Gao, Chenxing; Yu, Junqing; Song, Zikai; Yang, Wei

doi:10.1609/AAAI.V39I10.33169

Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model

Hang Zhou, Jiale Cai, Yuteng Ye, Yonghui Feng, Chenxing Gao, Junqing Yu, Zikai Song, Wei Yang

AAAI 2025 pp. 10761-10769

doi:10.1609/AAAI.V39I10.33169 /aaai/2025/zhou2025aaai-video/

Abstract

A recent endeavor in one class of video anomaly detection is to leverage diffusion models and posit the task as a generation problem, where the diffusion model is trained to recover normal patterns exclusively, thus reporting abnormal patterns as outliers. Yet, existing attempts neglect the various formations of anomaly and predict normal samples at the feature level regardless that abnormal objects in surveillance videos are often relatively small. To address this, a novel patch-based diffusion model is proposed, specifically engineered to capture fine-grained local information. We further observe that anomalies in videos manifest themselves as deviations in both appearance and motion. Therefore, we argue that a comprehensive solution must consider both of these aspects simultaneously to achieve accurate frame prediction. To address this, we introduce innovative motion and appearance conditions that are seamlessly integrated into our patch diffusion model. These conditions are designed to guide the model in generating coherent and contextually appropriate predictions for both semantic content and motion relations. Experimental results on four challenging video anomaly detection datasets empirically substantiate the efficacy of our proposed approach, demonstrating that it consistently outperforms most existing methods in detecting abnormal behaviors.

PDF AAAI Semantic Scholar

Cite

Text

Zhou et al. "Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I10.33169

Markdown

[Zhou et al. "Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhou2025aaai-video/) doi:10.1609/AAAI.V39I10.33169

BibTeX

@inproceedings{zhou2025aaai-video,
  title     = {{Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model}},
  author    = {Zhou, Hang and Cai, Jiale and Ye, Yuteng and Feng, Yonghui and Gao, Chenxing and Yu, Junqing and Song, Zikai and Yang, Wei},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {10761-10769},
  doi       = {10.1609/AAAI.V39I10.33169},
  url       = {https://mlanthology.org/aaai/2025/zhou2025aaai-video/}
}