Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model
Abstract
A recent line of work in one-class video anomaly detection leverages diffusion models and poses the task as a generation problem: the diffusion model is trained to recover normal patterns exclusively, so abnormal patterns are reported as outliers. Yet existing attempts neglect the varied forms that anomalies take and predict normal samples at the feature level, even though abnormal objects in surveillance videos are often relatively small. To address this, we propose a novel patch-based diffusion model specifically engineered to capture fine-grained local information. We further observe that anomalies in videos manifest as deviations in both appearance and motion, and therefore argue that a comprehensive solution must consider both aspects simultaneously to achieve accurate frame prediction. To this end, we introduce motion and appearance conditions that are integrated into our patch diffusion model and guide it toward coherent, contextually appropriate predictions of both semantic content and motion relations. Experimental results on four challenging video anomaly detection datasets substantiate the efficacy of our approach, showing that it outperforms most existing methods in detecting abnormal behaviors.
Cite
Text
Zhou et al. "Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I10.33169
Markdown
[Zhou et al. "Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhou2025aaai-video/) doi:10.1609/AAAI.V39I10.33169
BibTeX
@inproceedings{zhou2025aaai-video,
title = {{Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model}},
author = {Zhou, Hang and Cai, Jiale and Ye, Yuteng and Feng, Yonghui and Gao, Chenxing and Yu, Junqing and Song, Zikai and Yang, Wei},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {10761-10769},
doi = {10.1609/AAAI.V39I10.33169},
url = {https://mlanthology.org/aaai/2025/zhou2025aaai-video/}
}