Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling
Abstract
Modeling complicated interactions among the ego-vehicle, road agents, and map elements has been a crucial part for safety-critical autonomous driving. Previous work on end-to-end autonomous driving relies on the attention mechanism to handle heterogeneous interactions, which fails to capture geometric priors and is also computationally intensive. In this paper, we propose the Interaction Scene Graph (ISG) as a unified method to model the interactions among the ego-vehicle, road agents, and map elements. With the representation of the ISG, the driving agents aggregate essential information from the most influential elements, including the road agents with potential collisions and the map elements to follow. Since a mass of unnecessary interactions are omitted, the more efficient scene-graph-based framework is able to focus on indispensable connections and leads to better performance. We evaluate the proposed method for end-to-end autonomous driving on the nuScenes dataset. Compared with strong baselines, our method significantly outperforms in full-stack driving tasks.
Cite
Text
Wu et al. "Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/270Markdown
[Wu et al. "Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/wu2024ijcai-unsupervised/) doi:10.24963/ijcai.2024/270BibTeX
@inproceedings{wu2024ijcai-unsupervised,
title = {{Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling}},
author = {Wu, Di and Fan, Shicai and Zhou, Xue and Yu, Li and Deng, Yuzhong and Zou, Jianxiao and Lin, Baihong},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {2442-2450},
doi = {10.24963/ijcai.2024/270},
url = {https://mlanthology.org/ijcai/2024/wu2024ijcai-unsupervised/}
}