Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection

Abstract

We introduce a novel, general purpose audio generation framework specifically designed for Audio Anomaly Detection and Localization. Unlike existing datasets that predominantly focus on industrial and machine-related sounds, our framework focuses a broader range of environments, particularly useful in real-world scenarios where only audio data are available, such as in video-derived or telephonic audio. To generate such data, we propose a new method, Audio Anomaly Data Generation(AADG), inspired by the LLM-Modulo framework, which leverages Large Language Models(LLMs) as world models to simulate such real-world scenarios. This tool is modular, allowing for a plug-and-play approach. It works by first using LLMs to predict plausible real-world scenarios. An LLM further extracts the constituent sounds, the order and the way in which these should be merged to create coherent wholes. We include a rigorous verification of each output stage, ensuring the reliability of the generated data. The data produced using the framework serves as a benchmark for anomaly detection applications, potentially enhancing the performance of models trained on audio data, particularly in handling out-of-distribution cases. Our contributions thus fill a critical void in audio anomaly detection resources and provide a scalable tool for generating diverse, realistic audio data.

Cite

Text

Raghavan et al. "Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection." ICLR 2025 Workshops: SynthData, 2025.

Markdown

[Raghavan et al. "Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection." ICLR 2025 Workshops: SynthData, 2025.](https://mlanthology.org/iclrw/2025/raghavan2025iclrw-you/)

BibTeX

@inproceedings{raghavan2025iclrw-you,
  title     = {{Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection}},
  author    = {Raghavan, Ksheeraja and Gode, Samiran and Shah, Ankit and Raghavan, Surabhi and Burgard, Wolfram and Raj, Bhiksha and Singh, Rita},
  booktitle = {ICLR 2025 Workshops: SynthData},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/raghavan2025iclrw-you/}
}