SQUAD: Scalar Quantized Representation Learning for Unsupervised Anomaly Detection and Localization

Abstract

The practical application of reconstruction-based models faces persistent challenges in enhancing reconstruction quality and accurately addressing anomalies, often called the “identical shortcut” phenomenon. This limitation has led to a gradual decline in the use of such methods. We introduce Scalar Quantized Unsupervised Anomaly Detection (SQUAD), an advanced framework based on the Vector Quantized Variational Autoencoder (VQ-VAE), to address these issues. SQUAD incorporates a novel quantization module, Finite Scalar Quantization (FSQ), alongside a discriminator, effectively overcoming problems like the identical shortcut phenomenon and codebook collapse. By leveraging advanced feature extraction and image reconstruction techniques, SQUAD achieves superior anomaly localization, consistently surpassing state-of-the-art methods across several benchmark datasets, including MVTecAD, ViSA, MPDD, BTAD, and KSDD2. Furthermore, SQUAD demonstrates superior results in image and pixel-level AUROC evaluations, mainly performing well in Pixel AP and PRO metrics.

Cite

Text

Lin and Lai. "SQUAD: Scalar Quantized Representation Learning for Unsupervised Anomaly Detection and Localization." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-92805-5_11

Markdown

[Lin and Lai. "SQUAD: Scalar Quantized Representation Learning for Unsupervised Anomaly Detection and Localization." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/lin2024eccvw-squad/) doi:10.1007/978-3-031-92805-5_11

BibTeX

@inproceedings{lin2024eccvw-squad,
  title     = {{SQUAD: Scalar Quantized Representation Learning for Unsupervised Anomaly Detection and Localization}},
  author    = {Lin, Shih-Chih and Lai, Shang-Hong},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {166-182},
  doi       = {10.1007/978-3-031-92805-5_11},
  url       = {https://mlanthology.org/eccvw/2024/lin2024eccvw-squad/}
}