Supervised Score Aggregation for Active Anomaly Detection

Abstract

Detecting rare anomalies in batches of multidimensional data is challenging. We propose a new supervised active-learning framework that sends a small number of data points from each batch to an expert for labeling as "anomaly" or "nominal". Points are selected via two mechanisms: (i) those that a supervised classifier trained on previously labeled data considers most likely to be anomalies; and (ii) those suggested by an active learner. Instead of training the supervised classifier directly on the raw data labeled so far, we treat the scores calculated by an ensemble of $M$ user-defined unsupervised anomaly detectors as the learner's input features. Our approach generalizes earlier attempts to linearly aggregate unsupervised anomaly detector scores, and broadens the scope of these methods from unordered bags of data to ordered data such as time series. Trials on simulated and real data suggest that this method usually outperforms linear strategies, often significantly. The Python library acanag implements our proposed method.
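
The selection loop described above can be illustrated with a minimal NumPy sketch. This is not the acanag API and not the authors' implementation: every function name here (`detector_scores`, `knn_anomaly_proba`, `select_queries`) is hypothetical, the three toy detectors stand in for the $M$ user-defined ones, a k-nearest-neighbor vote in score space stands in for the supervised classifier, and simple uncertainty sampling stands in for the active learner, whose actual form the abstract does not specify.

```python
import numpy as np


def detector_scores(batch, reference, k=5):
    """Stack M = 3 toy unsupervised anomaly scores as columns (higher = more anomalous)."""
    mu = reference.mean(axis=0)
    sigma = reference.std(axis=0) + 1e-9
    z = (batch - mu) / sigma
    s_norm = np.linalg.norm(z, axis=1)   # distance from the reference mean in z-space
    s_max = np.abs(z).max(axis=1)        # largest single-coordinate deviation
    dists = np.linalg.norm(batch[:, None, :] - reference[None, :, :], axis=2)
    s_knn = np.sort(dists, axis=1)[:, k - 1]  # distance to k-th nearest reference point
    return np.column_stack([s_norm, s_max, s_knn])


def knn_anomaly_proba(train_scores, train_labels, query_scores, k=5):
    """Stand-in classifier: fraction of anomalies among the k nearest labeled points,
    with distances measured in detector-score space rather than raw-data space."""
    k = min(k, len(train_labels))
    dists = np.linalg.norm(
        query_scores[:, None, :] - train_scores[None, :, :], axis=2
    )
    nearest = np.argsort(dists, axis=1)[:, :k]
    return train_labels[nearest].mean(axis=1)


def select_queries(proba, n_top, n_active):
    """Mechanism (i): the n_top points with highest predicted anomaly probability.
    Mechanism (ii), assumed here to be uncertainty sampling: among the remaining
    points, the n_active whose probability is closest to 0.5."""
    order = np.argsort(-proba, kind="stable")
    top, rest = order[:n_top], order[n_top:]
    by_uncertainty = np.argsort(np.abs(proba[rest] - 0.5), kind="stable")
    return top, rest[by_uncertainty[:n_active]]


# Synthetic data, purely for illustration.
rng = np.random.default_rng(0)
reference = rng.normal(size=(200, 4))  # unlabeled data used to fit the toy detectors
labeled = np.vstack([rng.normal(size=(40, 4)), rng.normal(loc=4.0, size=(10, 4))])
labels = np.array([0] * 40 + [1] * 10)  # 1 = anomaly, 0 = nominal
batch = np.vstack([rng.normal(size=(95, 4)), rng.normal(loc=4.0, size=(5, 4))])

train_scores = detector_scores(labeled, reference)  # features = detector scores
batch_scores = detector_scores(batch, reference)
proba = knn_anomaly_proba(train_scores, labels, batch_scores)
top, uncertain = select_queries(proba, n_top=5, n_active=3)
to_expert = np.concatenate([top, uncertain])  # indices sent for expert labeling
```

After the expert labels the points in `to_expert`, their score vectors and labels would be appended to `train_scores` and `labels` before the next batch is processed. Any classifier that accepts a feature matrix could replace the nearest-neighbor vote; the point of the sketch is only that the classifier sees the $M$ detector scores, not the raw data.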

Cite

Text

Bleakley et al. "Supervised Score Aggregation for Active Anomaly Detection." Transactions on Machine Learning Research, 2026.

Markdown

[Bleakley et al. "Supervised Score Aggregation for Active Anomaly Detection." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/bleakley2026tmlr-supervised/)

BibTeX

@article{bleakley2026tmlr-supervised,
  title     = {{Supervised Score Aggregation for Active Anomaly Detection}},
  author    = {Bleakley, Kevin and Royer, Martin and Auder, Benjamin},
  journal   = {Transactions on Machine Learning Research},
  year      = {2026},
  url       = {https://mlanthology.org/tmlr/2026/bleakley2026tmlr-supervised/}
}