Unsupervised and Active Learning Using Maximin-Based Anomaly Detection
Abstract
Unsupervised anomaly detection is commonly performed using a distance or density based technique, such as K-Nearest neighbours, Local Outlier Factor or One-class Support Vector Machines. One-class Support Vector Machines reduce the computational cost of testing new data by providing sparse solutions. However, all these techniques have relatively high computational requirements for training. Moreover, identifying anomalies based solely on density or distance is not sufficient when both point (isolated) and cluster anomalies exist in an unlabelled training set. Finally, these unsupervised anomaly detection techniques are not readily adapted for active learning, where the training algorithm should identify examples for which labelling would make a significant impact on the accuracy of the learned model. In this paper, we propose a novel technique called Maximin-based Anomaly Detection that addresses these challenges by selecting a representative subset of data in combination with a kernel-based model construction. We show that the proposed technique (a) provides a statistically significant improvement in the accuracy as well as the computation time required for training and testing compared to several benchmark unsupervised anomaly detection techniques, and (b) effectively uses active learning with a limited budget.
Cite
Text
Ghafoori et al. "Unsupervised and Active Learning Using Maximin-Based Anomaly Detection." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2019. doi:10.1007/978-3-030-46150-8_6Markdown
[Ghafoori et al. "Unsupervised and Active Learning Using Maximin-Based Anomaly Detection." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2019.](https://mlanthology.org/ecmlpkdd/2019/ghafoori2019ecmlpkdd-unsupervised/) doi:10.1007/978-3-030-46150-8_6BibTeX
@inproceedings{ghafoori2019ecmlpkdd-unsupervised,
title = {{Unsupervised and Active Learning Using Maximin-Based Anomaly Detection}},
author = {Ghafoori, Zahra and Bezdek, James C. and Leckie, Christopher and Karunasekera, Shanika},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2019},
pages = {90-106},
doi = {10.1007/978-3-030-46150-8_6},
url = {https://mlanthology.org/ecmlpkdd/2019/ghafoori2019ecmlpkdd-unsupervised/}
}