Finite Sample Complexity of Rare Pattern Anomaly Detection
Abstract
Anomaly detection is a fundamental problem for which a wide variety of algorithms have been developed. However, compared to supervised learning, there has been very little work aimed at understanding the sample complexity of anomaly detection. In this paper, we take a step in this direction by introducing a Probably Approximately Correct (PAC) framework for anomaly detection based on the identification of rare patterns. In analogy with the PAC framework for supervised learning, we develop sample complexity results that relate the complexity of the pattern space to the data requirements needed for PAC guarantees. We instantiate the general result for a number of pattern spaces, some of which are implicit in current state-of-the-art anomaly detectors. Finally, we design a new simple anomaly detection algorithm motivated by our analysis and show experimentally on several benchmark problems that it is competitive with a state-of-the-art detector using the same pattern space.
Cite
Text
Siddiqui et al. "Finite Sample Complexity of Rare Pattern Anomaly Detection." Conference on Uncertainty in Artificial Intelligence, 2016.Markdown
[Siddiqui et al. "Finite Sample Complexity of Rare Pattern Anomaly Detection." Conference on Uncertainty in Artificial Intelligence, 2016.](https://mlanthology.org/uai/2016/siddiqui2016uai-finite/)BibTeX
@inproceedings{siddiqui2016uai-finite,
title = {{Finite Sample Complexity of Rare Pattern Anomaly Detection}},
author = {Siddiqui, Md Amran and Fern, Alan and Dietterich, Thomas G. and Das, Shubhomoy},
booktitle = {Conference on Uncertainty in Artificial Intelligence},
year = {2016},
url = {https://mlanthology.org/uai/2016/siddiqui2016uai-finite/}
}