Fast Hoeffding Drift Detection Method for Evolving Data Streams
Abstract
Decision makers increasingly require near-instant models to make sense of fast evolving data streams. Learning from such evolving environments is, however, a challenging task. This challenge is partially due to the fact that the distribution of data often changes over time, thus potentially leading to degradation in the overall performance. In particular, classification algorithms need to adapt their models after facing such distributional changes (also referred to as concept drifts). Usually, drift detection methods are utilized in order to accomplish this task. It follows that detecting concept drifts as soon as possible, while resulting in fewer false positives and false negatives, is a major objective of drift detectors. To this end, we introduce the Fast Hoeffding Drift Detection Method (FHDDM) which detects the drift points using a sliding window and Hoeffding’s inequality. FHDDM detects a drift when a significant difference between the maximum probability of correct predictions and the most recent probability of correct predictions is observed. Experimental results confirm that FHDDM detects drifts with less detection delay, less false positive and less false negative, when compared to the state-of-the-art.
Cite
Text
Pesaranghader and Viktor. "Fast Hoeffding Drift Detection Method for Evolving Data Streams." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2016. doi:10.1007/978-3-319-46227-1_7Markdown
[Pesaranghader and Viktor. "Fast Hoeffding Drift Detection Method for Evolving Data Streams." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2016.](https://mlanthology.org/ecmlpkdd/2016/pesaranghader2016ecmlpkdd-fast/) doi:10.1007/978-3-319-46227-1_7BibTeX
@inproceedings{pesaranghader2016ecmlpkdd-fast,
title = {{Fast Hoeffding Drift Detection Method for Evolving Data Streams}},
author = {Pesaranghader, Ali and Viktor, Herna L.},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2016},
pages = {96-111},
doi = {10.1007/978-3-319-46227-1_7},
url = {https://mlanthology.org/ecmlpkdd/2016/pesaranghader2016ecmlpkdd-fast/}
}