Concept Neurons - Handling Drift Issues for Real-Time Industrial Data Mining
Abstract
Learning from data streams is a challenge faced by data science professionals from multiple industries. Most of them struggle hardly on applying traditional Machine Learning algorithms to solve these problems. It happens so due to their high availability on ready-to-use software libraries on big data technologies (e.g. SparkML). Nevertheless, most of them cannot cope with the key characteristics of this type of data such as high arrival rate and/or non-stationary distributions. In this paper, we introduce a generic and yet simplistic framework to fill this gap denominated Concept Neurons. It leverages on a combination of continuous inspection schemas and residual-based updates over the model parameters and/or the model output. Such framework can empower the resistance of most of induction learning algorithms to concept drifts. Two distinct and hence closely related flavors are introduced to handle different drift types. Experimental results on successful distinct applications on different domains along transportation industry are presented to uncover the hidden potential of this methodology.
Cite
Text
Moreira-Matias et al. "Concept Neurons - Handling Drift Issues for Real-Time Industrial Data Mining." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2016. doi:10.1007/978-3-319-46131-1_18Markdown
[Moreira-Matias et al. "Concept Neurons - Handling Drift Issues for Real-Time Industrial Data Mining." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2016.](https://mlanthology.org/ecmlpkdd/2016/moreiramatias2016ecmlpkdd-concept/) doi:10.1007/978-3-319-46131-1_18BibTeX
@inproceedings{moreiramatias2016ecmlpkdd-concept,
title = {{Concept Neurons - Handling Drift Issues for Real-Time Industrial Data Mining}},
author = {Moreira-Matias, Luís and Gama, João and Mendes-Moreira, João},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2016},
pages = {96-111},
doi = {10.1007/978-3-319-46131-1_18},
url = {https://mlanthology.org/ecmlpkdd/2016/moreiramatias2016ecmlpkdd-concept/}
}