RDDM: Reactive drift detection method

doi:10.1016/j.eswa.2017.08.023

Expert Systems with Applications

Volume 90, 30 December 2017, Pages 344-355

https://doi.org/10.1016/j.eswa.2017.08.023

Highlights

• RDDM: a new concept drift detection method inspired by DDM.
• Tackles DDM's lack of sensitivity when concepts are very long.
• Tested against DDM, ECDD and STEPD using Naive Bayes as base learner.
• RDDM was significantly superior to the other three methods in accuracy.
• RDDM presented the best balance of false negative and false positive detections.

Abstract

Concept drift detectors are online learning programs that attempt to estimate the positions of drifts in data streams so that the base classifier can be modified after these changes, improving accuracy. This is very important in applications such as the detection of anomalies in TCP/IP traffic and of fraud in financial transactions. Drift Detection Method (DDM) is a simple, efficient, well-known method whose performance is often impaired when the concepts are very long. This article proposes the Reactive Drift Detection Method (RDDM), which is based on DDM and, among other modifications, discards older instances of very long concepts, aiming to detect drifts earlier and thus improve the final accuracy. Experiments run in MOA, using abrupt and gradual concept drift versions of different dataset generators and sizes (48 artificial datasets in total), as well as three real-world datasets, suggest RDDM beats the accuracy results of DDM, ECDD, and STEPD in most scenarios.

Introduction

Data stream environments frequently contain very large, possibly infinite, amounts of data that flow rapidly and continuously. Thus, methods that learn from data streams normally operate under restrictions on memory usage and run-time. Also, reading the same instance of data more than once is usually not possible. In addition, this scenario, often referred to as online learning, considers the possibility of concept drift (Gama, Žliobaitė, Bifet, Pechenizkiy, & Bouchachia, 2014), a situation commonly characterized by changes in the target distribution of the data over time.

The most common classification of concept drift is based on the speed of the changes. When the changes from one concept to another are sudden and/or rapid, they are called abrupt and, when the transitions between concepts occur over a number of instances, they are called gradual.
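The distinction between abrupt and gradual drift can be illustrated with a small sketch. The function name `concept_at`, the `width` parameter, and the linear ramp used for the gradual transition are assumptions made for illustration only; they are not taken from the article or from any particular dataset generator.

```python
import random

def concept_at(t, drift_point, width=0, rng=random):
    """Return 0 (old concept) or 1 (new concept) for the instance at position t.

    width <= 0 models an abrupt drift: the concept switches exactly at drift_point.
    width > 0 models a gradual drift: over `width` instances centred on
    drift_point, the probability of drawing from the new concept ramps
    linearly from 0 to 1 (the linear ramp is an illustrative assumption).
    """
    if width <= 0:
        return 1 if t >= drift_point else 0
    start = drift_point - width / 2
    prob_new = min(1.0, max(0.0, (t - start) / width))
    return 1 if rng.random() < prob_new else 0

rng = random.Random(42)  # fixed seed so the sketch is repeatable
abrupt = [concept_at(t, 500) for t in range(1000)]
gradual = [concept_at(t, 500, width=200, rng=rng) for t in range(1000)]
```

In the abrupt stream every instance before position 500 comes from the old concept and every later one from the new concept. In the gradual stream the two concepts are mixed only inside the transition window (positions 400 to 600 here).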

There are many examples of online learning applications, including the detection of climate change or spam in e-mail messages, as well as monitoring movement data from sensors or changes in water temperature, among others.

Drift Detection Method (DDM) (Gama, Medas, Castillo, & Rodrigues, 2004) is probably the best-known, most used, and most cited drift detector, especially because it presents a good all-round performance (Gonçalves, Santos, Barros, & Vieira, 2014), despite being reasonably simple.

One of the well-known problems with DDM is that its performance usually worsens when the concepts are very long (Salperwyck, Boullé, & Lemaire, 2015), because it tends to become less sensitive to concept drifts, taking too many instances to detect the changes.
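The sensitivity loss can be reproduced with a minimal sketch of a DDM-style test. It assumes the usual DDM statistics (running error rate, its binomial standard deviation, and the minimum of their sum) and a drift level of 3 standard deviations. The function names and the deterministic error streams are hypothetical and chosen only so that the result is repeatable; this is not the MOA implementation.

```python
import math

def ddm_first_drift(errors, min_instances=30, alpha_d=3.0):
    """Return the 1-based position at which a DDM-style test signals drift, or None.

    errors: iterable of 0/1 flags (1 = the base learner misclassified).
    """
    n = n_errors = 0
    p_min = s_min = float("inf")
    for e in errors:
        n += 1
        n_errors += e
        if n < min_instances:
            continue
        p = n_errors / n                      # running error rate
        s = math.sqrt(p * (1.0 - p) / n)      # its standard deviation
        if p + s < p_min + s_min:             # remember the best point so far
            p_min, s_min = p, s
        # Strict inequality: avoids firing at the instant the minimum is recorded.
        if p + s > p_min + alpha_d * s_min:
            return n
    return None

def error_stream(stable_len, drift_len=5000):
    """Hypothetical stream: 10% errors while stable, then 50% errors after the drift."""
    for i in range(1, stable_len + 1):
        yield 1 if i % 10 == 0 else 0
    for j in range(1, drift_len + 1):
        yield 1 if j % 2 == 0 else 0

def detection_delay(stable_len):
    """Number of post-drift instances consumed before the drift is signalled."""
    hit = ddm_first_drift(error_stream(stable_len))
    return None if hit is None else hit - stable_len

delay_short = detection_delay(1_000)     # drift after a short concept
delay_long = detection_delay(100_000)    # same drift after a very long concept
print(delay_short, delay_long)
```

Because the error rate is averaged over every instance seen since the concept began, the same jump in errors moves the average far more slowly after 100,000 stable instances than after 1,000, so the second delay comes out several times larger than the first.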

This work proposes the Reactive Drift Detection Method (RDDM), which is based on DDM and, among other heuristic modifications, adds an explicit mechanism to discard older instances of very long concepts to overcome or at least alleviate the performance loss problem of DDM. We claim RDDM is better than DDM as it should deliver higher or equal global accuracy in most situations by detecting most drifts earlier than DDM would.

In addition, using the Massive Online Analysis (MOA) framework (Bifet, Holmes, Kirkby, & Pfahringer, 2010a), we tested DDM, RDDM, and other detectors in a considerably large number of scenarios, with both artificial and real-world datasets, and statistically evaluated the results.

The rest of this article is organized as follows: Section 2 briefly surveys related work, with special attention given to DDM; Section 3 describes RDDM and presents its abstract pseudo-code; Section 4 details the experimental configuration, including brief descriptions of the datasets used in the tests; Section 5 discusses the results obtained, analyses the drift identifications, and performs statistical evaluations of accuracy and of memory and run-time consumption; and, finally, Section 6 presents our conclusions and proposes future work.

Section snippets

Related work: drift detection

Different approaches have been proposed to learn from data streams containing concept drift. One of the simplest is based on concept drift detection methods (Gonçalves et al., 2014), lightweight programs that usually work together with a separate base classifier.

Other proposals adopt more sophisticated strategies, sometimes using ensembles with a base learner and computing different weighting functions to perform the classification, e.g. Dynamic Weighted Majority (DWM) (Kolter & Maloof, 2007),

Reactive drift detection method

This section provides a detailed description of RDDM, our original proposal to overcome deficiencies and thus improve the detections and accuracy results of DDM. This includes our motivation and heuristic assumptions, as well as all important details of the corresponding implementation in MOA.

As already mentioned, the main idea behind RDDM is to periodically shorten the number of instances of very long stable concepts to tackle a known performance loss problem of DDM. It is assumed that such a
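The idea of periodically shortening a long stable concept can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' pseudo-code or the MOA implementation: the class name `ReactiveDDMSketch`, the parameters `max_concept_size` and `keep_last`, and their values are hypothetical, and the other modifications that RDDM makes to DDM are omitted.

```python
import math
from collections import deque

class ReactiveDDMSketch:
    """DDM-style detector that rebuilds its statistics from recent instances only."""

    def __init__(self, min_instances=30, alpha_w=2.0, alpha_d=3.0,
                 max_concept_size=2000, keep_last=500):  # hypothetical sizes
        self.min_instances = min_instances
        self.alpha_w, self.alpha_d = alpha_w, alpha_d
        self.max_concept_size = max_concept_size
        self.recent = deque(maxlen=keep_last)  # most recent error flags
        self._reset_stats()

    def _reset_stats(self):
        self.n = self.n_errors = 0
        self.p_min = self.s_min = float("inf")

    def _accumulate(self, error):
        """Fold one 0/1 error flag into the statistics and return (p, s)."""
        self.n += 1
        self.n_errors += error
        p = self.n_errors / self.n
        s = math.sqrt(p * (1.0 - p) / self.n)
        if self.n >= self.min_instances and p + s < self.p_min + self.s_min:
            self.p_min, self.s_min = p, s
        return p, s

    def update(self, error):
        """Process one prediction outcome; return 'stable', 'warning' or 'drift'."""
        self.recent.append(error)
        if self.n >= self.max_concept_size:
            # Soft reset: discard older instances and recompute the statistics
            # from the buffered recent ones. The base learner is left untouched.
            self._reset_stats()
            for e in list(self.recent)[:-1]:
                self._accumulate(e)
        p, s = self._accumulate(error)
        if self.n < self.min_instances:
            return "stable"
        if p + s > self.p_min + self.alpha_d * self.s_min:
            self._reset_stats()
            self.recent.clear()
            return "drift"
        if p + s > self.p_min + self.alpha_w * self.s_min:
            return "warning"
        return "stable"

def detection_delay(detector, stable_len=100_000, drift_len=5000):
    """Hypothetical stream: 10% errors while stable, then 50% errors."""
    for i in range(1, stable_len + 1):
        detector.update(1 if i % 10 == 0 else 0)
    for j in range(1, drift_len + 1):
        if detector.update(1 if j % 2 == 0 else 0) == "drift":
            return j
    return None

plain_delay = detection_delay(ReactiveDDMSketch(max_concept_size=10**9))
reactive_delay = detection_delay(ReactiveDDMSketch())
print(plain_delay, reactive_delay)
```

Setting `max_concept_size` to a very large value disables the soft reset, so the first detector behaves like a plain DDM-style test. On this stream the detector with the soft reset needs fewer post-drift instances to signal the change, since its statistics never span more than `max_concept_size` instances.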

Experimental setting

This section describes all the relevant information on the experiments designed to test and evaluate RDDM against DDM and other drift detectors.

To allow for a fair comparison, all the drift detection methods used Naive Bayes (NB) as base learner, because it is a simple, fast, efficient, and freely available method, which is often used in experiments in the data stream area. Also, the first three parameters of RDDM were exceptionally set with the same values used by DDM, i.e. $n = 30$, $\alpha_w = 2.0$, $\alpha_d = 3.0$.
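For reference, the role of these three parameters in the standard DDM test (Gama et al., 2004) can be written out as below. The symbols $p_i$, $s_i$, $p_{min}$ and $s_{min}$ are the usual DDM notation and are assumed here, since this excerpt does not define them: $p_i$ is the error rate of the base learner after $i$ instances, $s_i$ its standard deviation, and $p_{min}$, $s_{min}$ the values recorded when $p_i + s_i$ was lowest.

$$
s_i = \sqrt{\frac{p_i (1 - p_i)}{i}}, \qquad
\text{warning: } p_i + s_i \ge p_{min} + \alpha_w \, s_{min}, \qquad
\text{drift: } p_i + s_i \ge p_{min} + \alpha_d \, s_{min}
$$

The tests are applied only after at least $n$ instances have been seen, so with the values above a warning is raised at 2 standard deviations and a drift at 3 standard deviations over the best recorded point.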

The accuracy

Experimental results and analysis

This section presents the results of the experiments and includes analyses of accuracy, concept drift identifications, as well as memory and run-time usage over the 51 tested datasets.

Conclusion

This article proposed RDDM, a new method for concept drift detection in data streams, rooted in DDM, and motivated by a drop in performance, caused by sensitivity loss, which usually affects DDM when the concepts become very long.

Specifically, RDDM implements a softer type of concept drift that does not affect the base learner and is performed after long periods within a stable concept state, periodically recalculating the DDM statistics responsible for detecting the warning and drift levels,

Acknowledgments

Silas Santos is supported by a postgraduate grant from CNPq. The authors also thank Bruno Maciel for his MOA script generator and results extraction tool, which is still under development but greatly helped speed up the generation of scripts and the analysis of the results of the experiments.

References (35)

P.M. Gonçalves et al.
A comparative study on concept drift detectors
Expert Systems with Applications
(2014)

S. Sakthithasan et al.
One pass concept change detection for data streams
Advances in knowledge discovery and data mining
(2013)

R. Agrawal et al.
Database mining: A performance perspective
IEEE Transactions on Knowledge and Data Engineering
(1993)

S.H. Bach et al.
Paired learners for concept drift
Proceedings of the 8th IEEE international conference on data mining (ICDM’08), Pisa, Italy
(2008)

M. Baena-Garcia et al.
Early drift detection method
Proceedings of the fourth international workshop on knowledge discovery from data streams
(2006)

R.S.M. Barros
Advances in data stream mining with concept drift
(2017)

R.S.M. Barros et al.
A boosting-like online learning ensemble
Proceedings of the international joint conference on neural networks (IJCNN)
(2016)

A. Bifet et al.
Learning from time-changing data with adaptive windowing
Proceedings of the 7th SIAM international conference on data mining (SDM’07), Minneapolis, MN, USA
(2007)

A. Bifet et al.
MOA: Massive online analysis
Journal of Machine Learning Research
(2010)

A. Bifet et al.
Leveraging bagging for evolving data streams
Machine learning and knowledge discovery in databases
(2010)

A. Bifet et al.
New ensemble methods for evolving data streams
Proceedings of the 15th ACM international conference on knowledge discovery and data mining (KDD’09), Paris, France
(2009)

A. Bifet et al.
Pitfalls in benchmarking data stream classification and how to avoid them
Machine learning and knowledge discovery in databases
(2013)

T.H. Cormen et al.
Introduction to algorithms
(2009)

A.P. Dawid
Present position and potential developments: Some personal views: Statistical theory: The prequential approach
Journal of the Royal Statistical Society. Series A (General)
(1984)

J. Demšar
Statistical comparisons of classifiers over multiple data sets
Journal of Machine Learning Research
(2006)

L. Du et al.
A selective detector ensemble for concept drift detection
The Computer Journal
(2014)

I. Frías-Blanco et al.
Online and non-parametric drift detection methods based on Hoeffding’s bounds
IEEE Transactions on Knowledge and Data Engineering
(2015)

