Skip to main content
Erschienen in: Arabian Journal for Science and Engineering 8/2022

08.03.2022 | Research Article-Computer Engineering and Computer Science

Disposition-Based Concept Drift Detection and Adaptation in Data Stream

verfasst von: Supriya Agrahari, Anil Kumar Singh

Erschienen in: Arabian Journal for Science and Engineering | Ausgabe 8/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The change in data distribution over time (known as concept drift) makes the classification process complex because of the discrepancy between current and incoming data distribution. A plethora of drift detection methods often focus on the early identification of concept drift. Along with the drift, other deformities like noise and blips are also present in the data stream. These deformities may be damaged the underlying learning system by forcing adaptation to false drift. Thereby unnecessary update performs in the learning model that leads to decrease in learner’s accuracy. The existing drift detection methods are not capable of differentiating between actual and false drift. The paper proposes DBDDM, a disposition-based drift detection method, to overcome the issue of false drift. In this paper, we utilize the approximate randomization test to find the frequency of consecutive drift and compare the obtained frequency with the threshold to determine the actual drift. DBDDM compares with the several state-of-the-art methods using synthetic and real-time datasets. It exhibits a maximum increase in accuracy of 24% and 28% with a rise of 2.50 and 1.91 average ranks using Naive Bayes and the Hoeffding tree classifier, respectively.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Gama, J.; Žliobaite, I.: A survey on concept drift adaptation. In: 2015 International Joint Conference on Neural Networks (IJCNN). ACM Computing Survey, pp. 1–37 (2014) Gama, J.; Žliobaite, I.: A survey on concept drift adaptation. In: 2015 International Joint Conference on Neural Networks (IJCNN). ACM Computing Survey, pp. 1–37 (2014)
2.
Zurück zum Zitat Lu, J.; Liu, A.; Song, Y.; Zhang, G.: Data-driven decision support under concept drift in streamed big data. Complex Intell. Syst. 6(1), 157–163 (2020)CrossRef Lu, J.; Liu, A.; Song, Y.; Zhang, G.: Data-driven decision support under concept drift in streamed big data. Complex Intell. Syst. 6(1), 157–163 (2020)CrossRef
3.
Zurück zum Zitat de Barros, R.S.M.; Hidalgo, J.I.G.; de Lima Cabral, D.R.: Wilcoxon rank sum test drift detector. Neurocomputing 275, 1954–1963 (2018)CrossRef de Barros, R.S.M.; Hidalgo, J.I.G.; de Lima Cabral, D.R.: Wilcoxon rank sum test drift detector. Neurocomputing 275, 1954–1963 (2018)CrossRef
4.
Zurück zum Zitat Frías-Blanco, I.; del Campo-Ávila, J.; Ramos-Jimenez, G.; Morales-Bueno, R.; Ortiz-Díaz, A.; Caballero-Mota, Y.: Online and non-parametric drift detection methods based on Hoeffding’s bounds. IEEE Trans. Knowl. Data Eng. 27(3), 810–823 (2014)CrossRef Frías-Blanco, I.; del Campo-Ávila, J.; Ramos-Jimenez, G.; Morales-Bueno, R.; Ortiz-Díaz, A.; Caballero-Mota, Y.: Online and non-parametric drift detection methods based on Hoeffding’s bounds. IEEE Trans. Knowl. Data Eng. 27(3), 810–823 (2014)CrossRef
5.
Zurück zum Zitat Shao, J.; Ahmadi, Z.; Kramer, S.: Prototype-based learning on concept-drifting data streams. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 412–421 (2014) Shao, J.; Ahmadi, Z.; Kramer, S.: Prototype-based learning on concept-drifting data streams. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 412–421 (2014)
6.
Zurück zum Zitat Lu, N.; Lu, J.; Zhang, G.; De Mantaras, R.L.: A concept drift-tolerant case-base editing technique. Artif. Intell. 230, 108–133 (2016)MathSciNetCrossRef Lu, N.; Lu, J.; Zhang, G.; De Mantaras, R.L.: A concept drift-tolerant case-base editing technique. Artif. Intell. 230, 108–133 (2016)MathSciNetCrossRef
7.
Zurück zum Zitat Liu, A.; Lu, J.; Liu, F.; Zhang, G.: Accumulating regional density dissimilarity for concept drift detection in data streams. Pattern Recognit. 76, 256–272 (2018)CrossRef Liu, A.; Lu, J.; Liu, F.; Zhang, G.: Accumulating regional density dissimilarity for concept drift detection in data streams. Pattern Recognit. 76, 256–272 (2018)CrossRef
8.
Zurück zum Zitat Korycki, Ł.; Krawczyk, B. Adversarial concept drift detection under poisoning attacks for robust data stream mining (2020). arXiv preprint arXiv:200909497 Korycki, Ł.; Krawczyk, B. Adversarial concept drift detection under poisoning attacks for robust data stream mining (2020). arXiv preprint arXiv:​200909497
9.
Zurück zum Zitat Hulten, G.; Spencer, L.; Domingos, P.: Mining time-changing data streams. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, pp. 97–106 (2001) Hulten, G.; Spencer, L.; Domingos, P.: Mining time-changing data streams. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, pp. 97–106 (2001)
10.
Zurück zum Zitat Masud, M.M.; Gao, J.; Khan, L.; Han, J.; Thuraisingham, B.: A multi-partition multi-chunk ensemble technique to classify concept-drifting data streams. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, Springer, pp. 363–375 (2009) Masud, M.M.; Gao, J.; Khan, L.; Han, J.; Thuraisingham, B.: A multi-partition multi-chunk ensemble technique to classify concept-drifting data streams. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, Springer, pp. 363–375 (2009)
11.
Zurück zum Zitat Abdulsalam, H.; Skillicorn, D.B.; Martin, P.: Classification using streaming random forests. IEEE Trans. Knowl. Data Eng. 23(1), 22–36 (2010)CrossRef Abdulsalam, H.; Skillicorn, D.B.; Martin, P.: Classification using streaming random forests. IEEE Trans. Knowl. Data Eng. 23(1), 22–36 (2010)CrossRef
12.
Zurück zum Zitat Yu, S.; Abraham, Z.; Wang, H.; Shah, M.; Wei, Y.; Príncipe, J.C.: Concept drift detection and adaptation with hierarchical hypothesis testing. J. Frank. Inst. 356(5), 3187–3215 (2019)MathSciNetCrossRef Yu, S.; Abraham, Z.; Wang, H.; Shah, M.; Wei, Y.; Príncipe, J.C.: Concept drift detection and adaptation with hierarchical hypothesis testing. J. Frank. Inst. 356(5), 3187–3215 (2019)MathSciNetCrossRef
13.
Zurück zum Zitat Brzezinski, D.; Stefanowski, J.: Reacting to different types of concept drift: the accuracy updated ensemble algorithm. IEEE Trans. Neural Netw. Learn. Syst. 25(1), 81–94 (2013)CrossRef Brzezinski, D.; Stefanowski, J.: Reacting to different types of concept drift: the accuracy updated ensemble algorithm. IEEE Trans. Neural Netw. Learn. Syst. 25(1), 81–94 (2013)CrossRef
14.
Zurück zum Zitat Pesaranghader, A.; Viktor, H.L.: Fast Hoeffding drift detection method for evolving data streams. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Springer, pp. 96–111 (2016) Pesaranghader, A.; Viktor, H.L.: Fast Hoeffding drift detection method for evolving data streams. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Springer, pp. 96–111 (2016)
16.
Zurück zum Zitat Gama, J.; Medas, P.; Castillo, G.; Rodrigues, P.: Learning with drift detection. In: Brazilian Symposium on Artificial Intelligence, Springer, pp. 286–295 (2004) Gama, J.; Medas, P.; Castillo, G.; Rodrigues, P.: Learning with drift detection. In: Brazilian Symposium on Artificial Intelligence, Springer, pp. 286–295 (2004)
17.
Zurück zum Zitat Gama, J.; Castillo, G.: Learning with local drift detection. In: International Conference on Advanced Data Mining and Applications, Springer, pp. 42–55 (2006) Gama, J.; Castillo, G.: Learning with local drift detection. In: International Conference on Advanced Data Mining and Applications, Springer, pp. 42–55 (2006)
18.
Zurück zum Zitat Nishida, K.: Learning and Detecting Concept Drift. Information Science and Technology (2008) Nishida, K.: Learning and Detecting Concept Drift. Information Science and Technology (2008)
19.
Zurück zum Zitat Liu, A.; Song, Y.; Zhang, G.; Lu, J.: Regional concept drift detection and density synchronized drift adaptation. In: IJCAI International Joint Conference on Artificial Intelligence (2017) Liu, A.; Song, Y.; Zhang, G.; Lu, J.: Regional concept drift detection and density synchronized drift adaptation. In: IJCAI International Joint Conference on Artificial Intelligence (2017)
20.
Zurück zum Zitat Liu, A.; Zhang, G.; Lu, J.: Fuzzy time windowing for gradual concept drift adaptation. In: 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), IEEE, pp. 1–6 (2017) Liu, A.; Zhang, G.; Lu, J.: Fuzzy time windowing for gradual concept drift adaptation. In: 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), IEEE, pp. 1–6 (2017)
21.
Zurück zum Zitat Baena-Garcıa, M.; del Campo-Ávila, J.; Fidalgo, R.; Bifet, A.; Gavalda, R.; Morales-Bueno, R.: Early drift detection method. Fourth Int. Workshop Knowl. Discov. Data Streams 6, 77–86 (2006) Baena-Garcıa, M.; del Campo-Ávila, J.; Fidalgo, R.; Bifet, A.; Gavalda, R.; Morales-Bueno, R.: Early drift detection method. Fourth Int. Workshop Knowl. Discov. Data Streams 6, 77–86 (2006)
22.
Zurück zum Zitat Ross, G.J.; Adams, N.M.; Tasoulis, D.K.; Hand, D.J.: Exponentially weighted moving average charts for detecting concept drift. Pattern Recognit. Lett. 33(2), 191–198 (2012)CrossRef Ross, G.J.; Adams, N.M.; Tasoulis, D.K.; Hand, D.J.: Exponentially weighted moving average charts for detecting concept drift. Pattern Recognit. Lett. 33(2), 191–198 (2012)CrossRef
23.
Zurück zum Zitat Bifet, A.: Adaptive learning and mining for data streams and frequent patterns. ACM SIGKDD Explor. Newsl. 11(1), 55–56 (2009)CrossRef Bifet, A.: Adaptive learning and mining for data streams and frequent patterns. ACM SIGKDD Explor. Newsl. 11(1), 55–56 (2009)CrossRef
24.
Zurück zum Zitat Bifet, A.; Gavalda, R.: Learning from time-changing data with adaptive windowing. In: Proceedings of the 2007 SIAM International Conference on Data Mining, SIAM, pp. 443–448 (2007) Bifet, A.; Gavalda, R.: Learning from time-changing data with adaptive windowing. In: Proceedings of the 2007 SIAM International Conference on Data Mining, SIAM, pp. 443–448 (2007)
26.
Zurück zum Zitat Pears, R.; Sakthithasan, S.; Koh, Y.S.: Detecting concept change in dynamic data streams. Mach. Learn. 97(3), 259–293 (2014)MathSciNetCrossRef Pears, R.; Sakthithasan, S.; Koh, Y.S.: Detecting concept change in dynamic data streams. Mach. Learn. 97(3), 259–293 (2014)MathSciNetCrossRef
27.
Zurück zum Zitat Raza, H.; Prasad, G.; Li, Y.: EWMA model based shift-detection methods for detecting covariate shifts in non-stationary environments. Pattern Recognit. 48(3), 659–669 (2015)CrossRef Raza, H.; Prasad, G.; Li, Y.: EWMA model based shift-detection methods for detecting covariate shifts in non-stationary environments. Pattern Recognit. 48(3), 659–669 (2015)CrossRef
28.
Zurück zum Zitat Alippi, C.; Boracchi, G.; Roveri, M.: Hierarchical change-detection tests. IEEE Trans. Neural Netw. Learn. Syst. 28(2), 246–258 (2016)CrossRef Alippi, C.; Boracchi, G.; Roveri, M.: Hierarchical change-detection tests. IEEE Trans. Neural Netw. Learn. Syst. 28(2), 246–258 (2016)CrossRef
29.
Zurück zum Zitat Yu, S.; Abraham, Z.: Concept drift detection with hierarchical hypothesis testing. In: Proceedings of the 2017 SIAM International Conference on Data Mining, SIAM, pp. 768–776 (2017) Yu, S.; Abraham, Z.: Concept drift detection with hierarchical hypothesis testing. In: Proceedings of the 2017 SIAM International Conference on Data Mining, SIAM, pp. 768–776 (2017)
30.
Zurück zum Zitat Miyata, Y.; Ishikawa, H.: Concept drift detection on data stream for revising DBSCAN cluster. In: Proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics, pp. 104–110 (2020) Miyata, Y.; Ishikawa, H.: Concept drift detection on data stream for revising DBSCAN cluster. In: Proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics, pp. 104–110 (2020)
31.
Zurück zum Zitat Gu, F.; Zhang, G.; Lu, J.; Lin, C.T.: Concept drift detection based on equal density estimation. In: 2016 International Joint Conference on Neural Networks (IJCNN), IEEE, pp. 24–30 (2016) Gu, F.; Zhang, G.; Lu, J.; Lin, C.T.: Concept drift detection based on equal density estimation. In: 2016 International Joint Conference on Neural Networks (IJCNN), IEEE, pp. 24–30 (2016)
32.
Zurück zum Zitat Song, X.; Wu, M.; Jermaine, C.; Ranka, S.: Statistical change detection for multi-dimensional data. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 667–676 (2007) Song, X.; Wu, M.; Jermaine, C.; Ranka, S.: Statistical change detection for multi-dimensional data. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 667–676 (2007)
33.
Zurück zum Zitat Bu, L.; Alippi, C.; Zhao, D.: A pdf-free change detection test based on density difference estimation. IEEE Trans. Neural Netw. Learn. Syst. 29(2), 324–334 (2016)MathSciNetCrossRef Bu, L.; Alippi, C.; Zhao, D.: A pdf-free change detection test based on density difference estimation. IEEE Trans. Neural Netw. Learn. Syst. 29(2), 324–334 (2016)MathSciNetCrossRef
34.
Zurück zum Zitat Bu, L.; Zhao, D.; Alippi, C.: An incremental change detection test based on density difference estimation. IEEE Trans. Syst. Man Cybern. Syst. 47(10), 2714–2726 (2017)CrossRef Bu, L.; Zhao, D.; Alippi, C.: An incremental change detection test based on density difference estimation. IEEE Trans. Syst. Man Cybern. Syst. 47(10), 2714–2726 (2017)CrossRef
35.
Zurück zum Zitat Nishida, K.; Yamauchi, K.: Detecting concept drift using statistical testing. In: Corruble, V., Takeda, M., Suzuki, E. (eds.) Discovery Science, pp. 264–269. Springer, Berlin (2007)CrossRef Nishida, K.; Yamauchi, K.: Detecting concept drift using statistical testing. In: Corruble, V., Takeda, M., Suzuki, E. (eds.) Discovery Science, pp. 264–269. Springer, Berlin (2007)CrossRef
36.
Zurück zum Zitat Mahdi, O.A.; Pardede, E.; Ali, N.: A hybrid block-based ensemble framework for the multi-class problem to react to different types of drifts. Cluster Comput. 24(3), 2327–2340 (2021) Mahdi, O.A.; Pardede, E.; Ali, N.: A hybrid block-based ensemble framework for the multi-class problem to react to different types of drifts. Cluster Comput. 24(3), 2327–2340 (2021)
37.
Zurück zum Zitat Mahdi, O.A.; Pardede, E.; Ali, N.: kappa as drift detector in data stream mining. Procedia Comput. Sci. 184, 314–321 (2021)CrossRef Mahdi, O.A.; Pardede, E.; Ali, N.: kappa as drift detector in data stream mining. Procedia Comput. Sci. 184, 314–321 (2021)CrossRef
38.
Zurück zum Zitat Mehmood, H.; Kostakos, P.; Cortes, M.; Anagnostopoulos, T.; Pirttikangas, S.; Gilman, E.: Concept drift adaptation techniques in distributed environment for real-world data streams. Smart Cities 4(1), 349–371 (2021)CrossRef Mehmood, H.; Kostakos, P.; Cortes, M.; Anagnostopoulos, T.; Pirttikangas, S.; Gilman, E.: Concept drift adaptation techniques in distributed environment for real-world data streams. Smart Cities 4(1), 349–371 (2021)CrossRef
39.
Zurück zum Zitat Heusinger, M.; Schleif, F.M.: reactive concept drift detection using coresets over sliding windows. In: 2020 IEEE Symposium Series on Computational Intelligence (SSCI), IEEE, pp. 1350–1355 (2020) Heusinger, M.; Schleif, F.M.: reactive concept drift detection using coresets over sliding windows. In: 2020 IEEE Symposium Series on Computational Intelligence (SSCI), IEEE, pp. 1350–1355 (2020)
40.
Zurück zum Zitat Misra, S.; Biswas, D.; Saha, S.K.; Mazumdar, C.: Applying Fourier inspired windows for concept drift detection in data stream. In: 2020 IEEE Calcutta Conference (CALCON), IEEE, pp. 152–156 (2020) Misra, S.; Biswas, D.; Saha, S.K.; Mazumdar, C.: Applying Fourier inspired windows for concept drift detection in data stream. In: 2020 IEEE Calcutta Conference (CALCON), IEEE, pp. 152–156 (2020)
41.
Zurück zum Zitat Mahdi, O.A.; Pardede, E.; Ali, N.; Cao, J.: Diversity measure as a new drift detection method in data streaming. Knowl. Based Syst. 191, 105227 (2020)CrossRef Mahdi, O.A.; Pardede, E.; Ali, N.; Cao, J.: Diversity measure as a new drift detection method in data streaming. Knowl. Based Syst. 191, 105227 (2020)CrossRef
42.
Zurück zum Zitat Sakthithasan, S.; Pears, R.; Koh, Y.S.: One pass concept change detection for data streams. In: Pei, J., Tseng, V.S., Cao, L., Motoda, H., Xu, G. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 461–472. Springer, Berlin (2013)CrossRef Sakthithasan, S.; Pears, R.; Koh, Y.S.: One pass concept change detection for data streams. In: Pei, J., Tseng, V.S., Cao, L., Motoda, H., Xu, G. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 461–472. Springer, Berlin (2013)CrossRef
43.
Zurück zum Zitat Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH
Metadaten
Titel
Disposition-Based Concept Drift Detection and Adaptation in Data Stream
verfasst von
Supriya Agrahari
Anil Kumar Singh
Publikationsdatum
08.03.2022
Verlag
Springer Berlin Heidelberg
Erschienen in
Arabian Journal for Science and Engineering / Ausgabe 8/2022
Print ISSN: 2193-567X
Elektronische ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-022-06653-4

Weitere Artikel der Ausgabe 8/2022

Arabian Journal for Science and Engineering 8/2022 Zur Ausgabe

RESEARCH ARTICLE-COMPUTER ENGINEERING AND COMPUTER SCIENCE

Upgrading the Quality of Power Using TVSS Device and PFC Converter Fed SBLDC Motor

Research Article-Computer Engineering and Computer Science

AI-Based Mobile Edge Computing for IoT: Applications, Challenges, and Future Scope

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.