Skip to main content

2016 | OriginalPaper | Buchkapitel

A Context-Driven Data Weighting Approach for Handling Concept Drift in Classification

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Adapting classification models to concept drift is one of the main challenges associated with applying these models in dynamic environments. In particular, the learned concept is not static and may change over time under the influence of varying conditions (i.e. varying context). Unlike existing approaches where only the most recent data are considered for adapting the model, we propose incorporating context awareness into the adaptation process. The goal is to utilise knowledge of relevant context variables to facilitate the selection of more relevant training data. Specifically, we propose to weight each training example based on the degree of similarity with the current context. To detect such similarity, we utilise two approaches: a simple difference between the context variable values and a distribution-based distance metric. The experimental analyses show that such explicit context utilisation results in a more effective data selection strategy and enables to produce more accurate predictions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Barakat, L.: Context identification and exploitation in dynamic data mining—an application to classifying electricity price changes. In: Bouchachia, A. (ed.) Adaptive and Intelligent Systems. Lecture Notes in Computer Science, vol. 8779, pp. 80–89. Springer, Heidelberg (2014) Barakat, L.: Context identification and exploitation in dynamic data mining—an application to classifying electricity price changes. In: Bouchachia, A. (ed.) Adaptive and Intelligent Systems. Lecture Notes in Computer Science, vol. 8779, pp. 80–89. Springer, Heidelberg (2014)
2.
Zurück zum Zitat Du, L., Song, Q., Jia, X.: Detecting concept drift: an information entropy based method using an adaptive sliding window. Intell. Data Anal. 18, 337–364 (2014) Du, L., Song, Q., Jia, X.: Detecting concept drift: an information entropy based method using an adaptive sliding window. Intell. Data Anal. 18, 337–364 (2014)
3.
Zurück zum Zitat Frank, E., Hall, M., Pfahringer, B.: Locally weighted naive bayes. In: Proceedings of the 19th Conference on Uncertainty in Artificial Intelligence, pp. 249–256 (2003) Frank, E., Hall, M., Pfahringer, B.: Locally weighted naive bayes. In: Proceedings of the 19th Conference on Uncertainty in Artificial Intelligence, pp. 249–256 (2003)
4.
Zurück zum Zitat Gama, J., Medas, P., Castillo, G., Rodrigues, P.: Learning with drift detection. In: Proceedings of the 17th Brazilian Symposium on Artificial Intelligence, LNAI 3171, pp. 286–295 (2004) Gama, J., Medas, P., Castillo, G., Rodrigues, P.: Learning with drift detection. In: Proceedings of the 17th Brazilian Symposium on Artificial Intelligence, LNAI 3171, pp. 286–295 (2004)
5.
Zurück zum Zitat Gama, J., Bifet, A., Pechenizkiy, M., Bouchachia, A.: A survey on concept drift adaptation. ACM Comput. Surv. 46(4), 1–37 (2014)CrossRefMATH Gama, J., Bifet, A., Pechenizkiy, M., Bouchachia, A.: A survey on concept drift adaptation. ACM Comput. Surv. 46(4), 1–37 (2014)CrossRefMATH
6.
Zurück zum Zitat Garcia, M.B., del Campo-Avila, J., Fidalgo, R., Bifet, A., Gavalda, R., Morales-Bueno, R.: Early drift detection method. In: ECML PKDD 2006 Workshop on Knowledge Discovery from Data Streams (2006) Garcia, M.B., del Campo-Avila, J., Fidalgo, R., Bifet, A., Gavalda, R., Morales-Bueno, R.: Early drift detection method. In: ECML PKDD 2006 Workshop on Knowledge Discovery from Data Streams (2006)
7.
Zurück zum Zitat Harries, M., Sammut, C., Horn, K.: Extracting hidden context. Mach. Learn. 32(2), 101–126 (1998)CrossRefMATH Harries, M., Sammut, C., Horn, K.: Extracting hidden context. Mach. Learn. 32(2), 101–126 (1998)CrossRefMATH
8.
Zurück zum Zitat Harries, M.: Splice-2 Comparative Evaluation: Electricity Pricing. Technical report, University of New South Wales (1999) Harries, M.: Splice-2 Comparative Evaluation: Electricity Pricing. Technical report, University of New South Wales (1999)
9.
Zurück zum Zitat Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 97–106 (2001) Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 97–106 (2001)
10.
Zurück zum Zitat Katakis, I., Tsoumakas, G., Vlahavas, I.: Tracking recurring contexts using ensemble classifiers: an application to email filtering. Knowl. Inf. Syst. 22(3), 371–391 (2010)CrossRef Katakis, I., Tsoumakas, G., Vlahavas, I.: Tracking recurring contexts using ensemble classifiers: an application to email filtering. Knowl. Inf. Syst. 22(3), 371–391 (2010)CrossRef
11.
Zurück zum Zitat Koychev, I.: Gradual forgetting for adaptation to concept drift. In: Proceedings of ECAI Workshop Current Issues in Spatio-Temporal Reasoning, pp. 101–106 (2000) Koychev, I.: Gradual forgetting for adaptation to concept drift. In: Proceedings of ECAI Workshop Current Issues in Spatio-Temporal Reasoning, pp. 101–106 (2000)
12.
Zurück zum Zitat Kubat, M.: Floating approximation in time-varying knowledge bases. Pattern Recognit. Lett. 10(4), 223–227 (1989)CrossRefMATH Kubat, M.: Floating approximation in time-varying knowledge bases. Pattern Recognit. Lett. 10(4), 223–227 (1989)CrossRefMATH
13.
Zurück zum Zitat Pavlidis, N.G., Tasoulis, D.K., Adams, N.M., Hand, D.J.: \(\lambda \)-Perceptron: an adaptive classifier for data streams. Pattern Recognit. 44(1), 78–96 (2011)CrossRefMATH Pavlidis, N.G., Tasoulis, D.K., Adams, N.M., Hand, D.J.: \(\lambda \)-Perceptron: an adaptive classifier for data streams. Pattern Recognit. 44(1), 78–96 (2011)CrossRefMATH
14.
Zurück zum Zitat Pavlidis, N.G., Tasoulis, D.K., Adams, N.M., Hand, D.J.: Adaptive consumer credit classification. J. Oper. Res. Soc. 63(12), 1645–1654 (2012)CrossRef Pavlidis, N.G., Tasoulis, D.K., Adams, N.M., Hand, D.J.: Adaptive consumer credit classification. J. Oper. Res. Soc. 63(12), 1645–1654 (2012)CrossRef
15.
Zurück zum Zitat Schlimmer, J., Granger, R.: Incremental learning from noisy data. Mach. Learn. 1(3), 317–354 (1986) Schlimmer, J., Granger, R.: Incremental learning from noisy data. Mach. Learn. 1(3), 317–354 (1986)
16.
Zurück zum Zitat Sebastiao, R., Gama, J.: Change detection in learning histograms from data streams. In: Proceedings of the Portuguese Conference on Artificial Intelligence. LNAI 4874, pp. 112–123 (2007) Sebastiao, R., Gama, J.: Change detection in learning histograms from data streams. In: Proceedings of the Portuguese Conference on Artificial Intelligence. LNAI 4874, pp. 112–123 (2007)
17.
Zurück zum Zitat Tsymbal, A.: The Problem of Concept Drift: Definitions and Related Work. Computer Science Department, Trinity College Dublin (2004) Tsymbal, A.: The Problem of Concept Drift: Definitions and Related Work. Computer Science Department, Trinity College Dublin (2004)
18.
Zurück zum Zitat Wang, H., Fan, W., Yu, P. S., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 226–235 (2003) Wang, H., Fan, W., Yu, P. S., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 226–235 (2003)
19.
Zurück zum Zitat Widmer, G., Kubat, M.: Learning in the presence of concept drift and hidden contexts. Mach. Learn. 23(1), 69–101 (1996) Widmer, G., Kubat, M.: Learning in the presence of concept drift and hidden contexts. Mach. Learn. 23(1), 69–101 (1996)
20.
Zurück zum Zitat Zliobaite, I., Kuncheva, L.: Determining the training window for small sample size classification with concept drift. In: Proceedings of the IEEE International Conference on Data Mining Workshop, pp. 447–452 (2009) Zliobaite, I., Kuncheva, L.: Determining the training window for small sample size classification with concept drift. In: Proceedings of the IEEE International Conference on Data Mining Workshop, pp. 447–452 (2009)
21.
Zurück zum Zitat Zliobaite, I.: How Good is the Electricity Benchmark for Evaluating Concept Drift Adaptation. CoRR, abs/1301-3524 (2013) Zliobaite, I.: How Good is the Electricity Benchmark for Evaluating Concept Drift Adaptation. CoRR, abs/1301-3524 (2013)
Metadaten
Titel
A Context-Driven Data Weighting Approach for Handling Concept Drift in Classification
verfasst von
Lida Barakat
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-26227-7_36