Skip to main content
Erschienen in: Pattern Analysis and Applications 4/2018

13.02.2017 | Theoretical Advances

A time-varying quadratic programming for online clustering of streaming data

verfasst von: Mohammad Amin Adibi, Jamal Shahrabi

Erschienen in: Pattern Analysis and Applications | Ausgabe 4/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

An online clustering method based on a time-varying quadratic programming is proposed which can precisely detect streaming data clustering structure when no assumption is desired on the shape and density of data classes. In the proposed method, online clustering is achieved through simulating some dynamical equations which yield optimum solution of the time-varying quadratic programming over time. A new framework is also proposed which efficiently permits streaming data clustering based on a relatively small and renewable dataset. This framework reduces the need for incoming data storage memory to a small and independent of original data size. The performance of the proposed method is evaluated through the experiments using synthetic data as well as the KDD cup 99 dataset. The results illustrate higher performance of the proposed method in comparison with a range of benchmark methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Adibi MA, Shahrabi J (2015) Online anomaly detection based on support vector clustering. Int J Comput Intell Syst 8(4):735–746CrossRef Adibi MA, Shahrabi J (2015) Online anomaly detection based on support vector clustering. Int J Comput Intell Syst 8(4):735–746CrossRef
2.
Zurück zum Zitat Aggarwal CC, Han J, Wang J, Yu PS (2003) A framework for clustering evolving data streams. In: Proceedings of the 29th international conference on very large data bases, vol 29, pp 81–92CrossRef Aggarwal CC, Han J, Wang J, Yu PS (2003) A framework for clustering evolving data streams. In: Proceedings of the 29th international conference on very large data bases, vol 29, pp 81–92CrossRef
3.
Zurück zum Zitat Aggarwal CC, Han J, Wang J, Yu PS (2004) A framework for projected clustering of high dimensional data streams. In: Proceedings of the thirtieth international conference on very large data bases, vol 30, pp 852–863CrossRef Aggarwal CC, Han J, Wang J, Yu PS (2004) A framework for projected clustering of high dimensional data streams. In: Proceedings of the thirtieth international conference on very large data bases, vol 30, pp 852–863CrossRef
4.
Zurück zum Zitat Ben-Hur A, Horn D, Siegelmann HT, Vapnik V (2002) Support vector clustering. J Mach Learn Res 2:125–137MATH Ben-Hur A, Horn D, Siegelmann HT, Vapnik V (2002) Support vector clustering. J Mach Learn Res 2:125–137MATH
5.
Zurück zum Zitat Beringer J, Hüllermeier E (2006) Online clustering of parallel data streams. Data Knowl Eng 58(2):180–204CrossRef Beringer J, Hüllermeier E (2006) Online clustering of parallel data streams. Data Knowl Eng 58(2):180–204CrossRef
6.
Zurück zum Zitat Boubacar HA, Lecoeuche S, Maouche S (2008) SAKM: self-adaptive kernel machine A kernel-based algorithm for online clustering. Neural Netw 21(9):1287–1301CrossRef Boubacar HA, Lecoeuche S, Maouche S (2008) SAKM: self-adaptive kernel machine A kernel-based algorithm for online clustering. Neural Netw 21(9):1287–1301CrossRef
7.
Zurück zum Zitat Boukharouba K, Lecoeuche S (2008) Online clustering of non-stationary data using incremental and decremental SVM. In: Artificial neural networks—ICANN, pp 336–345 Boukharouba K, Lecoeuche S (2008) Online clustering of non-stationary data using incremental and decremental SVM. In: Artificial neural networks—ICANN, pp 336–345
8.
Zurück zum Zitat Cao F, Ester M, Qian W, Zhou A (2006) Density-based clustering over an evolving data stream with noise. In: SDM, vol 6, pp 328–339 Cao F, Ester M, Qian W, Zhou A (2006) Density-based clustering over an evolving data stream with noise. In: SDM, vol 6, pp 328–339
9.
Zurück zum Zitat Deng D, Kasabov N (2003) On-line pattern analysis by evolving self-organizing maps. Neurocomputing 51:87–103CrossRef Deng D, Kasabov N (2003) On-line pattern analysis by evolving self-organizing maps. Neurocomputing 51:87–103CrossRef
10.
Zurück zum Zitat Gao J, Li J, Zhang Z, Tan PN (2005) An incremental data stream clustering algorithm based on dense units detection. In: Ho TB, Cheung D, Liu H (eds) Advances in knowledge discovery and data mining. PAKDD 2005. Lecture notes in computer science, vol 3518. Springer, HeidelbergCrossRef Gao J, Li J, Zhang Z, Tan PN (2005) An incremental data stream clustering algorithm based on dense units detection. In: Ho TB, Cheung D, Liu H (eds) Advances in knowledge discovery and data mining. PAKDD 2005. Lecture notes in computer science, vol 3518. Springer, HeidelbergCrossRef
11.
Zurück zum Zitat Guha S, Meyerson A, Mishra N, Motwani R, O’Callaghan L (2003) Clustering data streams: theory and practice. IEEE Trans Knowl Data Eng 15(3):515–528CrossRef Guha S, Meyerson A, Mishra N, Motwani R, O’Callaghan L (2003) Clustering data streams: theory and practice. IEEE Trans Knowl Data Eng 15(3):515–528CrossRef
12.
Zurück zum Zitat Kasabov N (2001) Evolving fuzzy neural networks for supervised/unsupervised online knowledge-based learning. IEEE Trans Syst Man Cybern B Cybern 31(6):902–918CrossRef Kasabov N (2001) Evolving fuzzy neural networks for supervised/unsupervised online knowledge-based learning. IEEE Trans Syst Man Cybern B Cybern 31(6):902–918CrossRef
13.
Zurück zum Zitat Kharratzadeh M, Renard B, Coates MJ (2015) Bayesian topic model approaches to online and time-dependent clustering. Digit Signal Process 47:25–35MathSciNetCrossRef Kharratzadeh M, Renard B, Coates MJ (2015) Bayesian topic model approaches to online and time-dependent clustering. Digit Signal Process 47:25–35MathSciNetCrossRef
14.
Zurück zum Zitat Kranen P, Assent I, Baldauf C, Seidl T (2011) The ClusTree: indexing micro-clusters for anytime stream mining. Knowl Inf Syst 29(2):249–272CrossRef Kranen P, Assent I, Baldauf C, Seidl T (2011) The ClusTree: indexing micro-clusters for anytime stream mining. Knowl Inf Syst 29(2):249–272CrossRef
15.
Zurück zum Zitat Labroche N (2014) Online fuzzy medoid based clustering algorithms. Neurocomputing 126:141–150CrossRef Labroche N (2014) Online fuzzy medoid based clustering algorithms. Neurocomputing 126:141–150CrossRef
16.
Zurück zum Zitat Langone R, Agudelo OM, De Moor B, Suykens JA (2014) Incremental kernel spectral clustering for online learning of non-stationary data. Neurocomputing 139:246–260CrossRef Langone R, Agudelo OM, De Moor B, Suykens JA (2014) Incremental kernel spectral clustering for online learning of non-stationary data. Neurocomputing 139:246–260CrossRef
17.
Zurück zum Zitat Li Y, Li D, Wang S, Zhai Y (2014) Incremental entropy-based clustering on categorical data streams with concept drift. Knowl Based Syst 59:33–47CrossRef Li Y, Li D, Wang S, Zhai Y (2014) Incremental entropy-based clustering on categorical data streams with concept drift. Knowl Based Syst 59:33–47CrossRef
18.
Zurück zum Zitat Lühr S, Lazarescu M (2009) Incremental clustering of dynamic data streams using connectivity based representative points. Data Knowl Eng 68(1):1–27CrossRef Lühr S, Lazarescu M (2009) Incremental clustering of dynamic data streams using connectivity based representative points. Data Knowl Eng 68(1):1–27CrossRef
19.
Zurück zum Zitat Nazemi A (2014) A neural network model for solving convex quadratic programming problems with some applications. Eng Appl Artif Intell 32:54–62CrossRef Nazemi A (2014) A neural network model for solving convex quadratic programming problems with some applications. Eng Appl Artif Intell 32:54–62CrossRef
20.
Zurück zum Zitat Ping L, Chun-Guang Z, Xu Z (2010) Improved support vector clustering. Eng Appl Artif Intell 23(4):552–559CrossRef Ping L, Chun-Guang Z, Xu Z (2010) Improved support vector clustering. Eng Appl Artif Intell 23(4):552–559CrossRef
21.
Zurück zum Zitat Sahu SK, Sarangi S, Jena SK (2014) A detail analysis on intrusion detection datasets. In: Advance computing conference (IACC), 2014 IEEE international, pp 1348–1353 Sahu SK, Sarangi S, Jena SK (2014) A detail analysis on intrusion detection datasets. In: Advance computing conference (IACC), 2014 IEEE international, pp 1348–1353
22.
Zurück zum Zitat Schölkopf B, Smola AJ (2002) Learning with kernels: support vector machines, regularization, optimization, and beyond. MIT Press, Cambridge Schölkopf B, Smola AJ (2002) Learning with kernels: support vector machines, regularization, optimization, and beyond. MIT Press, Cambridge
23.
Zurück zum Zitat Tao Q, Cao J, Sun D (2001) A simple and high performance neural network for quadratic programming problems. Appl Math Comput 124(2):251–260MathSciNetMATH Tao Q, Cao J, Sun D (2001) A simple and high performance neural network for quadratic programming problems. Appl Math Comput 124(2):251–260MathSciNetMATH
24.
Zurück zum Zitat Tavallaee M, Bagheri E, Lu W, Ghorbani AA (2009) A detailed analysis of the KDD CUP 99 data set. In: Proceedings of the second IEEE symposium on computational intelligence for security and defense applications Tavallaee M, Bagheri E, Lu W, Ghorbani AA (2009) A detailed analysis of the KDD CUP 99 data set. In: Proceedings of the second IEEE symposium on computational intelligence for security and defense applications
25.
Zurück zum Zitat Wang Y, Zhang X, Wang S, Lai KK (2008) Nonlinear clustering based support vector machine for large data sets. Optim Methods Softw 23(4):533–549MathSciNetCrossRef Wang Y, Zhang X, Wang S, Lai KK (2008) Nonlinear clustering based support vector machine for large data sets. Optim Methods Softw 23(4):533–549MathSciNetCrossRef
26.
Zurück zum Zitat Zhang Y, Li Z (2009) Zhang neural network for online solution of time-varying convex quadratic program subject to time-varying linear-equality constraints. Phys Lett A 373(18):1639–1643CrossRef Zhang Y, Li Z (2009) Zhang neural network for online solution of time-varying convex quadratic program subject to time-varying linear-equality constraints. Phys Lett A 373(18):1639–1643CrossRef
28.
Zurück zum Zitat Zhou A, Cao F, Yan Y, Sha C, He X (2007) Distributed data stream clustering: a fast EM-based approach. In: International conference on data engineering 2007, ICDE 2007, IEEE 23rd, pp 736–745 Zhou A, Cao F, Yan Y, Sha C, He X (2007) Distributed data stream clustering: a fast EM-based approach. In: International conference on data engineering 2007, ICDE 2007, IEEE 23rd, pp 736–745
Metadaten
Titel
A time-varying quadratic programming for online clustering of streaming data
verfasst von
Mohammad Amin Adibi
Jamal Shahrabi
Publikationsdatum
13.02.2017
Verlag
Springer London
Erschienen in
Pattern Analysis and Applications / Ausgabe 4/2018
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-017-0608-9

Weitere Artikel der Ausgabe 4/2018

Pattern Analysis and Applications 4/2018 Zur Ausgabe

Premium Partner