Skip to main content
Top

2020 | OriginalPaper | Chapter

Online Multi-objective Subspace Clustering for Streaming Data

Authors : Dipanjyoti Paul, Sriparna Saha, Jimson Mathew

Published in: Neural Information Processing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper develops an online subspace clustering technique which is capable of handling continuous arrival of data in a streaming manner. Subspace clustering is a technique where the subset of features that are used to represent a cluster are different for different clusters. Most of the streaming data clustering methods primarily optimize only a single objective function which limits the model in capturing only a particular shape or property. However, the simultaneous optimization of multiple objectives helps in overcoming the above mentioned limitations and enables to generate good quality clusters. Inspired by this, the developed streaming subspace clustering method optimizes multiple objectives capturing cluster compactness and feature relevancy. In this paper, we consider an evolutionary-based technique and optimize multiple objective functions simultaneously to determine the optimal subspace clusters. The generated clusters in the proposed method are allowed to contain overlapping of objects. To establish the superiority of using multiple objectives, the proposed method is evaluated on three real-life and three synthetic data sets. The results obtained by the proposed method are compared with several state-of-the-art methods and the comparative study shows the superiority of using multiple objectives in the proposed method.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Aggarwal, C C., Han, J., Wang, J., Yu, P.S.: A framework for projected clustering of high dimensional data streams. In: Proceedings of the Thirtieth International Conference on Very Large Data, vol. 30, pp. 852–863 (2004) Aggarwal, C C., Han, J., Wang, J., Yu, P.S.: A framework for projected clustering of high dimensional data streams. In: Proceedings of the Thirtieth International Conference on Very Large Data, vol. 30, pp. 852–863 (2004)
2.
go back to reference Aggarwal, C.C., Philip, S.Y., Han, J., Wang, J.: A framework for clustering evolving data streams. In: Proceedings 2003 VLDB Conference, pp. 81–92 (2003) Aggarwal, C.C., Philip, S.Y., Han, J., Wang, J.: A framework for clustering evolving data streams. In: Proceedings 2003 VLDB Conference, pp. 81–92 (2003)
3.
go back to reference Bezdek, J.C., Ehrlich, R., Full, W.: FCM: the fuzzy c-means clustering algorithm. Comput. Geosci. 10, 191–203 (1984)CrossRef Bezdek, J.C., Ehrlich, R., Full, W.: FCM: the fuzzy c-means clustering algorithm. Comput. Geosci. 10, 191–203 (1984)CrossRef
4.
go back to reference Chen, Y., Tu, L.: Density-based clustering for real-time stream data. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 133–142 (2007) Chen, Y., Tu, L.: Density-based clustering for real-time stream data. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 133–142 (2007)
5.
go back to reference Paul, D., Saha, S., Mathew, J.: Improved subspace clustering algorithm using multi-objective framework and subspace optimization. Expert Syst. Appl. 113487 (2020) Paul, D., Saha, S., Mathew, J.: Improved subspace clustering algorithm using multi-objective framework and subspace optimization. Expert Syst. Appl. 113487 (2020)
6.
go back to reference Paul, D., Saha, S., Mathew, J.: Fusion of evolvable genome structure and multi-objective optimization for subspace clustering. Pattern Recogn. 95, 58–71 (2019)CrossRef Paul, D., Saha, S., Mathew, J.: Fusion of evolvable genome structure and multi-objective optimization for subspace clustering. Pattern Recogn. 95, 58–71 (2019)CrossRef
7.
go back to reference Peignier, S.: Subspace clustering on static datasets and dynamic data streams using bio-inspired algorithms. Ph.D. Thesis, University de Lyon, INSA Lyon (2017) Peignier, S.: Subspace clustering on static datasets and dynamic data streams using bio-inspired algorithms. Ph.D. Thesis, University de Lyon, INSA Lyon (2017)
8.
go back to reference Yan, X., Razeghi, J.M., Homaifar, A., Erol, B.A., Girma, A., Tunstel, E:. A novel streaming data clustering algorithm based on fitness proportionate sharing. In: IEEE Access, pp. 184985–185000 (2019) Yan, X., Razeghi, J.M., Homaifar, A., Erol, B.A., Girma, A., Tunstel, E:. A novel streaming data clustering algorithm based on fitness proportionate sharing. In: IEEE Access, pp. 184985–185000 (2019)
9.
go back to reference Guha, S., Mishra, N., Motwani, R., o’Callaghan, L.: Clustering data streams. In: Proceedings 41st Annual Symposium on Foundations of Computer Science, pp. 359–366 (2000) Guha, S., Mishra, N., Motwani, R., o’Callaghan, L.: Clustering data streams. In: Proceedings 41st Annual Symposium on Foundations of Computer Science, pp. 359–366 (2000)
10.
go back to reference Cao, F., Estert, M., Qian, W., Zhou, A.: Density-based clustering over an evolving data stream with noise. In: Proceedings of the 2006 SIAM International Conference on Data Mining, pp. 328–339 (2006) Cao, F., Estert, M., Qian, W., Zhou, A.: Density-based clustering over an evolving data stream with noise. In: Proceedings of the 2006 SIAM International Conference on Data Mining, pp. 328–339 (2006)
11.
go back to reference Ren, J., Ma, R.: Density-based data streams clustering over sliding windows. In: Sixth International Conference on Fuzzy Systems and Knowledge Discovery, vol. 5, pp. 248–252 (2009) Ren, J., Ma, R.: Density-based data streams clustering over sliding windows. In: Sixth International Conference on Fuzzy Systems and Knowledge Discovery, vol. 5, pp. 248–252 (2009)
12.
go back to reference Amini, A., Wah, T.Y., Teh, Y.W.: DENGRIS-stream: a density-grid based clustering algorithm for evolving data streams over sliding window. In: International Conference on Data Mining and Computer Engineering, pp. 206–210 (2012) Amini, A., Wah, T.Y., Teh, Y.W.: DENGRIS-stream: a density-grid based clustering algorithm for evolving data streams over sliding window. In: International Conference on Data Mining and Computer Engineering, pp. 206–210 (2012)
Metadata
Title
Online Multi-objective Subspace Clustering for Streaming Data
Authors
Dipanjyoti Paul
Sriparna Saha
Jimson Mathew
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-63820-7_11

Premium Partner