Skip to main content
Erschienen in: Soft Computing 5/2020

04.06.2019 | Methodologies and Application

Semi-supervised data clustering using particle swarm optimisation

verfasst von: Daphne T. C. Lai, Minami Miyakawa, Yuji Sato

Erschienen in: Soft Computing | Ausgabe 5/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this study, we propose the semi-supervised particle swarm optimisation (ssPSO) algorithm for data clustering. The algorithm takes advantage of the strengths of semi-supervised fuzzy c-means (ssFCM) and particle swarm optimisation (PSO) to allow for a more informed search using labelled data across small number of iterations while maintaining diversity in the search process. ssFCM algorithms can find meaningful clusters using available labelled data to guide the learning process. PSOs are often chosen to solve clustering problems due to their versatility in problem representation and exploration capabilities. To verify the goodness of ssPSOs and provide practical insights to researchers, the clustering performances and clustering behaviours of ssPSOs are investigated and compared with PSO variants and ssFCMs. Two approaches of ssPSO were studied, one applied at initialisation only and the other throughout the learning process. Evaluated based on accuracy and quantisation error (QE), the ssPSO, PSOs and ssFCM algorithms were tested on 13 UCI datasets with different sizes, dimensions, number of classes and distribution, exploring several swarm size and maximum iteration settings over 100 runs. Visual examination of biplots and convergence graphs was conducted. ssPSOs were found to perform competitively well with ssFCM in most datasets in terms of accuracy and outperform ssFCM in terms of QE using swarm size 20 and maximum iteration 20. The results demonstrate that ssPSOs perform particularly well in sparsely distributed datasets with overlapping clusters and produce clusters with better structures in terms of QE. Furthermore, ssPSOs were demonstrated to perform competitively well as ssFCM in datasets with more than three clusters, while QPSO performed poorly in such datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Azab SS, Hady MFA, Hefny HA (2017) Semi-supervised classification: cluster and label approach using particle swarm optimization. Int J Comput Appl 160(3):39 Azab SS, Hady MFA, Hefny HA (2017) Semi-supervised classification: cluster and label approach using particle swarm optimization. Int J Comput Appl 160(3):39
Zurück zum Zitat Chen L, Wu X, Gao C (2012) Semi-supervised fuzzy clustering algorithm based on QPSO. J Inf Comput Sci 9(1):93–101CrossRef Chen L, Wu X, Gao C (2012) Semi-supervised fuzzy clustering algorithm based on QPSO. J Inf Comput Sci 9(1):93–101CrossRef
Zurück zum Zitat Chuang LY, Hsiao CJ, Yang CH (2011) Chaotic particle swarm optimization for data clustering. Expert Syst Appl 38(12):14555–14563CrossRef Chuang LY, Hsiao CJ, Yang CH (2011) Chaotic particle swarm optimization for data clustering. Expert Syst Appl 38(12):14555–14563CrossRef
Zurück zum Zitat Guo J, Sato Y (2017) A bare bones particle swarm optimization algorithm with dynamic local search. In: International conference in swarm intelligence. Springer, pp 158–165 Guo J, Sato Y (2017) A bare bones particle swarm optimization algorithm with dynamic local search. In: International conference in swarm intelligence. Springer, pp 158–165
Zurück zum Zitat Kennedy J (2003) Bare bones particle swarms. In: Proceedings of the 2003 IEEE swarm intelligence symposium, SIS’03. IEEE, pp 80–87 Kennedy J (2003) Bare bones particle swarms. In: Proceedings of the 2003 IEEE swarm intelligence symposium, SIS’03. IEEE, pp 80–87
Zurück zum Zitat Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of the IEEE international conference on neural networks, vol 4. IEEE, pp 1942–1948 Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of the IEEE international conference on neural networks, vol 4. IEEE, pp 1942–1948
Zurück zum Zitat Lai DTC, Garibaldi JM (2011) A comparison of distance-based semi-supervised fuzzy c-means clustering algorithms. In: Proceedings of IEEE international conference on fuzzy systems, pp 1580–1586 Lai DTC, Garibaldi JM (2011) A comparison of distance-based semi-supervised fuzzy c-means clustering algorithms. In: Proceedings of IEEE international conference on fuzzy systems, pp 1580–1586
Zurück zum Zitat Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9(Nov):2579–2605MATH Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9(Nov):2579–2605MATH
Zurück zum Zitat Omran M, Al-Sharhan S (2007) Barebones particle swarm methods for unsupervised image classification. In: IEEE congress on evolutionary computation, CEC 2007. IEEE, pp 3247–3252 Omran M, Al-Sharhan S (2007) Barebones particle swarm methods for unsupervised image classification. In: IEEE congress on evolutionary computation, CEC 2007. IEEE, pp 3247–3252
Zurück zum Zitat Pedrycz W, Waletzky J (1997) Fuzzy clustering with partial supervision. IEEE Trans Syst Man Cybern 27(5):787–795CrossRef Pedrycz W, Waletzky J (1997) Fuzzy clustering with partial supervision. IEEE Trans Syst Man Cybern 27(5):787–795CrossRef
Zurück zum Zitat Sengupta S, Basak S, Peters RA (2018) Data clustering using a hybrid of fuzzy c-means and quantum-behaved particle swarm optimization. In: 2018 IEEE 8th annual computing and communication workshop and conference (CCWC). IEEE, pp 137–142 Sengupta S, Basak S, Peters RA (2018) Data clustering using a hybrid of fuzzy c-means and quantum-behaved particle swarm optimization. In: 2018 IEEE 8th annual computing and communication workshop and conference (CCWC). IEEE, pp 137–142
Zurück zum Zitat Sun J, Xu W, Feng B (2004) A global search strategy of quantum-behaved particle swarm optimization. In: 2004 IEEE conference on cybernetics and intelligent systems, vol 1. IEEE, pp 111–116 Sun J, Xu W, Feng B (2004) A global search strategy of quantum-behaved particle swarm optimization. In: 2004 IEEE conference on cybernetics and intelligent systems, vol 1. IEEE, pp 111–116
Zurück zum Zitat Van der Merwe D, Engelbrecht AP (2003) Data clustering using particle swarm optimization. In: The 2003 congress on evolutionary computation, CEC’03, vol 1. IEEE, pp 215–220 Van der Merwe D, Engelbrecht AP (2003) Data clustering using particle swarm optimization. In: The 2003 congress on evolutionary computation, CEC’03, vol 1. IEEE, pp 215–220
Zurück zum Zitat Zhang D, Tan K, Chen S (2004) Semi-supervised kernel-based fuzzy c-means. Lect Notes Comput Sci Neural Inf Process 3316:1229–1234CrossRef Zhang D, Tan K, Chen S (2004) Semi-supervised kernel-based fuzzy c-means. Lect Notes Comput Sci Neural Inf Process 3316:1229–1234CrossRef
Zurück zum Zitat Zhang Y, Xiong X, Zhang Q (2013) An improved self-adaptive PSO algorithm with detection function for multimodal function optimization problems. Math Probl Eng 2013:8MathSciNetMATH Zhang Y, Xiong X, Zhang Q (2013) An improved self-adaptive PSO algorithm with detection function for multimodal function optimization problems. Math Probl Eng 2013:8MathSciNetMATH
Metadaten
Titel
Semi-supervised data clustering using particle swarm optimisation
verfasst von
Daphne T. C. Lai
Minami Miyakawa
Yuji Sato
Publikationsdatum
04.06.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 5/2020
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-04114-z

Weitere Artikel der Ausgabe 5/2020

Soft Computing 5/2020 Zur Ausgabe

Premium Partner