Published in: Neural Computing and Applications 2/2023

3 October 2022 | Original Article

Multi-objective memetic differential evolution optimization algorithm for text clustering problems

Authors: Hossam M. J. Mustafa, Masri Ayob, Hisham A. Shehadeh, Sawsan Abu-Taleb

Abstract

Most text clustering algorithms optimize a single criterion, an approach that often fails to find good clustering solutions across datasets with diverse clustering characteristics. A multi-objective meta-heuristic approach instead seeks an optimal clustering by maximizing (or minimizing) two or more objective functions. In this paper, we propose a multi-objective memetic differential evolution algorithm (MOMDE) for text clustering. MOMDE combines memetic and differential evolution algorithms to improve the search for optimal clusterings by better balancing exploitation and exploration. It also employs a dominance-based multi-objective approach, which may further improve the search by maximizing and/or minimizing two cluster quality measures. The proposed algorithm is tested on six text clustering datasets from the Laboratory of Computational Intelligence. Our experimental results show that MOMDE performs better than state-of-the-art text clustering algorithms. The F-measure is used to further validate the clusterings obtained by MOMDE, while multi-objective performance assessment metrics are used to evaluate the quality of the Pareto-optimal solutions.
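
The two building blocks named in the abstract, a dominance-based comparison of candidate clusterings and a differential evolution step, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the DE variant, the two cluster quality measures, or the local search, so the sketch assumes the common DE/rand/1/bin scheme, assumes both objectives are minimized, and omits the memetic local search entirely. The function names and the parameters f and cr are hypothetical.

```python
import numpy as np


def dominates(a, b):
    """True if objective vector a Pareto-dominates b (assumes minimization)."""
    a, b = np.asarray(a), np.asarray(b)
    return bool(np.all(a <= b) and np.any(a < b))


def non_dominated(objectives):
    """Indices of the rows of an (n, m) objective array that no other row dominates."""
    objectives = np.asarray(objectives)
    keep = []
    for i, fi in enumerate(objectives):
        others = (fj for j, fj in enumerate(objectives) if j != i)
        if not any(dominates(fj, fi) for fj in others):
            keep.append(i)
    return keep


def de_rand_1_bin(population, target_idx, f=0.5, cr=0.9, rng=None):
    """Build one DE/rand/1/bin trial vector for population[target_idx].

    population is an (n, dim) float array with n >= 4; each row could, for
    example, hold flattened cluster centroids (an assumption, not from the paper).
    """
    rng = np.random.default_rng() if rng is None else rng
    n, dim = population.shape
    # Pick three distinct individuals, none of them the target.
    candidates = [i for i in range(n) if i != target_idx]
    r1, r2, r3 = rng.choice(candidates, size=3, replace=False)
    mutant = population[r1] + f * (population[r2] - population[r3])
    # Binomial crossover; force at least one component to come from the mutant.
    mask = rng.random(dim) < cr
    mask[rng.integers(dim)] = True
    return np.where(mask, mutant, population[target_idx])
```

In a dominance-based loop, a trial vector would typically replace its target only if its objective vector dominates the target's, and the non-dominated set would be kept as the current Pareto front approximation. A memetic variant would additionally refine selected individuals with a local search step.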

Metadata
Title
Multi-objective memetic differential evolution optimization algorithm for text clustering problems
Authors
Hossam M. J. Mustafa
Masri Ayob
Hisham A. Shehadeh
Sawsan Abu-Taleb
Publication date
3 October 2022
Publisher
Springer London
Published in
Neural Computing and Applications, Issue 2/2023
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-022-07888-w
