Skip to main content
Erschienen in: Neural Computing and Applications 3/2018

01.02.2017 | ISCMI15

Exploring differential evolution and particle swarm optimization to develop some symmetry-based automatic clustering techniques: application to gene clustering

verfasst von: Sriparna Saha, Ranjita Das

Erschienen in: Neural Computing and Applications | Ausgabe 3/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In the current paper, we have developed two bio-inspired fuzzy clustering algorithms by incorporating the optimization techniques, namely differential evolution and particle swarm optimization. Both these clustering techniques can detect symmetrical-shaped clusters utilizing the established point symmetry-based distance measure. Both the proposed approaches are automatic in nature and can detect the number of clusters automatically from a given dataset. A symmetry-based cluster validity measure, F-Sym-index, is used as the objective function to be optimized in order to automatically determine the correct partitioning by both the approaches. The effectiveness of the proposed approaches is shown for automatically clustering some artificial and real-life datasets as well as for clustering some real-life gene expression datasets. The current paper presents a comparative analysis of some meta-heuristic-based clustering approaches, namely newly proposed two techniques and the already existing automatic genetic clustering techniques, VGAPS, GCUK, HNGA. The obtained results are compared with respect to some external cluster validity indices. Moreover, some statistical significance tests, as well as biological significance tests, are also conducted. Finally, results on gene expression datasets have been visualized by using some visualization tools, namely Eisen plot and cluster profile plot.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agustn-Blas LE, Salcedo-Sanz S, Jimnez-Fernndez S, Carro-Calvo L, Del Ser J, Portilla-Figueras JA (2012) A new grouping genetic algorithm for clustering problems. Expert Syst Appl 39(10):9695–9703CrossRef Agustn-Blas LE, Salcedo-Sanz S, Jimnez-Fernndez S, Carro-Calvo L, Del Ser J, Portilla-Figueras JA (2012) A new grouping genetic algorithm for clustering problems. Expert Syst Appl 39(10):9695–9703CrossRef
2.
Zurück zum Zitat Alam S, Dobbie G, Koh YS, Riddle P, Rehma SU (2014) Research on particle swarm optimization based clustering: a systematic review of literature and techniques. Swarm Evolut Comput 17:1–13CrossRef Alam S, Dobbie G, Koh YS, Riddle P, Rehma SU (2014) Research on particle swarm optimization based clustering: a systematic review of literature and techniques. Swarm Evolut Comput 17:1–13CrossRef
3.
Zurück zum Zitat Alizadeh AA, Eisen MB, Davis RE, Ma C, Lossos IS, Rosenwald A, Boldrick JC, Sabet H, Tran T, Yu X, Powell JI, Yang L, Marti GE, Moore T, Hudson JJ, Lu L, Lewis DB, Tibshirani R, Sherlock G, Chan WC, Greiner TC, Weisenburger DD, Armitage JO, Warnke R, Levy R, Wilson W, Grever MR, Byrd JC, Botstein D, Brown PO, Staudt LM (2000) Distinct types of diffuse large b-cell lymphoma identified by gene expression profiling. Nature 403(6769):503–511CrossRef Alizadeh AA, Eisen MB, Davis RE, Ma C, Lossos IS, Rosenwald A, Boldrick JC, Sabet H, Tran T, Yu X, Powell JI, Yang L, Marti GE, Moore T, Hudson JJ, Lu L, Lewis DB, Tibshirani R, Sherlock G, Chan WC, Greiner TC, Weisenburger DD, Armitage JO, Warnke R, Levy R, Wilson W, Grever MR, Byrd JC, Botstein D, Brown PO, Staudt LM (2000) Distinct types of diffuse large b-cell lymphoma identified by gene expression profiling. Nature 403(6769):503–511CrossRef
4.
Zurück zum Zitat Ankerbrandt CA, Buckles BP, Petry FE (1990) Scene recognition using genetic algorithms with semantic nets. Pattern Recognit Lett 11:285–293CrossRefMATH Ankerbrandt CA, Buckles BP, Petry FE (1990) Scene recognition using genetic algorithms with semantic nets. Pattern Recognit Lett 11:285–293CrossRefMATH
6.
Zurück zum Zitat Bandyopadhyay S, Maulik U (2002) Genetic clustering for automatic evolution of clusters and application to image classification. Pattern Recognit 35(6):1197–1208CrossRefMATH Bandyopadhyay S, Maulik U (2002) Genetic clustering for automatic evolution of clusters and application to image classification. Pattern Recognit 35(6):1197–1208CrossRefMATH
7.
Zurück zum Zitat Bandyopadhyay S, Saha S (2007) GAPS: a clustering method using a new point symmetry based distance measure. Pattern Recognit 40(12):3430–3451CrossRefMATH Bandyopadhyay S, Saha S (2007) GAPS: a clustering method using a new point symmetry based distance measure. Pattern Recognit 40(12):3430–3451CrossRefMATH
8.
Zurück zum Zitat Ben-Hur A, Guyon I (2003) Detecting stable clusters using principal component analysis. Humana Press, TotowaCrossRef Ben-Hur A, Guyon I (2003) Detecting stable clusters using principal component analysis. Humana Press, TotowaCrossRef
9.
Zurück zum Zitat Bentley JL, Weide BW, Yao AC (1980) Optimal expected-time algorithms for closest point problems. ACM Trans Math Softw 6(4):563–580MathSciNetCrossRefMATH Bentley JL, Weide BW, Yao AC (1980) Optimal expected-time algorithms for closest point problems. ACM Trans Math Softw 6(4):563–580MathSciNetCrossRefMATH
10.
Zurück zum Zitat Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum, New YorkCrossRefMATH Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum, New YorkCrossRefMATH
11.
Zurück zum Zitat Cai Z, Gong W, Ling CX, Zhang H (2011) A clustering-based differential evolution for global optimization. Appl Soft Comput 11(1):1363–1379CrossRef Cai Z, Gong W, Ling CX, Zhang H (2011) A clustering-based differential evolution for global optimization. Appl Soft Comput 11(1):1363–1379CrossRef
12.
Zurück zum Zitat Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH
13.
Zurück zum Zitat Dorigo M, Stützle T (2004) Ant colony optimization. MIT Press, CambridgeMATH Dorigo M, Stützle T (2004) Ant colony optimization. MIT Press, CambridgeMATH
14.
Zurück zum Zitat Eisen MB, Spellman PT, Brown PO, Botstein D (1998) Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA 95(25):14863–14868CrossRef Eisen MB, Spellman PT, Brown PO, Botstein D (1998) Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA 95(25):14863–14868CrossRef
15.
Zurück zum Zitat Everitt BS (1993) Cluster analysis, third edn. Halsted Press, New YorkMATH Everitt BS (1993) Cluster analysis, third edn. Halsted Press, New YorkMATH
16.
Zurück zum Zitat Friedman JH, Bently JL, Finkel RA (1977) An algorithm for finding best matches in logarithmic expected time. ACM Trans Math Softw 3(3):209–226CrossRefMATH Friedman JH, Bently JL, Finkel RA (1977) An algorithm for finding best matches in logarithmic expected time. ACM Trans Math Softw 3(3):209–226CrossRefMATH
17.
Zurück zum Zitat Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32(200):675–701CrossRefMATH Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32(200):675–701CrossRefMATH
18.
Zurück zum Zitat García S, Herrera F (2008) An extension on statistical comparisons of classifiers over multiple data sets for all pairwise comparisons. J Mach Learn Res 9:2677–2694MATH García S, Herrera F (2008) An extension on statistical comparisons of classifiers over multiple data sets for all pairwise comparisons. J Mach Learn Res 9:2677–2694MATH
19.
Zurück zum Zitat Goldberg DE (1989) Genet algorithms search, optimization and machine learning. Addison-Wesley, New York Goldberg DE (1989) Genet algorithms search, optimization and machine learning. Addison-Wesley, New York
20.
Zurück zum Zitat He H, Tan Y (2012) A two-stage genetic algorithm for automatic clustering. Neurocomputing 81:49–59CrossRef He H, Tan Y (2012) A two-stage genetic algorithm for automatic clustering. Neurocomputing 81:49–59CrossRef
21.
Zurück zum Zitat Horta D, de Andrade IC, Campello RJGB (2011) Evolutionary fuzzy clustering of relational data. Theor Comput Sci 412(42):5854–5870MathSciNetCrossRefMATH Horta D, de Andrade IC, Campello RJGB (2011) Evolutionary fuzzy clustering of relational data. Theor Comput Sci 412(42):5854–5870MathSciNetCrossRefMATH
22.
Zurück zum Zitat Jain AK, Dubes RC (1988) Algorithms for clustering data. Prentice-Hall, Englewood CliffsMATH Jain AK, Dubes RC (1988) Algorithms for clustering data. Prentice-Hall, Englewood CliffsMATH
23.
Zurück zum Zitat Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3):264–323CrossRef Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3):264–323CrossRef
24.
Zurück zum Zitat Kao Y, Chen C-C (2014) Automatic clustering for generalised cell formation using a hybrid particle swarm optimisation. Int J Prod Res 52(12):3466–3484CrossRef Kao Y, Chen C-C (2014) Automatic clustering for generalised cell formation using a hybrid particle swarm optimisation. Int J Prod Res 52(12):3466–3484CrossRef
25.
Zurück zum Zitat Kennedy J, Eberhart RC (2001) Swarm intelligence. Morgan Kaufmann Publishers Inc., San Francisco Kennedy J, Eberhart RC (2001) Swarm intelligence. Morgan Kaufmann Publishers Inc., San Francisco
26.
Zurück zum Zitat Kuila P, Jana PK (2014) A novel differential evolution based clustering algorithm for wireless sensor networks. Appl Soft Comput 25:414–425CrossRef Kuila P, Jana PK (2014) A novel differential evolution based clustering algorithm for wireless sensor networks. Appl Soft Comput 25:414–425CrossRef
27.
Zurück zum Zitat Liu G, Li Y, Nie X, Zheng H (2012) A novel clustering-based differential evolution with 2 multi-parent crossovers for global optimization. Appl Soft Comput 12(2):663–681CrossRef Liu G, Li Y, Nie X, Zheng H (2012) A novel clustering-based differential evolution with 2 multi-parent crossovers for global optimization. Appl Soft Comput 12(2):663–681CrossRef
28.
Zurück zum Zitat Liu Y, Xindong W, Shen Y (2011) Automatic clustering using genetic algorithms. Appl Math Comput 218(4):1267–1279MathSciNetMATH Liu Y, Xindong W, Shen Y (2011) Automatic clustering using genetic algorithms. Appl Math Comput 218(4):1267–1279MathSciNetMATH
29.
Zurück zum Zitat Mangat V (2012) Survey on particle swarm optimization based clustering analysis. In: Swarm and evolutionary computation—international symposia, (SIDE) 2012 and EC 2012, Held in Conjunction with ICAISC 2012, Zakopane, Poland, April 29 May 3, 2012. Proceedings, pp 301–309 Mangat V (2012) Survey on particle swarm optimization based clustering analysis. In: Swarm and evolutionary computation—international symposia, (SIDE) 2012 and EC 2012, Held in Conjunction with ICAISC 2012, Zakopane, Poland, April 29 May 3, 2012. Proceedings, pp 301–309
30.
Zurück zum Zitat Maulik U, Mukhopadhyay A, Bandyopadhyay S (2009) Combining pareto-optimal clusters using supervised learning for identifying co-expressed genes. BMC Bioinform 10(27) Maulik U, Mukhopadhyay A, Bandyopadhyay S (2009) Combining pareto-optimal clusters using supervised learning for identifying co-expressed genes. BMC Bioinform 10(27)
31.
Zurück zum Zitat Maulik U, Saha I (2009) Modified differential evolution based fuzzy clustering for pixel classification in remote sensing imagery. Pattern Recognit 42(9):2135–2149CrossRefMATH Maulik U, Saha I (2009) Modified differential evolution based fuzzy clustering for pixel classification in remote sensing imagery. Pattern Recognit 42(9):2135–2149CrossRefMATH
32.
Zurück zum Zitat Nemenyi P (1963) Distribution-free multiple comparisons. PhD thesis, Princeton University Nemenyi P (1963) Distribution-free multiple comparisons. PhD thesis, Princeton University
33.
Zurück zum Zitat Poli R, Kennedy J, Blackwell T (2007) Particle swarm optimization. Swarm Intell 1(1):33–57CrossRef Poli R, Kennedy J, Blackwell T (2007) Particle swarm optimization. Swarm Intell 1(1):33–57CrossRef
34.
Zurück zum Zitat Price K, Storn RM, Lampinen JA (2005) Differential evolution: a practical approach to global optimization (natural computing series). Springer, SecaucusMATH Price K, Storn RM, Lampinen JA (2005) Differential evolution: a practical approach to global optimization (natural computing series). Springer, SecaucusMATH
35.
Zurück zum Zitat Rana S, Jasola S, Kumar R (2011) A review on particle swarm optimization algorithms and their applications to data clustering. Artif Intell Rev 35(3):211–222CrossRef Rana S, Jasola S, Kumar R (2011) A review on particle swarm optimization algorithms and their applications to data clustering. Artif Intell Rev 35(3):211–222CrossRef
36.
Zurück zum Zitat Ravi V, Aggarwal N, Chauhan N (2010) Differential evolution based fuzzy clustering. In: Swarm, evolutionary, and memetic computing—first international conference on swarm, evolutionary, and memetic computing, SEMCCO 2010, Chennai, India, December 16–18, 2010. Proceedings, pp 38–45 Ravi V, Aggarwal N, Chauhan N (2010) Differential evolution based fuzzy clustering. In: Swarm, evolutionary, and memetic computing—first international conference on swarm, evolutionary, and memetic computing, SEMCCO 2010, Chennai, India, December 16–18, 2010. Proceedings, pp 38–45
37.
Zurück zum Zitat Rousseeuw P (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20(1):53–65CrossRefMATH Rousseeuw P (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20(1):53–65CrossRefMATH
38.
Zurück zum Zitat Saha S, Bandyopadhyay S (2009) A new point symmetry based fuzzy genetic clustering technique for automatic evolution of clusters. Inf Sci 179(19):3230–3246CrossRefMATH Saha S, Bandyopadhyay S (2009) A new point symmetry based fuzzy genetic clustering technique for automatic evolution of clusters. Inf Sci 179(19):3230–3246CrossRefMATH
39.
Zurück zum Zitat Saha S, Bandyopadhyay S (2008) Application of a new symmetry based cluster validity index for satellite image segmentation. IEEE Geosci Remote Sens Lett 5(2):166–170CrossRef Saha S, Bandyopadhyay S (2008) Application of a new symmetry based cluster validity index for satellite image segmentation. IEEE Geosci Remote Sens Lett 5(2):166–170CrossRef
40.
Zurück zum Zitat Sarkar S, Patra GR, Das S, Chaudhuri SS (2013) Fuzzy clustering of image pixels with a fitness-based adaptive differential evolution. In: Swarm, evolutionary, and memetic computing—4th international conference, SEMCCO 2013, Chennai, India, December 19–21, 2013, Proceedings, Part I, pp 179–188 Sarkar S, Patra GR, Das S, Chaudhuri SS (2013) Fuzzy clustering of image pixels with a fitness-based adaptive differential evolution. In: Swarm, evolutionary, and memetic computing—4th international conference, SEMCCO 2013, Chennai, India, December 19–21, 2013, Proceedings, Part I, pp 179–188
41.
Zurück zum Zitat Sheng W, Swift S, Zhang L, Liu X (2005) A weighted sum validity function for clustering with a hybrid niching genetic algorithm. IEEE Trans Syst Man Cybern Part B Cybern 35(6):56–67CrossRef Sheng W, Swift S, Zhang L, Liu X (2005) A weighted sum validity function for clustering with a hybrid niching genetic algorithm. IEEE Trans Syst Man Cybern Part B Cybern 35(6):56–67CrossRef
42.
43.
Zurück zum Zitat Su M-C, Chou C-H (2001) A modified version of the K-means algorithm with a distance based on cluster symmetry. IEEE Trans Pattern Anal Mach Intell 23(6):674–680CrossRef Su M-C, Chou C-H (2001) A modified version of the K-means algorithm with a distance based on cluster symmetry. IEEE Trans Pattern Anal Mach Intell 23(6):674–680CrossRef
44.
Zurück zum Zitat Tvrdk J, Kiv I (2015) Hybrid differential evolution algorithm for optimal clustering. Appl Soft Comput 35:502–512CrossRef Tvrdk J, Kiv I (2015) Hybrid differential evolution algorithm for optimal clustering. Appl Soft Comput 35:502–512CrossRef
45.
Zurück zum Zitat Xie XL, Beni G (1991) A validity measure for fuzzy clustering. IEEE Trans Pattern Anal Mach Intell 13(8):841–847CrossRef Xie XL, Beni G (1991) A validity measure for fuzzy clustering. IEEE Trans Pattern Anal Mach Intell 13(8):841–847CrossRef
46.
Zurück zum Zitat Yang X-S, Deb S (2012) Two-stage eagle strategy with differential evolution. IJBIC 4(1):1–5CrossRef Yang X-S, Deb S (2012) Two-stage eagle strategy with differential evolution. IJBIC 4(1):1–5CrossRef
47.
Zurück zum Zitat Yang X-S, Deb S (2014) Cuckoo search: recent advances and applications. CoRR, abs/1408.5316 Yang X-S, Deb S (2014) Cuckoo search: recent advances and applications. CoRR, abs/1408.5316
48.
Zurück zum Zitat Zhang L, Ma M, Liu X, Sun C, Liu M, Zhou C (2006) Differential evolution fuzzy clustering algorithm based on kernel methods. In: Rough sets and knowledge technology, first international conference, RSKT 2006, Chongqing, China, July 24–26, 2006, Proceedings, pp 430–435 Zhang L, Ma M, Liu X, Sun C, Liu M, Zhou C (2006) Differential evolution fuzzy clustering algorithm based on kernel methods. In: Rough sets and knowledge technology, first international conference, RSKT 2006, Chongqing, China, July 24–26, 2006, Proceedings, pp 430–435
Metadaten
Titel
Exploring differential evolution and particle swarm optimization to develop some symmetry-based automatic clustering techniques: application to gene clustering
verfasst von
Sriparna Saha
Ranjita Das
Publikationsdatum
01.02.2017
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 3/2018
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-016-2710-0

Weitere Artikel der Ausgabe 3/2018

Neural Computing and Applications 3/2018 Zur Ausgabe

Premium Partner