Skip to main content

2015 | OriginalPaper | Buchkapitel

Multi-objective Optimization for Clustering Microarray Gene Expression Data - A Comparative Study

verfasst von : Muhammad Marwan Muhammad Fuad

Erschienen in: Agent and Multi-Agent Systems: Technologies and Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Clustering is one of the main data mining tasks. It can be performed on a fuzzy or a crisp basis. Fuzzy clustering is widely-applied with microarray gene expression data as these data are usually uncertain and imprecise. There are several measures to evaluate the quality of clustering, but their performance is highly related to the dataset to which they are applied. In a previous work the authors proposed using a multi-objective genetic algorithm – based method, NSGA – II, to optimize two clustering validity measures simultaneously. In this paper we use another multi-objective optimizer, NSPSO, which is based on the particle swarm optimization algorithm, to solve the same problem. The experiments we conducted on two microarray gene expression data show that NSPSO is superior to NSGA-II in handling this problem.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat El-Ghazali, T.: Metaheuristics: from design to implementation. Wiley (2009) El-Ghazali, T.: Metaheuristics: from design to implementation. Wiley (2009)
2.
Zurück zum Zitat Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S.: Multiobjective Evolutionary Approach to Fuzzy Clustering of Microarray Data, pp. 303–328. World Scientific, Singapore (2007) Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S.: Multiobjective Evolutionary Approach to Fuzzy Clustering of Microarray Data, pp. 303–328. World Scientific, Singapore (2007)
3.
Zurück zum Zitat Muhammad Fuad, M.M.: Differential evolution versus genetic algorithms: towards symbolic aggregate approximation of non-normalized time series. In: Sixteenth International Database Engineering & Applications Symposium– IDEAS’12, Prague, Czech Republic, Published by BytePress/ACM, 8–10 Aug 2012 Muhammad Fuad, M.M.: Differential evolution versus genetic algorithms: towards symbolic aggregate approximation of non-normalized time series. In: Sixteenth International Database Engineering & Applications Symposium– IDEAS’12, Prague, Czech Republic, Published by BytePress/ACM, 8–10 Aug 2012
4.
Zurück zum Zitat Muhammad Fuad, M.M.: Using differential evolution to set weights to segments with different information content in the piecewise aggregate approximation. In: 16th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, KES 2012, San Sebastian, Spain, Published in Frontiers of Artificial Intelligence and Applications (FAIA), IOS Press, 10–12 Sept 2012 Muhammad Fuad, M.M.: Using differential evolution to set weights to segments with different information content in the piecewise aggregate approximation. In: 16th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, KES 2012, San Sebastian, Spain, Published in Frontiers of Artificial Intelligence and Applications (FAIA), IOS Press, 10–12 Sept 2012
5.
Zurück zum Zitat Han, J., Kamber, M., Pei, J.: Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann (2011) Han, J., Kamber, M., Pei, J.: Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann (2011)
6.
Zurück zum Zitat Larose, D.T.: Discovering Knowledge in Data: An Introduction to Data Mining. Wiley, New York (2005) Larose, D.T.: Discovering Knowledge in Data: An Introduction to Data Mining. Wiley, New York (2005)
7.
Zurück zum Zitat Kanungo, T., Netanyahu, N.S., Wu, A.Y.: An Efficient k-means clustering algorithm: analysis and implementation. IEEE Trans. Pattern Anal. Mach. Intell. 24(7) (2002) Kanungo, T., Netanyahu, N.S., Wu, A.Y.: An Efficient k-means clustering algorithm: analysis and implementation. IEEE Trans. Pattern Anal. Mach. Intell. 24(7) (2002)
8.
Zurück zum Zitat Krzysztof, J.C., Pedrycz, W., Swiniarski, R.W., Kurgan, L.A.: Data Mining: A Knowledge Discovery Approach. Springer-Verlag New York, Inc., Secaucus (2007) Krzysztof, J.C., Pedrycz, W., Swiniarski, R.W., Kurgan, L.A.: Data Mining: A Knowledge Discovery Approach. Springer-Verlag New York, Inc., Secaucus (2007)
9.
Zurück zum Zitat Maulik, U., Bandyopadhyay, S., Mukhopadhyay, A.: Multiobjective Genetic Algorithms for Clustering: Applications in Data Mining and Bioinformatics. Springer (2011) Maulik, U., Bandyopadhyay, S., Mukhopadhyay, A.: Multiobjective Genetic Algorithms for Clustering: Applications in Data Mining and Bioinformatics. Springer (2011)
10.
Zurück zum Zitat Gorunescu, F.: Data mining: Concepts. Blue Publishing House, Cluj-Napoca, Models (2006) Gorunescu, F.: Data mining: Concepts. Blue Publishing House, Cluj-Napoca, Models (2006)
11.
Zurück zum Zitat Muhammad Fuad, M.M.: Differential evolution-based weighted combination of distance metrics for k-means clustering. In: The 3rd International Conference on the Theory and Practice of Natural Computing – TPNC 2014, Granada, Spain. Published in Lecture Notes in Computer Science, vol. 8890, 9–11 Dec 2014 Muhammad Fuad, M.M.: Differential evolution-based weighted combination of distance metrics for k-means clustering. In: The 3rd International Conference on the Theory and Practice of Natural Computing – TPNC 2014, Granada, Spain. Published in Lecture Notes in Computer Science, vol. 8890, 9–11 Dec 2014
12.
Zurück zum Zitat Muhammad Fuad, M.M.: On the application of bio-inspired optimization algorithms to fuzzy c-means clustering of time series. In: The 4th International Conference on Pattern Recognition Applications and Methods - ICPRAM 2015, Lisbon, Portugal. SCITEPRESS Digital Library, 10–12 Jan 2015 Muhammad Fuad, M.M.: On the application of bio-inspired optimization algorithms to fuzzy c-means clustering of time series. In: The 4th International Conference on Pattern Recognition Applications and Methods - ICPRAM 2015, Lisbon, Portugal. SCITEPRESS Digital Library, 10–12 Jan 2015
13.
Zurück zum Zitat Lodhi, H., Muggleton, S.: Elements of Computational Systems Biology. Wiley, New York (2010)CrossRefMATH Lodhi, H., Muggleton, S.: Elements of Computational Systems Biology. Wiley, New York (2010)CrossRefMATH
14.
Zurück zum Zitat Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. (2002) Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. (2002)
15.
Zurück zum Zitat Ma, Q., Xu, D., Iv, P., Shi, Y.: Application of NSGA-II in Parameter Optimization of Extended State Observer. Challenges of Power Engineering and Environment (2007) Ma, Q., Xu, D., Iv, P., Shi, Y.: Application of NSGA-II in Parameter Optimization of Extended State Observer. Challenges of Power Engineering and Environment (2007)
16.
Zurück zum Zitat Haupt, R.L., Haupt, S.E.: Practical Genetic Algorithms with CD-ROM. Wiley-Interscience, (2004) Haupt, R.L., Haupt, S.E.: Practical Genetic Algorithms with CD-ROM. Wiley-Interscience, (2004)
17.
Zurück zum Zitat Li, X.: A Non-dominated Sorting particle swarm optimizer for multi-objective optimization. In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO’2003), Lecture Notes in Computer Science, vol. 2723, pp. 37–48. Springer (2003) Li, X.: A Non-dominated Sorting particle swarm optimizer for multi-objective optimization. In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO’2003), Lecture Notes in Computer Science, vol. 2723, pp. 37–48. Springer (2003)
18.
Zurück zum Zitat Reyes Sierra, M., Coello Coello, C.: Multi-objective particle swarm optimizers: a survey of the state-of-the-art. Int. J. Comput. Intell. Res. 2(3), 287–308 (2006) Reyes Sierra, M., Coello Coello, C.: Multi-objective particle swarm optimizers: a survey of the state-of-the-art. Int. J. Comput. Intell. Res. 2(3), 287–308 (2006)
20.
Zurück zum Zitat Maulik, U., Mukhopadhyay, A., and Bandyopadhyay, S.: Combining pareto-optimal clusters using supervised learning for identifying co-expressed Genes. BMC Bioinfor. 10(27), 1–16, 20 Jan 2009 Maulik, U., Mukhopadhyay, A., and Bandyopadhyay, S.: Combining pareto-optimal clusters using supervised learning for identifying co-expressed Genes. BMC Bioinfor. 10(27), 1–16, 20 Jan 2009
22.
Zurück zum Zitat Xie, X.L., Beni, G.: A validity measure for fuzzy clustering. IEEE Trans. Pattern Anal. Mach. Intell. 13, 841–847 (1991)CrossRef Xie, X.L., Beni, G.: A validity measure for fuzzy clustering. IEEE Trans. Pattern Anal. Mach. Intell. 13, 841–847 (1991)CrossRef
23.
Zurück zum Zitat Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comp. Appl. Math. 20, 53–65 (1987)CrossRefMATH Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comp. Appl. Math. 20, 53–65 (1987)CrossRefMATH
24.
Zurück zum Zitat Petrovic, S.: A comparison between the silhouette index and the davies-bouldin index in labelling IDS clusters. In: Proceedings of the 11th Nordic Workshop of Secure IT Systems (2006) Petrovic, S.: A comparison between the silhouette index and the davies-bouldin index in labelling IDS clusters. In: Proceedings of the 11th Nordic Workshop of Secure IT Systems (2006)
Metadaten
Titel
Multi-objective Optimization for Clustering Microarray Gene Expression Data - A Comparative Study
verfasst von
Muhammad Marwan Muhammad Fuad
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-19728-9_10