
2018 | OriginalPaper | Book chapter

Incorporating Gene Ontology Information in Gene Expression Data Clustering Using Multiobjective Evolutionary Optimization: Application in Yeast Cell Cycle Data

Written by: Anirban Mukhopadhyay

Published in: Multi-Objective Optimization

Publisher: Springer Singapore


Abstract

Advances in microarray technology allow biologists to study the expression levels of a large set of genes simultaneously over different time points. Clustering algorithms are used to discover groups of genes that are similarly expressed across all experimental conditions. This chapter presents an approach that combines experimental gene expression information with biological knowledge from the Gene Ontology (GO) through multiobjective clustering. The proposed method combines expression-based and GO-based measures to compute the distances between genes, and it simultaneously optimizes two objective functions, one from the gene expression point of view and the other from the GO point of view. The performance of the proposed technique is demonstrated on a real-life yeast cell cycle gene expression dataset. In addition, biological relevance studies of the resulting clusters are conducted to demonstrate the effectiveness of the technique.
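The general idea in the abstract can be illustrated with a minimal sketch. It is not the chapter's implementation: the abstract does not say how the two measures are combined or which objective functions are used. The sketch assumes a correlation-based expression distance, a precomputed GO distance matrix (here a toy matrix, `d_go`), a hypothetical weight `w` for a convex combination, and a simple within-cluster compactness objective evaluated once per distance. All function names are illustrative.

```python
import numpy as np

def expression_distance(X):
    """Pairwise 1 - Pearson correlation between expression profiles (rows of X)."""
    return 1.0 - np.corrcoef(X)

def combined_distance(d_expr, d_go, w=0.5):
    """Convex combination of two distance matrices; w is a hypothetical weight."""
    return w * d_expr + (1.0 - w) * d_go

def compactness(D, labels):
    """Sum of pairwise within-cluster distances under D (lower is better)."""
    labels = np.asarray(labels)
    total = 0.0
    for c in np.unique(labels):
        idx = np.where(labels == c)[0]
        # Each unordered pair appears twice in the symmetric submatrix.
        total += D[np.ix_(idx, idx)].sum() / 2.0
    return total

def dominates(a, b):
    """True if objective vector a Pareto-dominates b (both minimized)."""
    a, b = np.asarray(a), np.asarray(b)
    return bool(np.all(a <= b) and np.any(a < b))

def pareto_front(objs):
    """Indices of the non-dominated objective vectors."""
    return [i for i, a in enumerate(objs)
            if not any(dominates(b, a) for j, b in enumerate(objs) if j != i)]

# Toy data (assumed): 4 genes measured at 3 time points.
X = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.5],
              [3.0, 2.0, 1.0],
              [6.0, 4.0, 2.5]])
d_expr = expression_distance(X)

# Toy GO distance matrix (assumed): disagrees with the expression data on purpose.
d_go = np.array([[0.0, 0.9, 0.1, 0.9],
                 [0.9, 0.0, 0.9, 0.1],
                 [0.1, 0.9, 0.0, 0.9],
                 [0.9, 0.1, 0.9, 0.0]])

# A single blended distance, as one way to "combine" the two measures.
d_comb = combined_distance(d_expr, d_go, w=0.5)

# Three candidate partitions, each scored on both objectives.
candidates = [[0, 0, 1, 1], [0, 1, 0, 1], [0, 1, 1, 0]]
objs = [(compactness(d_expr, p), compactness(d_go, p)) for p in candidates]
front = pareto_front(objs)
print("objective vectors:", objs)
print("non-dominated partitions:", [candidates[i] for i in front])
```

In this toy setting the expression data and the GO matrix favor different groupings, so more than one partition can remain non-dominated. A multiobjective evolutionary algorithm would search over many such partitions rather than enumerate three by hand.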

Metadata
Title
Incorporating Gene Ontology Information in Gene Expression Data Clustering Using Multiobjective Evolutionary Optimization: Application in Yeast Cell Cycle Data
Written by
Anirban Mukhopadhyay
Copyright year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-1471-1_3
