Skip to main content
Top

2018 | OriginalPaper | Chapter

Incorporating Gene Ontology Information in Gene Expression Data Clustering Using Multiobjective Evolutionary Optimization: Application in Yeast Cell Cycle Data

Author : Anirban Mukhopadhyay

Published in: Multi-Objective Optimization

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The advances in microarray technology have allowed the biologists to simultaneously study the levels of expression of a large set of genes over different time points. Clustering algorithms are used to discover groups of genes that are similarly expressed over all the experimental conditions. In this chapter, an approach for combining experimental gene expression information and biological information in the form of Gene Ontology (GO) knowledge through multiobjective clustering has been presented. The proposed method combines the expression-based and GO-based measures to compute the distances between the genes. Moreover, it simultaneously optimizes two objective functions, one from gene expression point of view and another from GO point of view. The performance of the proposed technique has been demonstrated on real-life gene expression dataset of yeast cell cycle. Moreover, biological relevance studies have been conducted for the produced clusters to demonstrate the effectiveness of the proposed technique.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference A.A. Alizadeh, M.B. Eisen, R. Davis, C. Ma, I. Lossos, A. Rosenwald, J. Boldrick, R. Warnke, R. Levy, W. Wilson, M. Grever, J. Byrd, D. Botstein, P.O. Brown, L.M. Straudt, Distinct types of diffuse large B-cell lymphomas identified by gene expression profiling. Nature 403, 503–511 (2000)CrossRef A.A. Alizadeh, M.B. Eisen, R. Davis, C. Ma, I. Lossos, A. Rosenwald, J. Boldrick, R. Warnke, R. Levy, W. Wilson, M. Grever, J. Byrd, D. Botstein, P.O. Brown, L.M. Straudt, Distinct types of diffuse large B-cell lymphomas identified by gene expression profiling. Nature 403, 503–511 (2000)CrossRef
go back to reference M. Ashburner, C.A. Ball, J.A. Blake, D. Botstein, H. Butler, J.M. Cherry, A.P. Davis, K. Dolinski, S.S. Dwight, J.T. Eppig, M.A. Harris, D.P. Hill, L. Issel-Tarver, A. Kasarskis, S. Lewis, J.C. Matese, J.E. Richardson, M. Ringwald, G.M. Rubin, G. Sherlock, Gene Ontology: tool for the unification of biology. The gene ontology consortium. Na. Genet. 25, 25–29 (2000)CrossRef M. Ashburner, C.A. Ball, J.A. Blake, D. Botstein, H. Butler, J.M. Cherry, A.P. Davis, K. Dolinski, S.S. Dwight, J.T. Eppig, M.A. Harris, D.P. Hill, L. Issel-Tarver, A. Kasarskis, S. Lewis, J.C. Matese, J.E. Richardson, M. Ringwald, G.M. Rubin, G. Sherlock, Gene Ontology: tool for the unification of biology. The gene ontology consortium. Na. Genet. 25, 25–29 (2000)CrossRef
go back to reference S. Bandyopadhyay, U. Maulik, J.T.L. Wang, Analysis of Biological Data: A Soft Computing Approach (World Scientific, 2007) S. Bandyopadhyay, U. Maulik, J.T.L. Wang, Analysis of Biological Data: A Soft Computing Approach (World Scientific, 2007)
go back to reference S. Bandyopadhyay, A. Mukhopadhyay, U. Maulik, An improved algorithm for clustering gene expression data. Bioinformatics 23(21), 2859–2865 (2007)CrossRef S. Bandyopadhyay, A. Mukhopadhyay, U. Maulik, An improved algorithm for clustering gene expression data. Bioinformatics 23(21), 2859–2865 (2007)CrossRef
go back to reference C.A. Coello Coello, A comprehensive survey of evolutionary-based multiobjective optimization techniques. Knowl. Inf. Syst. 1(3), 129–156 (1999)CrossRef C.A. Coello Coello, A comprehensive survey of evolutionary-based multiobjective optimization techniques. Knowl. Inf. Syst. 1(3), 129–156 (1999)CrossRef
go back to reference C. Coello Coello, Evolutionary multiobjective optimization: a historical view of the field. IEEE Comput. Intell. Mag. 1(1), 28–36 (2006)MathSciNetCrossRef C. Coello Coello, Evolutionary multiobjective optimization: a historical view of the field. IEEE Comput. Intell. Mag. 1(1), 28–36 (2006)MathSciNetCrossRef
go back to reference K. Deb, S. Agrawal, A. Pratap, T. Meyarivan, A fast elitist non-dominated sorting genetic algorithm for multi-objective optimization: NSGA-II, in Proceedings of the Parallel Problem Solving from Nature VI Conference. Lecture Notes in Computer Science No. 1917 (Springer, 2000), pp. 849–858 K. Deb, S. Agrawal, A. Pratap, T. Meyarivan, A fast elitist non-dominated sorting genetic algorithm for multi-objective optimization: NSGA-II, in Proceedings of the Parallel Problem Solving from Nature VI Conference. Lecture Notes in Computer Science No. 1917 (Springer, 2000), pp. 849–858
go back to reference K. Deb, Multi-Objective Optimization Using Evolutionary Algorithms (Wiley, England, 2001)MATH K. Deb, Multi-Objective Optimization Using Evolutionary Algorithms (Wiley, England, 2001)MATH
go back to reference K. Deb, A. Pratap, S. Agrawal, T. Meyarivan, A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6, 182–197 (2002)CrossRef K. Deb, A. Pratap, S. Agrawal, T. Meyarivan, A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6, 182–197 (2002)CrossRef
go back to reference M.B. Eisen, P.T. Spellman, P.O. Brown, D. Botstein, Cluster analysis and display of genome-wide expression patterns, in Proceedings of the National Academy of Sciences, (USA) (1998), pp. 14863–14868 M.B. Eisen, P.T. Spellman, P.O. Brown, D. Botstein, Cluster analysis and display of genome-wide expression patterns, in Proceedings of the National Academy of Sciences, (USA) (1998), pp. 14863–14868
go back to reference K. Fellenberg, N.C. Hauser, B. Brors, A. Neutzner, J.D. Hoheisel, M. Vingron, Correspondence analysis applied to microarray data. Proc. Natl. Acad. Sci. 98(19), 10781–10786 (2001)CrossRef K. Fellenberg, N.C. Hauser, B. Brors, A. Neutzner, J.D. Hoheisel, M. Vingron, Correspondence analysis applied to microarray data. Proc. Natl. Acad. Sci. 98(19), 10781–10786 (2001)CrossRef
go back to reference A.K. Jain, R.C. Dubes, Algorithms for Clustering Data (Prentice-Hall, Englewood Cliffs, NJ, 1988)MATH A.K. Jain, R.C. Dubes, Algorithms for Clustering Data (Prentice-Hall, Englewood Cliffs, NJ, 1988)MATH
go back to reference M. Kanehisa, S. Goto, KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000)CrossRef M. Kanehisa, S. Goto, KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000)CrossRef
go back to reference D. Lin, An information-theoretic definition of similarity, in Proceedings of the 15th International Conference on Machine Learning (ICML-98) (Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1998), pp. 296–304 D. Lin, An information-theoretic definition of similarity, in Proceedings of the 15th International Conference on Machine Learning (ICML-98) (Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1998), pp. 296–304
go back to reference D.J. Lockhart, E.A. Winzeler, Genomics, gene expreesion and DNA arrays. Nature 405, 827–836 (2000)CrossRef D.J. Lockhart, E.A. Winzeler, Genomics, gene expreesion and DNA arrays. Nature 405, 827–836 (2000)CrossRef
go back to reference U. Maulik, A. Mukhopadhyay, S. Bandyopadhyay, Combining pareto-optimal clusters using supervised learning for identifying co-expressed genes. BMC Bioinform. 10(27) (2009) U. Maulik, A. Mukhopadhyay, S. Bandyopadhyay, Combining pareto-optimal clusters using supervised learning for identifying co-expressed genes. BMC Bioinform. 10(27) (2009)
go back to reference U. Maulik, S. Bandyopadhyay, Performance evaluation of some clustering algorithms and validity indices. IEEE Trans. Pattern Anal. Mach. Intell. 24(12), 1650–1654 (2002)CrossRef U. Maulik, S. Bandyopadhyay, Performance evaluation of some clustering algorithms and validity indices. IEEE Trans. Pattern Anal. Mach. Intell. 24(12), 1650–1654 (2002)CrossRef
go back to reference A. Mukhopadhyay, U. Maulik, Unsupervised pixel classification in satellite imagery using multiobjective fuzzy clustering combined with SVM classifier. IEEE Trans. Geosci. Remote Sens. (in press) A. Mukhopadhyay, U. Maulik, Unsupervised pixel classification in satellite imagery using multiobjective fuzzy clustering combined with SVM classifier. IEEE Trans. Geosci. Remote Sens. (in press)
go back to reference A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, A survey of multiobjective evolutionary clustering. ACM Comput. Surv. 47, 61:1–61:46 (2015) A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, A survey of multiobjective evolutionary clustering. ACM Comput. Surv. 47, 61:1–61:46 (2015)
go back to reference A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, Multi-objective genetic algorithm based fuzzy clustering of categorical attributes. IEEE Trans. Evol. Comput. (in press) A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, Multi-objective genetic algorithm based fuzzy clustering of categorical attributes. IEEE Trans. Evol. Comput. (in press)
go back to reference A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, B. Brors, GOGA: GO-driven genetic algorithm-based fuzzy clustering of gene expression data, in 2010 International Conference on Systems in Medicine and Biology (ICSMB) (IEEE, 2010), pp. 221–226 A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, B. Brors, GOGA: GO-driven genetic algorithm-based fuzzy clustering of gene expression data, in 2010 International Conference on Systems in Medicine and Biology (ICSMB) (IEEE, 2010), pp. 221–226
go back to reference A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, C.A.C. Coello, A survey of multiobjective evolutionary algorithms for data mining: part I. IEEE Trans. Evol. Comput. 18(1), 4–19 (2014a)CrossRef A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, C.A.C. Coello, A survey of multiobjective evolutionary algorithms for data mining: part I. IEEE Trans. Evol. Comput. 18(1), 4–19 (2014a)CrossRef
go back to reference A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, C.A.C. Coello, Survey of multiobjective evolutionary algorithms for data mining: part II. IEEE Trans. Evol. Comput. 18(1), 20–35 (2014b)CrossRef A. Mukhopadhyay, U. Maulik, S. Bandyopadhyay, C.A.C. Coello, Survey of multiobjective evolutionary algorithms for data mining: part II. IEEE Trans. Evol. Comput. 18(1), 20–35 (2014b)CrossRef
go back to reference K. Ovaska, M. Laakso, S. Hautaniemi, Fast gene ontology based clustering for microarray experiments. BioData Min. 1(11) (2008) K. Ovaska, M. Laakso, S. Hautaniemi, Fast gene ontology based clustering for microarray experiments. BioData Min. 1(11) (2008)
go back to reference C. Pesquita, D. Faria, H.B.A.O. Falco, F.M. Couto, An information-theoretic definition of similarity, in Proceedings of the 10th Annual Bio-Ontologies Meeting (Bio-Ontologies-07) (2007), pp. 37–40 C. Pesquita, D. Faria, H.B.A.O. Falco, F.M. Couto, An information-theoretic definition of similarity, in Proceedings of the 10th Annual Bio-Ontologies Meeting (Bio-Ontologies-07) (2007), pp. 37–40
go back to reference P. Resnik, Using information content to evaluate semantic similarity in a taxonomy, in Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI-95) (Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1995), pp. 448–453 P. Resnik, Using information content to evaluate semantic similarity in a taxonomy, in Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI-95) (Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1995), pp. 448–453
go back to reference R. Shamir, A. Maron-Katz, A. Tanay, C. Linhart, I. Steinfeld, R. Sharan, Y. Shiloh, R. Elkon, EXPANDER—an integrative program suite for microarray data analysis. BMC Bioinform. 6(232) (2005) R. Shamir, A. Maron-Katz, A. Tanay, C. Linhart, I. Steinfeld, R. Sharan, Y. Shiloh, R. Elkon, EXPANDER—an integrative program suite for microarray data analysis. BMC Bioinform. 6(232) (2005)
go back to reference R. Sharan, M.-K. Adi, R. Shamir, CLICK and EXPANDER: a system for clustering and visualizing gene expression data. Bioinformatics 19, 1787–1799 (2003)CrossRef R. Sharan, M.-K. Adi, R. Shamir, CLICK and EXPANDER: a system for clustering and visualizing gene expression data. Bioinformatics 19, 1787–1799 (2003)CrossRef
go back to reference E. Zitzler, M. Laumanns, L. Thiele, SPEA2: Improving the Strength Pareto Evolutionary Algorithm, Technical Report 103, Gloriastrasse 35, CH-8092 Zurich, Switzerland (2001) E. Zitzler, M. Laumanns, L. Thiele, SPEA2: Improving the Strength Pareto Evolutionary Algorithm, Technical Report 103, Gloriastrasse 35, CH-8092 Zurich, Switzerland (2001)
go back to reference E. Zitzler, L. Thiele, An Evolutionary Algorithm for Multiobjective Optimization: The Strength Pareto Approach, Technical Report 43, Gloriastrasse 35, CH-8092 Zurich, Switzerland (1998) E. Zitzler, L. Thiele, An Evolutionary Algorithm for Multiobjective Optimization: The Strength Pareto Approach, Technical Report 43, Gloriastrasse 35, CH-8092 Zurich, Switzerland (1998)
Metadata
Title
Incorporating Gene Ontology Information in Gene Expression Data Clustering Using Multiobjective Evolutionary Optimization: Application in Yeast Cell Cycle Data
Author
Anirban Mukhopadhyay
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-1471-1_3

Premium Partner