Skip to main content
Top
Published in: Soft Computing 23/2020

11-08-2020 | Focus

cPEA: a parallel method to perform pathway enrichment analysis using multiple pathways databases

Authors: Giuseppe Agapito, Mario Cannataro

Published in: Soft Computing | Issue 23/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Genes/proteins are essential to activate or inhibit biological pathways both inside or outside the cells in each living organism. The key to understand the functional roles of genes/proteins is the deduction of the relationship between pathways and genes/proteins. To understand the role of genes/proteins in a biological context, we can use pathway enrichment analysis (PEA), an essential method in omics research, to identify the biological role of genes/proteins. A large number of PEA methods and tools are available; nevertheless, only a few can perform PEA exploiting information coming from multiple databases in the same analysis. Many of these databases were initially developed to use their pathway representation format, resulting in a heterogeneous collection of resources that are extremely difficult to combine and use. Soft computing enables approximate solutions for problems challenging to solve precisely, such as merging and integrating structured and unstructured data, or data from different databases. The integration and merging of biological pathways from diverse data sources are challenging due to the different pathway data representations used. The use of parallel preprocessing methods to deal with approximation and imprecision can contribute to integrate heterogeneous pathway data. We implemented an automatic methodology to perform PEA using pathways coming from different databases and a method to compute topological scores to rank enriched pathways. This methodology is available in a software framework called cross-pathway enrichment analysis. The obtained results show good performance in terms of execution times and reduced memory consumption, allowing to improve PEA by using pathways coming from different databases.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A (2013) NCBI GEO: archive for functional genomics data sets-update. Nucleic Acids Res 41(Database issue): D991–D995. https://doi.org/10.1093/nar/gks1193 Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A (2013) NCBI GEO: archive for functional genomics data sets-update. Nucleic Acids Res 41(Database issue): D991–D995. https://​doi.​org/​10.​1093/​nar/​gks1193
go back to reference Collins FS, Morgan M, Patrinos A (2003) The human genome project: lessons from large-scale biology. Science 300(5617):286–290CrossRef Collins FS, Morgan M, Patrinos A (2003) The human genome project: lessons from large-scale biology. Science 300(5617):286–290CrossRef
go back to reference Dennis G, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA (2003) David: database for annotation, visualization, and integrated discovery. Genome Biol 4(9):R60CrossRef Dennis G, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA (2003) David: database for annotation, visualization, and integrated discovery. Genome Biol 4(9):R60CrossRef
go back to reference Fung KY, Brierley GV, Henderson S, Hoffmann P, McColl SR, Lockett T, Head R, Cosgrove L (2011) Butyrate-induced apoptosis in hct116 colorectal cancer cells includes induction of a cell stress response. J Proteome Res 10(4):1860–1869CrossRef Fung KY, Brierley GV, Henderson S, Hoffmann P, McColl SR, Lockett T, Head R, Cosgrove L (2011) Butyrate-induced apoptosis in hct116 colorectal cancer cells includes induction of a cell stress response. J Proteome Res 10(4):1860–1869CrossRef
go back to reference Gu Z, Wang J (2013) CEPA: an R package for finding significant pathways weighted by multiple network centralities. Bioinformatics 29(5):658–660CrossRef Gu Z, Wang J (2013) CEPA: an R package for finding significant pathways weighted by multiple network centralities. Bioinformatics 29(5):658–660CrossRef
go back to reference Hughes R, Magee E, Bingham S et al (2000) Protein degradation in the large intestine: relevance to colorectal cancer. Curr Issues Intest Microbiol 1(2):51–58 Hughes R, Magee E, Bingham S et al (2000) Protein degradation in the large intestine: relevance to colorectal cancer. Curr Issues Intest Microbiol 1(2):51–58
go back to reference Khatri P, Sirota M, Butte AJ (2012) Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol 8(2):e1002375CrossRef Khatri P, Sirota M, Butte AJ (2012) Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol 8(2):e1002375CrossRef
go back to reference Liu D, Dai E, Liu H, Yang Q, Meng Q, Zhou S, Wang S, Jiang W, Jiang B (2017) Pathway enrichment analysis approach based on topological structure and updated annotation of pathway. Brief Bioinform 20(1):168–177 Liu D, Dai E, Liu H, Yang Q, Meng Q, Zhou S, Wang S, Jiang W, Jiang B (2017) Pathway enrichment analysis approach based on topological structure and updated annotation of pathway. Brief Bioinform 20(1):168–177
go back to reference O’Dushlaine C, Kenny E, Heron EA, Segurado R, Gill M, Morris DW, Corvin A (2009) The snp ratio test: pathway analysis of genome-wide association datasets. Bioinformatics 25(20):2762–2763CrossRef O’Dushlaine C, Kenny E, Heron EA, Segurado R, Gill M, Morris DW, Corvin A (2009) The snp ratio test: pathway analysis of genome-wide association datasets. Bioinformatics 25(20):2762–2763CrossRef
go back to reference Ramanan VK, Shen L, Moore JH, Saykin AJ (2012) Pathway analysis of genomic data: concepts, methods, and prospects for future development. Trends Genet 28(7):323–332CrossRef Ramanan VK, Shen L, Moore JH, Saykin AJ (2012) Pathway analysis of genomic data: concepts, methods, and prospects for future development. Trends Genet 28(7):323–332CrossRef
go back to reference Simão ÉM, Sinigaglia M, Bugs CA, Castro MAA, Librelotto GR, Alves R, Mombach JCM (2012) Induced genome maintenance pathways in pre-cancer tissues describe an anti-cancer barrier in tumor development. Mol BioSyst 8(11):3003–3009CrossRef Simão ÉM, Sinigaglia M, Bugs CA, Castro MAA, Librelotto GR, Alves R, Mombach JCM (2012) Induced genome maintenance pathways in pre-cancer tissues describe an anti-cancer barrier in tumor development. Mol BioSyst 8(11):3003–3009CrossRef
go back to reference Tomonaga T, Matsushita K, Yamaguchi S, Oh-Ishi M, Kodera Y, Maeda T, Shimada H, Ochiai T, Nomura F (2004) Identification of altered protein expression and post-translational modifications in primary colorectal cancer by using agarose two-dimensional gel electrophoresis. Clin Cancer Res 10(6):2007–2014CrossRef Tomonaga T, Matsushita K, Yamaguchi S, Oh-Ishi M, Kodera Y, Maeda T, Shimada H, Ochiai T, Nomura F (2004) Identification of altered protein expression and post-translational modifications in primary colorectal cancer by using agarose two-dimensional gel electrophoresis. Clin Cancer Res 10(6):2007–2014CrossRef
go back to reference Wang K, Li M, Hakonarson H (2010) Analysing biological pathways in genome-wide association studies. Nat Rev Genet 11(12):843–854CrossRef Wang K, Li M, Hakonarson H (2010) Analysing biological pathways in genome-wide association studies. Nat Rev Genet 11(12):843–854CrossRef
Metadata
Title
cPEA: a parallel method to perform pathway enrichment analysis using multiple pathways databases
Authors
Giuseppe Agapito
Mario Cannataro
Publication date
11-08-2020
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 23/2020
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-020-05243-6

Other articles of this Issue 23/2020

Soft Computing 23/2020 Go to the issue

Premium Partner