Skip to main content

2019 | OriginalPaper | Buchkapitel

Associating Protein Domains with Biological Functions: A Tripartite Network Approach

verfasst von : Elena Rojano, James Richard Perkins, Ian Sillitoe, Christine Orengo, Juan Antonio García Ranea, Pedro Seoane

Erschienen in: Bioinformatics and Biomedical Engineering

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Protein domains are key determinants of protein function. However, a large number of domains have no recorded functional annotation. These domains of unknown function (DUFs) are a recognised problem and efforts have been made to remedy this situation, including the use of data such as structural and sequence similarity and annotation data such as that of Gene Ontology (GO) and The Enzyme Commission.
Here, we present a new approach based on tripartite network analysis to assign functional terms to DUFs. We combine functional annotation at the protein level, taken from GO, KEGG, Reactome and UniPathway, with structural domain annotation, taken from the CATH-Gene3D resource. We validate our method using 10-fold cross-validation and find it performs well when assigning annotation from the UniPathway, Reactome and GO resources, but less well for KEGG. We also explored using a finer functional subclassification of CATH superfamilies (FunFams) but these families were found to be too specific in this context.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ponting, C.P., Russell, R.R.: The natural history of protein domains. Ann. Rev. Biophys. Biomol. Struct. 31(1), 45–71 (2002)CrossRef Ponting, C.P., Russell, R.R.: The natural history of protein domains. Ann. Rev. Biophys. Biomol. Struct. 31(1), 45–71 (2002)CrossRef
2.
Zurück zum Zitat Bateman, A., Coggill, P., Finn, R.D.: DUFs: families in search of function. Acta Crystallogr. Section F: Struct. Biol. Crystallization Commun. 66(10), 1148–1152 (2010)CrossRef Bateman, A., Coggill, P., Finn, R.D.: DUFs: families in search of function. Acta Crystallogr. Section F: Struct. Biol. Crystallization Commun. 66(10), 1148–1152 (2010)CrossRef
3.
Zurück zum Zitat Dawson, N., Sillitoe, I., Marsden, R.L., Orengo, C.A.: The classification of protein domains. In: Methods in Molecular Biology, pp. 137–164 (2017) Dawson, N., Sillitoe, I., Marsden, R.L., Orengo, C.A.: The classification of protein domains. In: Methods in Molecular Biology, pp. 137–164 (2017)
4.
Zurück zum Zitat Sillitoe, I., et al.: CATH: comprehensive structural and functional annotations for genome sequences. Nucleic Acids Res. 43(Database issue), D376–D381 (2015)CrossRef Sillitoe, I., et al.: CATH: comprehensive structural and functional annotations for genome sequences. Nucleic Acids Res. 43(Database issue), D376–D381 (2015)CrossRef
5.
Zurück zum Zitat Rose, P.W., et al.: The RCSB protein data bank: integrative view of protein, gene and 3D structural information. Nucleic Acids Res. 45(D1), D271–D281 (2017) Rose, P.W., et al.: The RCSB protein data bank: integrative view of protein, gene and 3D structural information. Nucleic Acids Res. 45(D1), D271–D281 (2017)
6.
Zurück zum Zitat Dawson, N.L., et al.: CATH: an expanded resource to predict protein function through structure and sequence. Nucleic Acids Res. 45(D1), D289–D295 (2017)CrossRef Dawson, N.L., et al.: CATH: an expanded resource to predict protein function through structure and sequence. Nucleic Acids Res. 45(D1), D289–D295 (2017)CrossRef
7.
Zurück zum Zitat Lewis, T.E., et al.: Gene3D: extensive prediction of globular domains in proteins. Nucleic Acids Res. 46(D1), D435–D439 (2018)CrossRef Lewis, T.E., et al.: Gene3D: extensive prediction of globular domains in proteins. Nucleic Acids Res. 46(D1), D435–D439 (2018)CrossRef
8.
Zurück zum Zitat Rentzsch, R., Orengo, C.A.: Protein function prediction using domain families. BMC Bioinform. 14(Suppl. 3), 1–14 (2013) Rentzsch, R., Orengo, C.A.: Protein function prediction using domain families. BMC Bioinform. 14(Suppl. 3), 1–14 (2013)
9.
Zurück zum Zitat Carbon, S., et al.: Expansion of the gene ontology knowledgebase and resources: the gene ontology consortium. Nucleic Acids Res. 45(Database issue), D331–D338 (2017) Carbon, S., et al.: Expansion of the gene ontology knowledgebase and resources: the gene ontology consortium. Nucleic Acids Res. 45(Database issue), D331–D338 (2017)
10.
Zurück zum Zitat Ogata, H., Goto, S., Sato, K., Fujibuchi, W., Bono, H., Kanehisa, M.: KEGG: Kyoto encyclopedia of genes and genomes. 27(1), 29–34 (1999) Ogata, H., Goto, S., Sato, K., Fujibuchi, W., Bono, H., Kanehisa, M.: KEGG: Kyoto encyclopedia of genes and genomes. 27(1), 29–34 (1999)
11.
Zurück zum Zitat Fabregat, A., et al.: The Reactome pathway knowledgebase. Nucleic Acids Res. 44(Database issue), D481–D487 (2018)CrossRef Fabregat, A., et al.: The Reactome pathway knowledgebase. Nucleic Acids Res. 44(Database issue), D481–D487 (2018)CrossRef
12.
Zurück zum Zitat Morgat, A., et al.: UniPathway: a resource for the exploration and annotation of metabolic pathways. Nucleic Acids Res. 40(Database issue), D761–D769 (2012)CrossRef Morgat, A., et al.: UniPathway: a resource for the exploration and annotation of metabolic pathways. Nucleic Acids Res. 40(Database issue), D761–D769 (2012)CrossRef
13.
Zurück zum Zitat Rojano, E., Seoane, P., Bueno-Amoros, A., Perkins, J.R., Garcia-Ranea, J.A.: Revealing the relationship between human genome regions and pathological phenotypes through network analysis. In: Rojas, I., Ortuño, F. (eds.) IWBBIO 2017. LNCS, vol. 10208, pp. 197–207. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-56148-6_17CrossRef Rojano, E., Seoane, P., Bueno-Amoros, A., Perkins, J.R., Garcia-Ranea, J.A.: Revealing the relationship between human genome regions and pathological phenotypes through network analysis. In: Rojas, I., Ortuño, F. (eds.) IWBBIO 2017. LNCS, vol. 10208, pp. 197–207. Springer, Cham (2017). https://​doi.​org/​10.​1007/​978-3-319-56148-6_​17CrossRef
14.
Zurück zum Zitat Das, S., Lee, D., Sillitoe, I., Dawson, N.L., Lees, J.G., Orengo, C.A.: Functional classification of CATH superfamilies: a domain-based approach for protein function annotation. Bioinformatics 31(21), 3460–3467 (2015)CrossRef Das, S., Lee, D., Sillitoe, I., Dawson, N.L., Lees, J.G., Orengo, C.A.: Functional classification of CATH superfamilies: a domain-based approach for protein function annotation. Bioinformatics 31(21), 3460–3467 (2015)CrossRef
15.
Zurück zum Zitat Lopez, D., Pazos, F.: Gene ontology functional annotations at the structural domain level. Proteins: Struct. Funct. Bioinform. 76(3), 598–607 (2009)CrossRef Lopez, D., Pazos, F.: Gene ontology functional annotations at the structural domain level. Proteins: Struct. Funct. Bioinform. 76(3), 598–607 (2009)CrossRef
16.
Zurück zum Zitat Bass, J.I., Diallo, A., Nelson, J., Soto, J.M., Myers, C.L., Walhout, A.J.: Using networks to measure similarity between genes: association index selection. Nat. Methods 10(12), 1169–1176 (2013)CrossRef Bass, J.I., Diallo, A., Nelson, J., Soto, J.M., Myers, C.L., Walhout, A.J.: Using networks to measure similarity between genes: association index selection. Nat. Methods 10(12), 1169–1176 (2013)CrossRef
17.
Zurück zum Zitat Cassandri, M., et al.: Zinc-finger proteins in health and disease. Cell Death Discov. 3, 17071 (2017)CrossRef Cassandri, M., et al.: Zinc-finger proteins in health and disease. Cell Death Discov. 3, 17071 (2017)CrossRef
Metadaten
Titel
Associating Protein Domains with Biological Functions: A Tripartite Network Approach
verfasst von
Elena Rojano
James Richard Perkins
Ian Sillitoe
Christine Orengo
Juan Antonio García Ranea
Pedro Seoane
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-17935-9_15