Skip to main content

2017 | OriginalPaper | Buchkapitel

Extending the Genomic Data Model and the Genometric Query Language with Domain Taxonomies

verfasst von : Eleonora Cappelli, Emanuel Weitschek

Erschienen in: Web Engineering

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In bioinformatics and biology researchers annotate experimental data in many different ways. When other researchers need to query these data, they are typically unaware of the specificity of the annotations; often they encounter possible mismatches between the granularity of the query and the granularity of the annotations. In this work, we propose an extension of the Genomic Data Model and the GenoMetric Query Language (a well established framework for biomedical data), able to search, integrate, and extend genomic data. The extension is going to be performed through domain taxonomies and by considering many external ontologies and databases. An ad-hoc software system and query language will be implemented for the storage, management, search, retrieval, and integration of biomedical data.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Weitschek, E., Cumbo, F., Cappelli, E., Felici, G.: Genomic data integration: a case study on next generation sequencing of cancer. In: 2016 27th International Workshop on Database and Expert Systems Applications (DEXA), pp. 49–53. IEEE (2016) Weitschek, E., Cumbo, F., Cappelli, E., Felici, G.: Genomic data integration: a case study on next generation sequencing of cancer. In: 2016 27th International Workshop on Database and Expert Systems Applications (DEXA), pp. 49–53. IEEE (2016)
2.
Zurück zum Zitat Cumbo, F., Fiscon, G., Ceri, S., Masseroli, M., Weitschek, E.: Tcga2bed: extracting, extending, integrating, and querying the cancer genome atlas. BMC Bioinform. 18(1), 6 (2017)CrossRef Cumbo, F., Fiscon, G., Ceri, S., Masseroli, M., Weitschek, E.: Tcga2bed: extracting, extending, integrating, and querying the cancer genome atlas. BMC Bioinform. 18(1), 6 (2017)CrossRef
3.
Zurück zum Zitat Ceri, S., Kaitoua, A., Masseroli, M., Pinoli, P., Venco, F.: Data management for next generation genomic computing. In: EDBT, pp. 485–490 (2016) Ceri, S., Kaitoua, A., Masseroli, M., Pinoli, P., Venco, F.: Data management for next generation genomic computing. In: EDBT, pp. 485–490 (2016)
4.
Zurück zum Zitat Masseroli, M., Pinoli, P., Venco, F., Kaitoua, A., Jalili, V., Palluzzi, F., Muller, H., Ceri, S.: Genometric query language: a novel approach to large-scale genomic data management. Bioinformatics 31(12), 1881–1888 (2015)CrossRef Masseroli, M., Pinoli, P., Venco, F., Kaitoua, A., Jalili, V., Palluzzi, F., Muller, H., Ceri, S.: Genometric query language: a novel approach to large-scale genomic data management. Bioinformatics 31(12), 1881–1888 (2015)CrossRef
5.
Zurück zum Zitat Martinenghi, D., Torlone, R.: Taxonomy-based relaxation of query answering in relational databases. VLDB J. 23(5), 747–769 (2014)CrossRef Martinenghi, D., Torlone, R.: Taxonomy-based relaxation of query answering in relational databases. VLDB J. 23(5), 747–769 (2014)CrossRef
6.
Zurück zum Zitat Stein, L.: Genome annotation: from sequence to biology. Nat. Rev. Genet. 2(7), 493–503 (2001)CrossRef Stein, L.: Genome annotation: from sequence to biology. Nat. Rev. Genet. 2(7), 493–503 (2001)CrossRef
7.
Zurück zum Zitat Noy, N.F.: Semantic integration: a survey of ontology-based approaches. ACM Sigmod Rec. 33(4), 65–70 (2004)CrossRef Noy, N.F.: Semantic integration: a survey of ontology-based approaches. ACM Sigmod Rec. 33(4), 65–70 (2004)CrossRef
8.
Zurück zum Zitat Wache, H., Voegele, T., Visser, U., Stuckenschmidt, H., Schuster, G., Neumann, H., Hübner, S.: Ontology-based integration of information-a survey of existing approaches. In: IJCAI-01 Workshop: Ontologies and Information Sharing, vol. 2001, pp. 108–117. Citeseer (2001) Wache, H., Voegele, T., Visser, U., Stuckenschmidt, H., Schuster, G., Neumann, H., Hübner, S.: Ontology-based integration of information-a survey of existing approaches. In: IJCAI-01 Workshop: Ontologies and Information Sharing, vol. 2001, pp. 108–117. Citeseer (2001)
9.
Zurück zum Zitat Gene Ontology Consortium et al.: The gene ontology (go) database and informatics resource. Nucleic Acids Res. 32(suppl 1), D258–D261 (2004) Gene Ontology Consortium et al.: The gene ontology (go) database and informatics resource. Nucleic Acids Res. 32(suppl 1), D258–D261 (2004)
10.
Zurück zum Zitat Cates, S.: Ncbi: National center for biotechnology information (2006) Cates, S.: Ncbi: National center for biotechnology information (2006)
11.
Zurück zum Zitat Benson, D.A., Cavanaugh, M., Clark, K., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Sayers, E.W.: Genbank. Nucleic Acids Res. 41(D1), D36–D42 (2013)CrossRef Benson, D.A., Cavanaugh, M., Clark, K., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Sayers, E.W.: Genbank. Nucleic Acids Res. 41(D1), D36–D42 (2013)CrossRef
12.
Zurück zum Zitat Sayers, E.W., Barrett, T., Benson, D.A., Bolton, E., Bryant, S.H., Canese, K., Chetvernin, V., Church, D.M., DiCuccio, M., Federhen, S., et al.: Database resources of the national center for biotechnology information. Nucleic Acids Res. 39(suppl 1), D38–D51 (2011)CrossRef Sayers, E.W., Barrett, T., Benson, D.A., Bolton, E., Bryant, S.H., Canese, K., Chetvernin, V., Church, D.M., DiCuccio, M., Federhen, S., et al.: Database resources of the national center for biotechnology information. Nucleic Acids Res. 39(suppl 1), D38–D51 (2011)CrossRef
13.
Zurück zum Zitat Tatusova, T.A., Karsch-Mizrachi, I., Ostell, J.A.: Complete genomes in www entrez: data representation and analysis. Bioinformatics 15(7), 536–543 (1999)CrossRef Tatusova, T.A., Karsch-Mizrachi, I., Ostell, J.A.: Complete genomes in www entrez: data representation and analysis. Bioinformatics 15(7), 536–543 (1999)CrossRef
14.
Zurück zum Zitat Weinstein, J.N., Collisson, E.A., Mills, G.B., Shaw, K.R.M., Ozenberger, B.A., Ellrott, K., Shmulevich, I., Sander, C., Stuart, J.M., Network, C.G.A.R., et al.: The cancer genome atlas pan-cancer analysis project. Nat. Genet. 45(10), 1113–1120 (2013)CrossRef Weinstein, J.N., Collisson, E.A., Mills, G.B., Shaw, K.R.M., Ozenberger, B.A., Ellrott, K., Shmulevich, I., Sander, C., Stuart, J.M., Network, C.G.A.R., et al.: The cancer genome atlas pan-cancer analysis project. Nat. Genet. 45(10), 1113–1120 (2013)CrossRef
15.
Zurück zum Zitat Wick, M.: Geonames. GeoNames Geographical Database (2011) Wick, M.: Geonames. GeoNames Geographical Database (2011)
16.
Zurück zum Zitat Uniprot Consortium et al.: Uniprot: a hub for protein information. Nucleic Acids Res. 43, gku989 (2014) Uniprot Consortium et al.: Uniprot: a hub for protein information. Nucleic Acids Res. 43, gku989 (2014)
17.
Zurück zum Zitat Blankenberg, D., Kuster, G.V., Coraor, N., Ananda, G., Lazarus, R., Mangan, M., Nekrutenko, A., Taylor, J.: Galaxy: a web-based genome analysis tool for experimentalists. Curr. Protoc. Mol. Biol. 10, 1–21 (2010) Blankenberg, D., Kuster, G.V., Coraor, N., Ananda, G., Lazarus, R., Mangan, M., Nekrutenko, A., Taylor, J.: Galaxy: a web-based genome analysis tool for experimentalists. Curr. Protoc. Mol. Biol. 10, 1–21 (2010)
Metadaten
Titel
Extending the Genomic Data Model and the Genometric Query Language with Domain Taxonomies
verfasst von
Eleonora Cappelli
Emanuel Weitschek
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-60131-1_44