Skip to main content

2015 | OriginalPaper | Buchkapitel

Toward RDF Normalization

verfasst von : Regina Ticona-Herrera, Joe Tekli, Richard Chbeir, Sébastien Laborie, Irvin Dongo, Renato Guzman

Erschienen in: Conceptual Modeling

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Billions of RDF triples are currently available on the Web through the Linked Open Data cloud (e.g., DBpedia, LinkedGeoData and New York Times). Governments, universities as well as companies (e.g., BBC, CNN) are also producing huge collections of RDF triples and exchanging them through different serialization formats (e.g., RDF/XML, Turtle, N-Triple, etc.). However, RDF descriptions (i.e., graphs and serializations) are verbose in syntax, often contain redundancies, and could be generated differently even when describing the same resources, which would have a negative impact on their processing. Hence, we propose here an approach to clean and eliminate redundancies from such RDF descriptions as a means of transforming different descriptions of the same information into one representation, which can then be tuned, depending on the target application (information retrieval, compression, etc.). Experimental tests show significant improvements, namely in reducing RDF description loading time and file size.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
We use disparities to designate different serializations of the same information.
 
4
Following the W3C Recommendation, we consider that all the prefixes have to be unique for each namespace.
 
5
DT is a set of datatypes: string, number, date, etc.
 
6
Lang is a set of language tags: @fr, @en, etc.
 
7
\(st_{i}^{+}\), \(u_{i}\), \(p_{i}\), \(bn_{i}\), and \(l_{i}\) represent corresponding extended statements, IRIs, predicates, blank nodes, and literals.
 
8
An unused namespace is a namespace which is mention in the serialization file but which is not use in any of the statements, i.e., it will not appear in the Graph.
 
9
This is comparable to the notion of map function in [4] except that the authors do not consider namespaces.
 
Literatur
1.
Zurück zum Zitat Belleau, F., et al.: Bio2rdf: towards a mashup to build bioinformatics knowledge systems. J. Biomed. Inform. 41(5), 706–716 (2008)CrossRef Belleau, F., et al.: Bio2rdf: towards a mashup to build bioinformatics knowledge systems. J. Biomed. Inform. 41(5), 706–716 (2008)CrossRef
2.
Zurück zum Zitat Fernández, J.D., et al.: Binary rdf representation for publication and exchange (HDT). J. Web Semant. 19, 22–41 (2013)CrossRef Fernández, J.D., et al.: Binary rdf representation for publication and exchange (HDT). J. Web Semant. 19, 22–41 (2013)CrossRef
3.
Zurück zum Zitat Gutierrez, C., et al.: Foundations of semantic web databases. In: PODS 2004, pp. 95–106. ACM (2004) Gutierrez, C., et al.: Foundations of semantic web databases. In: PODS 2004, pp. 95–106. ACM (2004)
5.
Zurück zum Zitat Hayes, J., Gutierrez, C.: Bipartite graphs as intermediate model for RDF. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 47–61. Springer, Heidelberg (2004) CrossRef Hayes, J., Gutierrez, C.: Bipartite graphs as intermediate model for RDF. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 47–61. Springer, Heidelberg (2004) CrossRef
6.
Zurück zum Zitat Jiang, G., et al.: Using semantic web technology to support ICD-11 textual definitions authoring. J. Biomed. Semant. 4, 11 (2013)CrossRef Jiang, G., et al.: Using semantic web technology to support ICD-11 textual definitions authoring. J. Biomed. Semant. 4, 11 (2013)CrossRef
7.
Zurück zum Zitat Kerzazi, A., et al.: A model-based mediator system for biological data integration. In: Journes Scientifiques en Bio-Informatique, pp. 70–77 (2007) Kerzazi, A., et al.: A model-based mediator system for biological data integration. In: Journes Scientifiques en Bio-Informatique, pp. 70–77 (2007)
8.
Zurück zum Zitat Kerzazi, A., et al.: A semantic mediation architecture for RDF data integration. In: SWAP, p. 3 (2008) Kerzazi, A., et al.: A semantic mediation architecture for RDF data integration. In: SWAP, p. 3 (2008)
10.
Zurück zum Zitat Nolin, M.-A., et al.: Building an hiv data mashup using Bio2RDF. Briefings Bioinform. 13(1), 98–106 (2012)CrossRef Nolin, M.-A., et al.: Building an hiv data mashup using Bio2RDF. Briefings Bioinform. 13(1), 98–106 (2012)CrossRef
11.
Zurück zum Zitat Pathak, J., et al.: Lexgrid: a framework for representing, storing, and querying biomedical terminologies from simple to sublime. J. Am. Med. Inform. Assoc. 16(3), 305–315 (2009)CrossRef Pathak, J., et al.: Lexgrid: a framework for representing, storing, and querying biomedical terminologies from simple to sublime. J. Am. Med. Inform. Assoc. 16(3), 305–315 (2009)CrossRef
12.
Zurück zum Zitat Salameh, K., Tekli, J., Chbeir, R.: SVG-to-RDF image Semantization. In: Traina, A.J.M., Traina Jr., C., Cordeiro, R.L.F. (eds.) SISAP 2014. LNCS, vol. 8821, pp. 214–228. Springer, Heidelberg (2014) Salameh, K., Tekli, J., Chbeir, R.: SVG-to-RDF image Semantization. In: Traina, A.J.M., Traina Jr., C., Cordeiro, R.L.F. (eds.) SISAP 2014. LNCS, vol. 8821, pp. 214–228. Springer, Heidelberg (2014)
14.
Zurück zum Zitat Tao, C., et al.: A RDF-base normalized model for biomedical lexical grid. In: The 8th International Semantic Web Conference, p. 2 (2009) Tao, C., et al.: A RDF-base normalized model for biomedical lexical grid. In: The 8th International Semantic Web Conference, p. 2 (2009)
16.
Zurück zum Zitat Vrandecic, D., et al.: RDF syntax normalization using XML validation. In: Proceedings of the SemRUs, p. 11 (2009) Vrandecic, D., et al.: RDF syntax normalization using XML validation. In: Proceedings of the SemRUs, p. 11 (2009)
17.
Zurück zum Zitat Weiss, C., Karras, P., Bernstein, A.: Hexastore: sextuple indexing for semantic web data management. Proc. VLDB Endow. 1(1), 1008–1019 (2008)CrossRef Weiss, C., Karras, P., Bernstein, A.: Hexastore: sextuple indexing for semantic web data management. Proc. VLDB Endow. 1(1), 1008–1019 (2008)CrossRef
Metadaten
Titel
Toward RDF Normalization
verfasst von
Regina Ticona-Herrera
Joe Tekli
Richard Chbeir
Sébastien Laborie
Irvin Dongo
Renato Guzman
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-25264-3_19

Neuer Inhalt