Skip to main content
Top

2017 | OriginalPaper | Chapter

Learning Commonalities in RDF

Authors : Sara El Hassad, François Goasdoué, Hélène Jaudoin

Published in: The Semantic Web

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Finding the commonalities between descriptions of data or knowledge is a foundational reasoning problem of Machine Learning introduced in the 70’s, which amounts to computing a least general generalization (\(\mathtt {lgg}\)) of such descriptions. It has also started receiving consideration in Knowledge Representation from the 90’s, and recently in the Semantic Web field. We revisit this problem in the popular Resource Description Framework (RDF) of W3C, where descriptions are RDF graphs, i.e., a mix of data and knowledge. Notably, and in contrast to the literature, our solution to this problem holds for the entire RDF standard, i.e., we do not restrict RDF graphs in any way (neither their structure nor their semantics based on RDF entailment, i.e., inference) and, further, our algorithms can compute \(\mathtt {lgg}\)s of small-to-huge RDF graphs.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The merge of RDF graphs is their union after renaming their blank nodes, so that these RDF graphs do not join on such values which are local to them (Sect. 2).
 
2
We assume w.l.o.g. that input and output data of an MR job is stored on disk, like in Hadoop, while it can also reside in in-memory shared data structures, like in Spark.
 
3
An \(\mathcal {EL}\) concept C recursively translates into the RDF graph rooted in the blank node \(b_r\) returned by the call \(\mathcal {G}(C,b_r)\), with: \(\mathcal {G}(\top ,b)=\emptyset \) for the universal \(\mathcal {EL}\) concept \(\top \), \(\mathcal {G}(A,b)=\{(b,\tau ,A)\}\) for an atomic \(\mathcal {EL}\) concept A, \(\mathcal {G}(\exists r.C,b)=\{(b,r,b')\} \cup \mathcal {G}(C,b')\), with \(b'\) a fresh blank node, for an \(\mathcal {EL}\) existential restriction \(\exists r.C\), and \(\mathcal {G}(C_1 \sqcap C_2,b)= \mathcal {G}(C_1,b)\cup \mathcal {G}(C_2,b)\) for an \(\mathcal {EL}\) conjunction \(C_1 \sqcap C_2\); the \(\mathcal {EL}\) constraints \(A_1 \sqsubseteq A_2\) and \(\exists r.\top \sqsubseteq A\) correspond to \((A_1,\preceq _\mathrm {sc},A_2)\) and \((r,\hookleftarrow _d,A)\) resp.
 
Literature
9.
go back to reference Baader, F., Sertkaya, B., Turhan, A.Y.: Computing the least common subsumer w.r.t. a background terminology. J. Appl. Logic 5(3), 392–420 (2007)MathSciNetCrossRef Baader, F., Sertkaya, B., Turhan, A.Y.: Computing the least common subsumer w.r.t. a background terminology. J. Appl. Logic 5(3), 392–420 (2007)MathSciNetCrossRef
10.
go back to reference Baget, J., Croitoru, M., Gutierrez, A., Leclère, M., Mugnier, M.: Translations between RDF(S) and conceptual graphs. In: ICCS (2010) Baget, J., Croitoru, M., Gutierrez, A., Leclère, M., Mugnier, M.: Translations between RDF(S) and conceptual graphs. In: ICCS (2010)
11.
go back to reference Chein, M., Mugnier, M.: Graph-Based Knowledge Representation - Computational Foundations of Conceptual Graphs. Springer, London (2009)MATH Chein, M., Mugnier, M.: Graph-Based Knowledge Representation - Computational Foundations of Conceptual Graphs. Springer, London (2009)MATH
12.
go back to reference Cohen, W.W., Borgida, A., Hirsh, H.: Computing least common subsumers in description logics. In: AAAI (1992) Cohen, W.W., Borgida, A., Hirsh, H.: Computing least common subsumers in description logics. In: AAAI (1992)
13.
go back to reference Colucci, S., Donini, F., Giannini, S., Sciascio, E.D.: Defining and computing least common subsumers in RDF. J. Web Semant. 39, 62–80 (2016)CrossRef Colucci, S., Donini, F., Giannini, S., Sciascio, E.D.: Defining and computing least common subsumers in RDF. J. Web Semant. 39, 62–80 (2016)CrossRef
14.
go back to reference Colucci, S., Donini, F.M., Sciascio, E.D.: Common subsumbers in RDF. In: AI*IA (2013)CrossRef Colucci, S., Donini, F.M., Sciascio, E.D.: Common subsumbers in RDF. In: AI*IA (2013)CrossRef
15.
go back to reference Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. In: OSDI (2004) Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. In: OSDI (2004)
17.
go back to reference Garcia-Molina, H., Ullman, J.D., Widom, J.: Database Systems - The Complete Book. Pearson Education, Harlow (2009) Garcia-Molina, H., Ullman, J.D., Widom, J.: Database Systems - The Complete Book. Pearson Education, Harlow (2009)
18.
go back to reference Goasdoué, F., Kaoudi, Z., Manolescu, I., Quiané-Ruiz, J., Zampetakis, S.: Cliquesquare: flat plans for massively parallel RDF queries. In: ICDE (2015) Goasdoué, F., Kaoudi, Z., Manolescu, I., Quiané-Ruiz, J., Zampetakis, S.: Cliquesquare: flat plans for massively parallel RDF queries. In: ICDE (2015)
19.
go back to reference Küsters, R.: Non-standard Inferences in Description Logics. LNCS, vol. 2100. Springer, Heidelberg (2001)MATH Küsters, R.: Non-standard Inferences in Description Logics. LNCS, vol. 2100. Springer, Heidelberg (2001)MATH
20.
go back to reference Lehmann, J., Bühmann, L.: AutoSPARQL: let users query your knowledge base. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., Leenheer, P., Pan, J. (eds.) ESWC 2011. LNCS, vol. 6643, pp. 63–79. Springer, Heidelberg (2011). doi:10.1007/978-3-642-21034-1_5CrossRef Lehmann, J., Bühmann, L.: AutoSPARQL: let users query your knowledge base. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., Leenheer, P., Pan, J. (eds.) ESWC 2011. LNCS, vol. 6643, pp. 63–79. Springer, Heidelberg (2011). doi:10.​1007/​978-3-642-21034-1_​5CrossRef
22.
go back to reference Neumann, T., Weikum, G.: The RDF-3X engine for scalable management of RDF data. VLDB J. 19(1), 91–113 (2010)CrossRef Neumann, T., Weikum, G.: The RDF-3X engine for scalable management of RDF data. VLDB J. 19(1), 91–113 (2010)CrossRef
23.
go back to reference Papailiou, N., Tsoumakos, D., Konstantinou, I., Karras, P., Koziris, N.: H\({}_{\text{2}}\)rdf+: an efficient data management system for big RDF graphs. In: SIGMOD (2014) Papailiou, N., Tsoumakos, D., Konstantinou, I., Karras, P., Koziris, N.: H\({}_{\text{2}}\)rdf+: an efficient data management system for big RDF graphs. In: SIGMOD (2014)
24.
go back to reference Pichler, R., Polleres, A., Skritek, S., Woltran, S.: Complexity of redundancy detection on RDF graphs in the presence of rules, constraints, and queries. Semant. Web 4(4), 351–393 (2013) Pichler, R., Polleres, A., Skritek, S., Woltran, S.: Complexity of redundancy detection on RDF graphs in the presence of rules, constraints, and queries. Semant. Web 4(4), 351–393 (2013)
26.
27.
go back to reference Ramakrishnan, R., Gehrke, J.: Database Management Systems. McGraw-Hill, New York (2003)MATH Ramakrishnan, R., Gehrke, J.: Database Management Systems. McGraw-Hill, New York (2003)MATH
28.
29.
go back to reference Robinson, J.A., Voronkov, A. (eds.): Handbook of Automated Reasoning. Elsevier and MIT Press, Weidenbach (2001)MATH Robinson, J.A., Voronkov, A. (eds.): Handbook of Automated Reasoning. Elsevier and MIT Press, Weidenbach (2001)MATH
30.
go back to reference Urbani, J., Kotoulas, S., Maassen, J., van Harmelen, F., Bal, H.E.: WebPIE: a web-scale parallel inference engine using MapReduce. J. Web Semant. 10, 59–75 (2012)CrossRef Urbani, J., Kotoulas, S., Maassen, J., van Harmelen, F., Bal, H.E.: WebPIE: a web-scale parallel inference engine using MapReduce. J. Web Semant. 10, 59–75 (2012)CrossRef
33.
go back to reference Zarrieß, B., Turhan, A.: Most specific generalizations w.r.t. general EL-TBoxes. In: IJCAI (2013) Zarrieß, B., Turhan, A.: Most specific generalizations w.r.t. general EL-TBoxes. In: IJCAI (2013)
Metadata
Title
Learning Commonalities in RDF
Authors
Sara El Hassad
François Goasdoué
Hélène Jaudoin
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-58068-5_31