Published in: Knowledge and Information Systems 4/2021

13.01.2021 | Regular Paper

Distributed processing of regular path queries in RDF graphs

Authors: Xintong Guo, Hong Gao, Zhaonian Zou


Abstract

SPARQL 1.1 offers a type of navigational query for RDF systems called the regular path query (RPQ). An RPQ retrieves node pairs connected by paths whose label sequences match a regular expression. RPQs are difficult to evaluate efficiently because the search space can be very large, and so far there has been no scalable, practical solution. In this paper, we present Leon+, an in-memory distributed framework that addresses the RPQ problem in the context of knowledge graphs. To reduce the search space and mitigate mounting communication costs, Leon+ takes advantage of join-ahead pruning via a novel RDF summarization technique together with a path partitioning strategy. We also develop a subtle cost model to devise query plans that achieve high efficiency for complex RPQs. Because no RPQ benchmark is available, we create micro-benchmarks on both synthetic and real-world datasets. A thorough experimental evaluation compares our approach with state-of-the-art RDF stores. The results show that our approach is 5x faster than the competitors on single RPQs. For query workloads, it saves up to 1/2 of the time and 2/3 of the communication overhead compared with the baseline method.
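To make the notion of an RPQ concrete, the following is a minimal single-machine sketch in Python. It is not the Leon+ algorithm: it uses the textbook approach of a breadth-first search over the product of the graph and a finite automaton for the regular expression, and it performs no summarization, partitioning, or distribution. The function name `evaluate_rpq`, the hand-built automaton, and the toy triples are all illustrative assumptions, not taken from the paper.

```python
from collections import defaultdict, deque


def evaluate_rpq(triples, nfa, start_state, accept_states):
    """Return every (source, target) pair linked by a path whose sequence
    of edge labels is accepted by the automaton.

    triples: iterable of (subject, predicate, object) tuples
    nfa:     dict mapping (state, label) -> set of successor states
    """
    out_edges = defaultdict(list)
    nodes = set()
    for s, p, o in triples:
        out_edges[s].append((p, o))
        nodes.update((s, o))

    answers = set()
    for source in nodes:
        # BFS over (graph node, automaton state) pairs; the visited set
        # guarantees termination even when the graph contains cycles.
        seen = {(source, start_state)}
        queue = deque(seen)
        while queue:
            node, state = queue.popleft()
            if state in accept_states:
                answers.add((source, node))
            for label, next_node in out_edges.get(node, ()):
                for next_state in nfa.get((state, label), ()):
                    pair = (next_node, next_state)
                    if pair not in seen:
                        seen.add(pair)
                        queue.append(pair)
    return answers


# Hypothetical toy graph with a cycle among three people.
triples = [
    ("alice", "knows", "bob"),
    ("bob", "knows", "carol"),
    ("carol", "knows", "alice"),
    ("carol", "worksAt", "acme"),
]

# Hand-built automaton for the property path  knows+/worksAt :
# state 0 --knows--> 1, state 1 --knows--> 1, state 1 --worksAt--> 2.
nfa = {
    (0, "knows"): {1},
    (1, "knows"): {1},
    (1, "worksAt"): {2},
}

result = evaluate_rpq(triples, nfa, start_state=0, accept_states={2})
print(sorted(result))
```

Even on this toy input the search starts from every node and tracks one visited set per source, which hints at why the search space grows quickly on large graphs and why pruning and partitioning matter in a distributed setting.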

Metadata
Title
Distributed processing of regular path queries in RDF graphs
Authors
Xintong Guo
Hong Gao
Zhaonian Zou
Publication date
13.01.2021
Publisher
Springer London
Published in
Knowledge and Information Systems / Issue 4/2021
Print ISSN: 0219-1377
Electronic ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-020-01536-2
