2018 | OriginalPaper | Chapter

An Empirical Evaluation of RDF Graph Partitioning Techniques

Authors: Adnan Akhter, Axel-Cyrille Ngomo Ngonga, Muhammad Saleem

Published in: Knowledge Engineering and Knowledge Management

Publisher: Springer International Publishing

Abstract

With the significant growth of RDF data sources in both number and volume comes the need to improve the scalability of RDF storage and querying solutions. Current implementations employ various RDF graph partitioning techniques. However, choosing the most suitable partitioning technique for a given RDF graph and application is not a trivial task. To the best of our knowledge, no detailed empirical evaluation of the performance of these techniques exists. In this work, we present an empirical evaluation of RDF graph partitioning techniques applied to real-world RDF data sets and benchmark queries. We evaluate the selected techniques in terms of their partitioning time, partitioning imbalance (in partition sizes), and the query runtime performance achieved, based on real-world data sets and queries selected using the FEASIBLE benchmark generation framework.
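
As an illustration of what "partitioning imbalance (in sizes)" can mean in practice, the following minimal sketch assigns triples to partitions by hashing their subject and then scores how unevenly the partitions are filled. Everything in it is an assumption made for illustration: the abstract names neither the partitioning techniques nor the imbalance measure, so subject hashing and the Gini coefficient are stand-ins, and the function names `subject_hash_partition` and `gini_imbalance` are hypothetical.

```python
import zlib


def subject_hash_partition(triples, k):
    """Assign each (s, p, o) triple to one of k partitions by hashing its subject.

    Illustrative stand-in only; not necessarily a technique evaluated in the chapter.
    """
    partitions = [[] for _ in range(k)]
    for s, p, o in triples:
        # crc32 is deterministic across runs, unlike Python's built-in hash() for str.
        idx = zlib.crc32(s.encode("utf-8")) % k
        partitions[idx].append((s, p, o))
    return partitions


def gini_imbalance(sizes):
    """Gini coefficient of partition sizes (assumed measure).

    0.0 means all partitions have equal size; (n - 1) / n means one
    partition holds everything.
    """
    n = len(sizes)
    total = sum(sizes)
    if n == 0 or total == 0:
        return 0.0
    ascending = sorted(sizes)
    # G = 2 * sum(i * x_i) / (n * sum(x)) - (n + 1) / n, with i 1-indexed.
    weighted = sum(i * x for i, x in enumerate(ascending, start=1))
    return 2.0 * weighted / (n * total) - (n + 1) / n


if __name__ == "__main__":
    # Tiny made-up data set, for demonstration only.
    triples = [
        ("ex:alice", "ex:knows", "ex:bob"),
        ("ex:alice", "ex:age", "30"),
        ("ex:bob", "ex:knows", "ex:carol"),
        ("ex:carol", "ex:age", "25"),
    ]
    parts = subject_hash_partition(triples, k=2)
    print([len(p) for p in parts], gini_imbalance([len(p) for p in parts]))
```

Hashing on the subject keeps all triples that share a subject in the same partition, which is why this kind of scheme is often used as a simple baseline; the price is that skewed subject frequencies translate directly into skewed partition sizes, which the imbalance score makes visible.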

Metadata

Title: An Empirical Evaluation of RDF Graph Partitioning Techniques
Authors: Adnan Akhter, Axel-Cyrille Ngomo Ngonga, Muhammad Saleem
Copyright Year: 2018
DOI: https://doi.org/10.1007/978-3-030-03667-6_1
