
2016 | OriginalPaper | Book chapter

TripleID: A Low-Overhead Representation and Querying Using GPU for Large RDFs

Written by: Chantana Chantrapornchai, Chidchanok Choksuchat, Michael Haidl, Sergei Gorlatch

Published in: Beyond Databases, Architectures and Structures. Advanced Technologies for Data Mining and Knowledge Discovery

Publisher: Springer International Publishing

Abstract

Resource Description Framework (RDF) is a commonly used format for semantic web processing. It consists of strings representing terms and their relationships, which can be queried or used for inference. An RDF dataset is usually a large text file containing many millions of relationships. In this work, we propose a framework, TripleID, for processing queries over large RDF data. The framework uses Graphics Processing Units (GPUs) to search RDF relations. The RDF data is first transformed into an encoded form suitable for storing in GPU memory; parallel threads on the GPU then search for the required data. Our experiments show that one GPU on a personal desktop can handle 100 million triple relations, while a traditional RDF processing tool can process up to 10 million triples. Furthermore, the GPU answers sample queries over 7 million triples within 0.18 s, while the traditional tool takes at least 6 s for 1.8 million triples.
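
The abstract describes two steps: encoding the RDF strings into a compact form, and searching that form with parallel threads. The sketch below illustrates the general idea only and is not the authors' implementation. It assumes a single shared dictionary that maps each term to a positive integer ID, and it assumes that ID 0 acts as a wildcard in a query pattern; the names `encode`, `match`, and `WILDCARD` are hypothetical. The scan runs sequentially here, whereas on a GPU each thread would test one triple (or a slice of triples) using the same comparison.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]
IdTriple = Tuple[int, int, int]

WILDCARD = 0  # assumed convention: ID 0 in a pattern matches any term


def encode(triples: List[Triple]) -> Tuple[Dict[str, int], List[IdTriple]]:
    """Replace every term string with a small integer ID (assumed scheme)."""
    ids: Dict[str, int] = {}

    def term_id(term: str) -> int:
        # IDs start at 1 so that 0 stays free for the wildcard.
        if term not in ids:
            ids[term] = len(ids) + 1
        return ids[term]

    encoded = [(term_id(s), term_id(p), term_id(o)) for s, p, o in triples]
    return ids, encoded


def match(encoded: List[IdTriple], pattern: IdTriple) -> List[int]:
    """Return the indices of triples that match the pattern.

    The test inside the comprehension is the per-triple work that a
    GPU thread would perform; here it is executed one triple at a time.
    """
    ps, pp, po = pattern
    return [
        i
        for i, (s, p, o) in enumerate(encoded)
        if ps in (WILDCARD, s) and pp in (WILDCARD, p) and po in (WILDCARD, o)
    ]


if __name__ == "__main__":
    data = [
        ("alice", "knows", "bob"),
        ("bob", "knows", "carol"),
        ("alice", "likes", "carol"),
    ]
    ids, enc = encode(data)
    # Find every triple whose predicate is "knows", with any subject and object.
    print(match(enc, (WILDCARD, ids["knows"], WILDCARD)))
```

Comparing fixed-width integers instead of strings keeps each triple small and makes the per-triple test independent of its neighbours, which is what makes this kind of scan easy to spread across many threads.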

Metadata
Title
TripleID: A Low-Overhead Representation and Querying Using GPU for Large RDFs
Authors
Chantana Chantrapornchai
Chidchanok Choksuchat
Michael Haidl
Sergei Gorlatch
Copyright year
2016
DOI
https://doi.org/10.1007/978-3-319-34099-9_31