Published in: Information Systems Frontiers 4/2018

26.09.2016

Practical parallel string matching framework for RDF entailments with GPUs

Abstract

Resource Description Framework (RDF) is a commonly used format for semantic web processing. RDF data consist of strings that represent items and the relationships between them, which can be queried or used for inference. In this paper, we propose a framework for processing large RDF data sets, based on brute-force string matching on GPUs (BFG). Graphics Processing Units (GPUs) serve as a parallel platform on which thousands of threads search the RDF data. The search algorithm is customized to suit the nature of RDF processing and the GPU memory architecture, and is then integrated into the proposed framework for computing queries and chaining rules over RDF data. Experiments show that these algorithms achieve a speedup of 7 times for querying and for forward chaining compared with the sequential version. The proposed framework achieves a string comparison rate of 67,000 comparisons per second using 2 GPUs.
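
The idea of brute-force matching over RDF strings, followed by forward chaining, can be illustrated with a minimal CPU-side sketch. It is not the authors' GPU implementation: CPU worker threads only stand in for GPU threads, and the toy triples, the function names (`brute_force_equal`, `match_chunk`, `parallel_query`, `forward_chain_subclass`), and the choice of the `rdfs:subClassOf` transitivity rule are all assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy data: each triple is (subject, predicate, object) as strings.
TRIPLES = [
    ("ex:Cat", "rdfs:subClassOf", "ex:Mammal"),
    ("ex:Mammal", "rdfs:subClassOf", "ex:Animal"),
    ("ex:Tom", "rdf:type", "ex:Cat"),
]


def brute_force_equal(a, b):
    """Compare two strings character by character, with no index or hashing."""
    if len(a) != len(b):
        return False
    for x, y in zip(a, b):
        if x != y:
            return False
    return True


def match_chunk(chunk, pattern):
    """Scan one chunk of triples; a None field in the pattern is a wildcard."""
    return [
        t for t in chunk
        if all(p is None or brute_force_equal(f, p) for f, p in zip(t, pattern))
    ]


def parallel_query(triples, pattern, workers=4):
    """Split the data into chunks and scan each chunk in its own worker."""
    size = max(1, (len(triples) + workers - 1) // workers)
    chunks = [triples[i:i + size] for i in range(0, len(triples), size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = pool.map(match_chunk, chunks, [pattern] * len(chunks))
        return [t for part in parts for t in part]


def forward_chain_subclass(triples):
    """Apply rdfs:subClassOf transitivity repeatedly until nothing new appears."""
    known = set(triples)
    while True:
        data = list(known)
        new = set()
        for a, _, b in parallel_query(data, (None, "rdfs:subClassOf", None)):
            for _, _, c in parallel_query(data, (b, "rdfs:subClassOf", None)):
                derived = (a, "rdfs:subClassOf", c)
                if derived not in known:
                    new.add(derived)
        if not new:
            return known
        known |= new
```

Calling `parallel_query(TRIPLES, (None, "rdf:type", None))` scans every triple for the given predicate, and `forward_chain_subclass(TRIPLES)` adds the inferred triple `("ex:Cat", "rdfs:subClassOf", "ex:Animal")` to the original three. On a GPU the same decomposition would assign slices of the string data to thousands of threads, which this sketch does not attempt to reproduce.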

Footnotes
1
With our hardware, transferring each half of the RDF data and keywordArray to 2 GPUs costs 10-20 times the time required to transfer one dataArray and one keywordArray to one GPU, due to the bottleneck on the PCI bus of our mainboard.
Metadata
Title
Practical parallel string matching framework for RDF entailments with GPUs
Publication date
26.09.2016
Published in
Information Systems Frontiers / Issue 4/2018
Print ISSN: 1387-3326
Electronic ISSN: 1572-9419
DOI
https://doi.org/10.1007/s10796-016-9692-4
