Skip to main content
Top

2014 | OriginalPaper | Chapter

Distributed RDFS Reasoning with MapReduce

Authors : Yigit Cetin, Osman Abul

Published in: Information Sciences and Systems 2014

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We live in big data age in which many computational tasks either generate or need to use large datasets. This makes parallel and distributed computing a key for scalability. MapReduce is a programming model for processing large datasets in parallel and distributed fashion on cluster of computers. Today, since the size and complexity of RDFS documents increase rapidly, RDFS reasoning problem has to embrace and address the big data solutions. The output of RDFS reasoning job can be input to another job and the output of RDFS reasoning jobs grow big as the input documents gets bigger. In this study, an indexing method is proposed to speed up the RDFS reasoning over Hadoop clusters. We also explore the utility of caching and Hadoop ecosystem tools Apache Hive and Apache Pig for this task. Experimental evaluations on Dbpedia and Freebase datasets show that the indexing method is quite effective and offers scalable solutions. Performance of caching and Apache Hive is found acceptable too.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference T. Berners-Lee, J. Hendler, O. Lassila, The semantic web. Sci. Am. 284(5), 28–37 (2001)CrossRef T. Berners-Lee, J. Hendler, O. Lassila, The semantic web. Sci. Am. 284(5), 28–37 (2001)CrossRef
2.
go back to reference J. Weaver, J.A. Hendler, Parallel materialization of the finite rdfs closure for hundreds of millions of triples. in: Proceedings of the 8th International Semantic Web Conference (ISWC 2009), pp. 682–697, Springer (2009) J. Weaver, J.A. Hendler, Parallel materialization of the finite rdfs closure for hundreds of millions of triples. in: Proceedings of the 8th International Semantic Web Conference (ISWC 2009), pp. 682–697, Springer (2009)
3.
go back to reference M. Husain, L. Khan, M. Kantarcioglu, B. Thuraisingham, Data intensive query processing for large RDF graphs using cloud computing tools. in: Proceedings of the IEEE 3rd International Conference on Cloud Computing (CLOUD 2010), pp. 1–10, (2010) M. Husain, L. Khan, M. Kantarcioglu, B. Thuraisingham, Data intensive query processing for large RDF graphs using cloud computing tools. in: Proceedings of the IEEE 3rd International Conference on Cloud Computing (CLOUD 2010), pp. 1–10, (2010)
5.
go back to reference S.G.J. Dean,Mapreduce: simplified data processing on large clusters. in 6th Symposium on Operating Systems Design and Implementation (OSDI 2004), (2004) S.G.J. Dean,Mapreduce: simplified data processing on large clusters. in 6th Symposium on Operating Systems Design and Implementation (OSDI 2004), (2004)
8.
go back to reference A. Thusoo, J.S. Sarma, N. Jain, Z. Shao, P. Chakka, N. Zhang, R. Murthy, Hive-a petabyte scale data warehouse using Hadoop. in: Proceedings of the IEEE 26th International Conference on Data Engineering (ICDE 2010), pp. 996–1005, (2010) A. Thusoo, J.S. Sarma, N. Jain, Z. Shao, P. Chakka, N. Zhang, R. Murthy, Hive-a petabyte scale data warehouse using Hadoop. in: Proceedings of the IEEE 26th International Conference on Data Engineering (ICDE 2010), pp. 996–1005, (2010)
9.
go back to reference P. Papailiou, I. Konstantinou, D. Tsoumakos, P. Karras, N. Koziris, H2RDF+: High-performance distributed joins over large-scale RDF graphs. in: Proceedings of the IEEE International Conference on Big Data, pp. 255–263, (2013) P. Papailiou, I. Konstantinou, D. Tsoumakos, P. Karras, N. Koziris, H2RDF+: High-performance distributed joins over large-scale RDF graphs. in: Proceedings of the IEEE International Conference on Big Data, pp. 255–263, (2013)
10.
go back to reference S. Jianling, J. Qiang, Scalable RDF Store Based on HBase and MapReduce. in: Proceedings of the 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE 2010), pp. 633–636, (2010) S. Jianling, J. Qiang, Scalable RDF Store Based on HBase and MapReduce. in: Proceedings of the 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE 2010), pp. 633–636, (2010)
11.
go back to reference D. Brickley, R.V. Guha (eds.), RDF Vocabulary Description Language 1.0: RDF Schema. W3C Recommendation, (2004) D. Brickley, R.V. Guha (eds.), RDF Vocabulary Description Language 1.0: RDF Schema. W3C Recommendation, (2004)
12.
go back to reference J. Urbani, S. Kotoulas, E. Oren, F. Van Harmelen, Scalable distributed reasoning using mapreduce. in: Proceedings of the 8th International Semantic Web Conference (ISWC 2009), pp. 634–649, Springer (2009) J. Urbani, S. Kotoulas, E. Oren, F. Van Harmelen, Scalable distributed reasoning using mapreduce. in: Proceedings of the 8th International Semantic Web Conference (ISWC 2009), pp. 634–649, Springer (2009)
13.
go back to reference T. White, Hadoop The Definitive Guide (O’Reilly Media/Yahoo Press, Sebastopol, 2012) T. White, Hadoop The Definitive Guide (O’Reilly Media/Yahoo Press, Sebastopol, 2012)
14.
go back to reference Y. Zhanga, T. Chenb, W. Youc, J. Yud, J. Sune, H. Chenf, A new efficient semantic web platform based on the Solr, SIREn and RDF. in: Proceedings of the International Conference on Information Engineering (2012) Y. Zhanga, T. Chenb, W. Youc, J. Yud, J. Sune, H. Chenf, A new efficient semantic web platform based on the Solr, SIREn and RDF. in: Proceedings of the International Conference on Information Engineering (2012)
Metadata
Title
Distributed RDFS Reasoning with MapReduce
Authors
Yigit Cetin
Osman Abul
Copyright Year
2014
DOI
https://doi.org/10.1007/978-3-319-09465-6_32

Premium Partner