Abstract
The Resource Description Framework (RDF) is a flexible model for representing information about resources in the web. With the increasing amount of RDF data which is becoming available, efficient and scalable management of RDF data has become a fundamental challenge to achieve the SemanticWeb vision. The RDF model has attracted the attention of the database community and many researchers have proposed different solutions to store and query RDF data efficiently. This survey focuses on using relational query processors to store and query RDF data. We provide an overview of the different approaches and classify them according to their storage and query evaluation strategies.
- Daniel J. Abadi, Adam Marcus, Samuel Madden, and Kate Hollenbach. SW-Store: a vertically partitioned DBMS for Semantic Web data management. VLDB Journal, 18(2):385--406, 2009. Google ScholarDigital Library
- Sofia Alexaki, Vassilis Christophides, Gregory Karvounarakis, Dimitris Plexousakis, and Karsten Tolle. The ICS-FORTH RDFSuite: Managing Voluminous RDF Description Bases. In Proceedings of the 2nd InternationalWorkshop on the Semantic Web (SemWeb), 2001.Google Scholar
- Jennifer L. Beckmann, Alan Halverson, Rajasekar Krishnamurthy, and Jeffrey F. Naughton. Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format. In Proceedings of the 22nd International Conference on Data Engineering (ICDE), page 58, 2006. Google ScholarDigital Library
- Christian Bizer and Andreas Schultz. Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints. In Proceedings of the 4th International Workshop on Scalable Semantic Web knowledge Base Systems (SSWS)., 2008.Google Scholar
- Viorica Botea, Daniel Mallett, Mario A. Nascimento, and Jörg Sander. PIST: An Efficient and Practical Indexing Technique for Historical Spatio-Temporal Point Data. GeoInformatica, 12(2):143--168, 2008. Google ScholarDigital Library
- Jeen Broekstra, Arjohn Kampman, and Frank van Harmelen. Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema. In Proceedings of the First International Semantic Web Conference(ISWC), pages 54--68, 2002. Google ScholarDigital Library
- Surajit Chaudhuri and Gerhard Weikum. Rethinking Database System Architecture: Towards a Self-Tuning RISC-Style Database System. In Proceedings of 26th International Conference on Very Large Data Bases (VLDB), pages 1--10, 2000. Google ScholarDigital Library
- Eugene Inseok Chong, Souripriya Das, George Eadon, and Jagannathan Srinivasan. An Efficient SQL-based RDF Querying Scheme. In Proceedings of the 31st International Conference on Very Large Data Bases (VLDB), pages 1216--1227, 2005. Google ScholarDigital Library
- Eric Chu, Jennifer L. Beckmann, and Jeffrey F. Naughton. The case for a wide-table approach to manage sparse relational data sets. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 821--832, 2007. Google ScholarDigital Library
- George P. Copeland and Setrag Khoshafian. A Decomposition Storage Model. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 268--279, 1985. Google ScholarDigital Library
- Torsten Grust, Sherif Sakr, and Jens Teubner. XQuery on SQL Hosts. In Proceedings of the Thirtieth International Conference on Very Large Data Bases (VLDB), pages 252--263, 2004. Google ScholarDigital Library
- Stephen Harris and Nicholas Gibbins. 3store: Efficient Bulk RDF Storage. In Proceedings of the First International Workshop on Practical and Scalable Semantic Systems (PSSS), 2003.Google Scholar
- Andreas Harth and Stefan Decker. Optimized Index Structures for Querying RDF from the Web. In Proceedings of the Third Latin American Web Congress (LA-WEB), pages 71--80, 2005. Google ScholarDigital Library
- Shawn R. Jeffery, Michael J. Franklin, and Alon Y. Halevy. Pay-as-you-go user feedback for dataspace systems. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 847--860, 2008. Google ScholarDigital Library
- Justin J. Levandoski and Mohamed F. Mokbel. RDF Data-Centric Storage. In Proceedings of the IEEE International Conference on Web Services (ICWS), 2009. Google ScholarDigital Library
- Li Ma, Zhong Su, Yue Pan, Li Zhang, and Tao Liu. RStar: an RDF storage and query system for enterprise resource management. In Proceedings of the ACM International Conference on Information and Knowledge Management (CIKM), pages 484--491, 2004. Google ScholarDigital Library
- Frank Manola and Eric Miller. RDF Primer, W3C Recommendation, February 2004. http://www.w3.org/TR/REC-rdf-syntax/.Google Scholar
- Akiyoshi Matono, Toshiyuki Amagasa, Masatoshi Yoshikawa, and Shunsuke Uemura. A Path-based Relational RDF Database. In Proceedings of the 16th Australasian Database Conference (ADC), pages 95--103, 2005. Google ScholarDigital Library
- Brian McBride. Jena: A Semantic Web Toolkit. IEEE Internet Computing, 6(6):55--59, 2002. Google ScholarDigital Library
- Thomas Neumann and Gerhard Weikum. RDF-3X: a RISC-style engine for RDF. Proceedings of the VLDB Endownment (PVLDB), 1(1):647--659, 2008. Google ScholarDigital Library
- Thomas Neumann and Gerhard Weikum. Scalable join processing on very large RDF graphs. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 627--640, 2009. Google ScholarDigital Library
- Eric Prud'hommeaux and Andy Seaborne. SPARQL Query Language for RDF, W3C Recommendation, January 2008. http://www.w3.org/TR/rdf-sparql-query/.Google Scholar
- Michael Schmidt, Thomas Hornung, Norbert Küchlin, Georg Lausen, and Christoph Pinkel. An Experimental Comparison of RDF Data Management Approaches in a SPARQL Benchmark Scenario. In Proceedings of the 7th International Semantic Web Conference (ISWC), pages 82--97, 2008. Google ScholarDigital Library
- Michael Schmidt, Thomas Hornung, Georg Lausen, and Christoph Pinkel. SP2Bench: A SPARQL Performance Benchmark. In Proceedings of the 25th International Conference on Data Engineering (ICDE), pages 222--233, 2009. Google ScholarDigital Library
- Lefteris Sidirourgos, Romulo Goncalves, Martin L. Kersten, Niels Nes, and Stefan Manegold. Column-store support for RDF data management: not all swans are white. Proceedings of the VLDB Endownment (PVLDB), 1(2):1553--1563, 2008. Google ScholarDigital Library
- Michael Stonebraker, Daniel J. Abadi, Adam Batkin, Xuedong Chen, Mitch Cherniack, Miguel Ferreira, Edmond Lau, Amerson Lin, Samuel Madden, Elizabeth J. O'Neil, Patrick E. O'Neil, Alex Rasin, Nga Tran, and Stanley B. Zdonik. C-Store: A Column-oriented DBMS. In Proceedings of the 31st International Conference on Very Large Data Bases (VLDB), pages 553--564, 2005. Google ScholarDigital Library
- Can Türker and Michael Gertz. Semantic integrity support in SQL: 1999 and commercial (object-)relational database management systems. VLDB Journal, 10(4):241--269, 2001. Google ScholarDigital Library
- Cathrin Weiss, Panagiotis Karras, and Abraham Bernstein. Hexastore: sextuple indexing for semantic web data management. Proceedings of the VLDB Endownment (PVLDB), 1(1):1008--1019, 2008. Google ScholarDigital Library
Index Terms
- Relational processing of RDF queries: a survey
Recommendations
Towards distributed processing of RDF path queries
A technical infrastructure for storing, querying and managing RDF data is a key element in the current semantic web development. Systems like Jena, Sesame or the ICS-FORTH RDF Suite are widely used for building semantic web applications. Currently, none ...
Processing SPARQL queries with regular expressions in RDF databases
DTMBIO '10: Proceedings of the ACM fourth international workshop on Data and text mining in biomedical informaticsAs the Resource Description Framework (RDF) data model is widely used for modeling and sharing a lot of online bioinformatics resources such as Uniprot (dev.isb-sib.ch/projects/uniprot-rdf) or Bio2RDF (bio2rdf.org), SPARQL -- a W3C recommendation query ...
Rewriting queries using view for RDF/RDFS-Based relational data integration
ICDCIT'05: Proceedings of the Second international conference on Distributed Computing and Internet TechnologyWe study the problem of answering queries through a target RDF-based ontology, given a set of view-based mappings between one or more source relational schemas and this target ontology. Particularly, we consider a set of RDFS semantic constraints such ...
Comments