skip to main content
research-article

x-RDF-3X: fast querying, high update rates, and consistency for RDF databases

Published:01 September 2010Publication History
Skip Abstract Section

Abstract

The RDF data model is gaining importance for applications in computational biology, knowledge sharing, and social communities. Recent work on RDF engines has focused on scalable performance for querying, and has largely disregarded updates. In addition to incremental bulk loading, applications also require online updates with flexible control over multi-user isolation levels and data consistency. The challenge lies in meeting these requirements while retaining the capability for fast querying.

This paper presents a comprehensive solution that is based on an extended deferred-indexing method with integrated versioning. The version store enables time-travel queries that are efficiently processed without adversely affecting queries on the current data. For flexible consistency, transactional concurrency control is provided with options for either snapshot isolation or full serializability. All methods are integrated in an extension of the RDF-3X system, and their very good performance for both queries and updates is demonstrated by measurements of multi-user workloads with real-life data as well as stress-test synthetic loads.

References

  1. D. J. Abadi et al. Scalable semantic web data management using vertical partitioning. In VLDB, pages 411--422, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Adya, B. Liskov, and P. E. O'Neil. Generalized isolation level definitions. In ICDE, pages 67--78, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Bröcheler, A. Pugliese, and V. S. Subrahmanian. Dogma: A disk-oriented graph matching algorithm for rdf databases. In International Semantic Web Conference, pages 97--113, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. E. I. Chong et al. An efficient SQL-based RDF querying scheme. In VLDB, pages 1216--1227, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. T. Cormen, C. Leiserson, R. Rivest, and C. Stein. Introduction to Algorithms. MIT Press, 2001. 2nd Edition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. G. Graefe. Hierarchical locking in b-tree indexes. In BTW, pages 18--42, 2007.Google ScholarGoogle Scholar
  7. A. Harth et al. YARS2: A federated repository for querying graph structured data from the web. In ISWC/ASWC, pages 211--224, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Jain. The Art of Computer Systems Performance Analysis. Wiley, 1991.Google ScholarGoogle Scholar
  9. Jena: a Semantic Web Framework for Java. http://jena.sourceforge.net/.Google ScholarGoogle Scholar
  10. D. B. Lomet et al. Transaction time support inside a database engine. In ICDE, page 35, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. B. Lomet and B. Salzberg. The performance of a multiversion access method. In SIGMOD, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. Neumann and G. Weikum. RDF-3X: a RISC-style engine for RDF. PVLDB, 1(1):647--659, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. T. Neumann and G. Weikum. The RDF-3X engine for scalable management of RDF data. VLDB J., 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. T. Neumann and G. Weikum. Scalable join processing on very large RDF graphs. In SIGMOD, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. P. E. O'Neil et al. The log-structured merge-tree (lsm-tree). Acta Inf., 33(4):351--385, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. Pugliese, O. Udrea, and V. S. Subrahmanian. Scaling rdf with time. In WWW, pages 605--614, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. RDF-3X. http://www.mpi-inf.mpg.de/~neumann/rdf3x.Google ScholarGoogle Scholar
  18. D. P. Reed. Implementing atomic actions on decentralized data. TOCS, 1(1):3--23, 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. B. Salzberg and V. J. Tsotras. Comparison of access methods for time-evolving data. ACM Comput. Surv., 31(2):158--221, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. L. Sidirourgos et al. Column-store support for RDF data management: not all swans are white. PVLDB, 1(2):1553--1563, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. Stonebraker. The design of the postgres storage system. In VLDB, pages 289--300, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. O. Udrea, A. Pugliese, and V. S. Subrahmanian. GRIN: A graph based RDF index. In AAAI, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. W3C: Resource Description Framework (RDF). http://www.w3.org/RDF/.Google ScholarGoogle Scholar
  24. G. Weikum and G. Vossen. Transactional Information Systems: Theory, Algorithms, and the Practice of Concurrency Control and Recovery. Morgan Kaufmann, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. C. Weiss, P. Karras, and A. Bernstein. Hexastore: sextuple indexing for semantic web data management. PVLDB, 1(1):1008--1019, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. K. Wilkinson et al. Efficient RDF storage and retrieval in Jena2. In SWDB, pages 131--150, 2003.Google ScholarGoogle Scholar
  27. Yars2. http://sw.deri.org/svn/sw/2004/06/yars.Google ScholarGoogle Scholar

Index Terms

  1. x-RDF-3X: fast querying, high update rates, and consistency for RDF databases
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image Proceedings of the VLDB Endowment
        Proceedings of the VLDB Endowment  Volume 3, Issue 1-2
        September 2010
        1658 pages

        Publisher

        VLDB Endowment

        Publication History

        • Published: 1 September 2010
        Published in pvldb Volume 3, Issue 1-2

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader