DOI: 10.1145/1999299.1999306
research-article

What are real SPARQL queries like?

Published: 12 June 2011

ABSTRACT

We present statistics on real-world SPARQL queries that may be of interest for building SPARQL query processing engines and benchmarks. In particular, we analyze the syntactical structure of queries in a log of about 3 million queries, harvested from the DBpedia SPARQL endpoint. Although a sizable portion of the log is shown to consist of so-called conjunctive SPARQL queries, non-conjunctive queries that use SPARQL's union or optional operators also account for a substantial share. It is known, however, that query evaluation quickly becomes hard for queries that include the non-conjunctive operators union or optional. We therefore drill deeper into the syntactical structure of the queries that are not conjunctive and show that in 50% of the cases, these queries satisfy certain structural restrictions that imply tractable evaluation in theory. We hope that the identification of these restrictions can aid in the future development of practical heuristics for processing non-conjunctive SPARQL queries.
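To make the distinction between conjunctive and non-conjunctive queries concrete, the following is a minimal Python sketch of how a query log could be bucketed by the presence of the union and optional operators. It is not the authors' analysis pipeline: the function name `classify`, the regular expressions, and the sample queries are all assumptions made for illustration. The sketch is a keyword heuristic rather than a SPARQL parser, it does not handle `#` comments, and it treats every query without UNION or OPTIONAL as conjunctive, which may be looser than the definition used in the paper.

```python
import re
from collections import Counter

# Heuristic patterns (assumptions, not a SPARQL grammar): blank out IRIs and
# string literals so that text inside them cannot be mistaken for a keyword.
_IRI = re.compile(r'<[^<>\s]*>')
_STRING = re.compile(r'"(?:[^"\\]|\\.)*"|\'(?:[^\'\\]|\\.)*\'')
_OPERATOR = re.compile(r'\b(UNION|OPTIONAL)\b', re.IGNORECASE)


def classify(query: str) -> str:
    """Return 'conjunctive', 'optional', 'union', or 'optional+union'."""
    stripped = _STRING.sub(' ', _IRI.sub(' ', query))
    found = {m.group(1).lower() for m in _OPERATOR.finditer(stripped)}
    if not found:
        return "conjunctive"
    return "+".join(sorted(found))


# Hypothetical sample queries standing in for lines of a query log.
sample_log = [
    'SELECT ?s WHERE { ?s a <http://dbpedia.org/ontology/Person> . ?s ?p ?o }',
    'SELECT ?s ?n WHERE { ?s a ?t OPTIONAL { ?s <http://xmlns.com/foaf/0.1/name> ?n } }',
    'SELECT ?s WHERE { { ?s ?p "union" } UNION { ?s ?q "x" } }',
    'SELECT ?s WHERE { ?s ?p "optional" }',
]

counts = Counter(classify(q) for q in sample_log)
for label, n in sorted(counts.items()):
    print(f"{label}: {n}")
```

A real study of this kind would parse each query into its algebra tree before classifying it, since keyword matching cannot see nesting and so cannot check structural restrictions of the kind the abstract mentions.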


Published in

SWIM '11: Proceedings of the International Workshop on Semantic Web Information Management
June 2011
61 pages
ISBN: 9781450306515
DOI: 10.1145/1999299

Copyright © 2011 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery
New York, NY, United States
