Skip to main content

2015 | OriginalPaper | Buchkapitel

GraSS: An Efficient Method for RDF Subgraph Matching

verfasst von : Xuedong Lyu, Xin Wang, Yuan-Fang Li, Zhiyong Feng, Junhu Wang

Erschienen in: Web Information Systems Engineering – WISE 2015

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Resource Description Framework (RDF) is a standard data model of the Semantic Web, and it has been widely adopted in various domains in recent years for data and knowledge representation. Unlike queries on relational databases, most of queries applied on RDF data are known as graph queries, expressed in the SPARQL language. Subgraph matching, a basic SPARQL operation, is known to be NP-complete. Coupled with the rapidly increasing volumes of RDF data, it makes efficient graph query processing a very challenging problem. This paper primarily focuses on providing an index scheme and corresponding algorithms that support the efficient solution of such queries. We present a subgraph matching query engine based on the FFD-index which is an indexing mechanism encoding a star subgraph into a bit string. A SPARQL query graph is decomposed into several star query subgraphs which can be efficiently processed benefiting from succinct FFD-index data structure. Extensive evaluation shows that our approach outperforms RDF-3X and gStore on solving subgraph matching.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semant. Web Inf. Syst. 5(3), 1–22 (2009)CrossRef Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semant. Web Inf. Syst. 5(3), 1–22 (2009)CrossRef
2.
Zurück zum Zitat Bunke, H.: Graph matching: theoretical foundations, algorithms, and applications. In: Proceedings of Vision Interface, pp. 82–88 (2000) Bunke, H.: Graph matching: theoretical foundations, algorithms, and applications. In: Proceedings of Vision Interface, pp. 82–88 (2000)
3.
Zurück zum Zitat Cheng, J., Ke, Y., Ng, W., Lu, A.: Fg-index: towards verification-free query processing on graph databases. In: Proceedings of SIGMOD, pp. 857–872. ACM (2007) Cheng, J., Ke, Y., Ng, W., Lu, A.: Fg-index: towards verification-free query processing on graph databases. In: Proceedings of SIGMOD, pp. 857–872. ACM (2007)
4.
Zurück zum Zitat Conte, D., Foggia, P., Sansone, C., Vento, M.: Thirty years of graph matching in pattern recognition. Int. J. Pattern Recogn. Artif. Intell. 18(03), 265–298 (2004)CrossRef Conte, D., Foggia, P., Sansone, C., Vento, M.: Thirty years of graph matching in pattern recognition. Int. J. Pattern Recogn. Artif. Intell. 18(03), 265–298 (2004)CrossRef
5.
Zurück zum Zitat Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A (sub) graph isomorphism algorithm for matching large graphs. IEEE Trans. Pattern Anal. Mach. Intell. 26(10), 1367–1372 (2004)CrossRef Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A (sub) graph isomorphism algorithm for matching large graphs. IEEE Trans. Pattern Anal. Mach. Intell. 26(10), 1367–1372 (2004)CrossRef
6.
Zurück zum Zitat Fortin, S.: The Graph Isomorphism Problem. Technical Report, University of Alberta, Canada (1996) Fortin, S.: The Graph Isomorphism Problem. Technical Report, University of Alberta, Canada (1996)
7.
Zurück zum Zitat Harris, S., Seaborne, A.: SPARQL 1.1 Query Language. W3C Recommendation (2013) Harris, S., Seaborne, A.: SPARQL 1.1 Query Language. W3C Recommendation (2013)
8.
Zurück zum Zitat Kim, H., Ravindra, P., Anyanwu, K.: From SPARQL to mapreduce: the journey using a nested triplegroup algebra. Proc. VLDB Endow. 4(12), 1426–1429 (2011) Kim, H., Ravindra, P., Anyanwu, K.: From SPARQL to mapreduce: the journey using a nested triplegroup algebra. Proc. VLDB Endow. 4(12), 1426–1429 (2011)
9.
Zurück zum Zitat Klyne, G., Carroll, J.J., McBride, B.: RDF 1.1 Concepts and Abstract Syntax. W3C Recommendation (2014) Klyne, G., Carroll, J.J., McBride, B.: RDF 1.1 Concepts and Abstract Syntax. W3C Recommendation (2014)
10.
Zurück zum Zitat Michael, R.G., David, S.J.: Computers and Intractability: A Guide to the Theory of NP-Completeness. WH Freeman Co., San Francisco (1979)MATH Michael, R.G., David, S.J.: Computers and Intractability: A Guide to the Theory of NP-Completeness. WH Freeman Co., San Francisco (1979)MATH
11.
Zurück zum Zitat Neumann, T., Weikum, G.: RDF-3X: a RISC-style engine for RDF. Proc. VLDB Endow. 1(1), 647–659 (2008)CrossRef Neumann, T., Weikum, G.: RDF-3X: a RISC-style engine for RDF. Proc. VLDB Endow. 1(1), 647–659 (2008)CrossRef
12.
Zurück zum Zitat Papailiou, N., Tsoumakos, D., Konstantinou, I., Karras, P., Koziris, N.: H2RDF+: An efficient data management system for big RDF graphs. In: Proceedings of SIGMOD, pp. 909–912. ACM (2014) Papailiou, N., Tsoumakos, D., Konstantinou, I., Karras, P., Koziris, N.: H2RDF+: An efficient data management system for big RDF graphs. In: Proceedings of SIGMOD, pp. 909–912. ACM (2014)
13.
Zurück zum Zitat Shasha, D., Wang, J.T., Giugno, R.: Algorithmics and applications of tree and graph searching. In: Proceedings of PODS, pp. 39–52. ACM (2002) Shasha, D., Wang, J.T., Giugno, R.: Algorithmics and applications of tree and graph searching. In: Proceedings of PODS, pp. 39–52. ACM (2002)
14.
Zurück zum Zitat Udrea, O., Pugliese, A., Subrahmanian, V.S.: GRIN: a graph based RDF index. In: Proceedings of the National Conference on Artificial Intelligence, vol. 22 (2007) Udrea, O., Pugliese, A., Subrahmanian, V.S.: GRIN: a graph based RDF index. In: Proceedings of the National Conference on Artificial Intelligence, vol. 22 (2007)
16.
Zurück zum Zitat Weiss, C., Karras, P., Bernstein, A.: Hexastore: sextuple indexing for semantic web data management. Proc. VLDB Endow. 1(1), 1008–1019 (2008)CrossRef Weiss, C., Karras, P., Bernstein, A.: Hexastore: sextuple indexing for semantic web data management. Proc. VLDB Endow. 1(1), 1008–1019 (2008)CrossRef
17.
Zurück zum Zitat Lyu, X., Wang, X., Li, Y.-F., Feng, Z.: FFD-Index: an efficient indexing scheme for star subgraph matching on large RDF graphs. In: Liu, A., Ishikawa, Y., Qian, T., Nutanong, S., Cheema, M.A. (eds.) DASFAA 2015 Workshops. LNCS, vol. 9052, pp. 240–245. Springer, Heidelberg (2015) CrossRef Lyu, X., Wang, X., Li, Y.-F., Feng, Z.: FFD-Index: an efficient indexing scheme for star subgraph matching on large RDF graphs. In: Liu, A., Ishikawa, Y., Qian, T., Nutanong, S., Cheema, M.A. (eds.) DASFAA 2015 Workshops. LNCS, vol. 9052, pp. 240–245. Springer, Heidelberg (2015) CrossRef
18.
Zurück zum Zitat Yan, X., Yu, P.S., Han, J.: Graph indexing: a frequent structure-based approach. In: Proceedings of SIGMOD, pp. 335–346. ACM (2004) Yan, X., Yu, P.S., Han, J.: Graph indexing: a frequent structure-based approach. In: Proceedings of SIGMOD, pp. 335–346. ACM (2004)
19.
Zurück zum Zitat Zhang, S., Li, S., Yang, J.: GADDI: Distance index based subgraph matching in biological networks. In: Proceedings of EDBT, pp. 192–203. ACM (2009) Zhang, S., Li, S., Yang, J.: GADDI: Distance index based subgraph matching in biological networks. In: Proceedings of EDBT, pp. 192–203. ACM (2009)
20.
Zurück zum Zitat Zou, L., Chen, L., Yu, J. X., Lu, Y.: A novel spectral coding in a large graph database. In: Proceedings of EDBT, pp. 181–192. ACM (2008) Zou, L., Chen, L., Yu, J. X., Lu, Y.: A novel spectral coding in a large graph database. In: Proceedings of EDBT, pp. 181–192. ACM (2008)
21.
Zurück zum Zitat Zou, L., Mo, J., Chen, L., Özsu, M.T., Zhao, D.: gStore: answering SPARQl queries via subgraph matching. Proc. VLDB Endow. 4(8), 482–493 (2011)CrossRef Zou, L., Mo, J., Chen, L., Özsu, M.T., Zhao, D.: gStore: answering SPARQl queries via subgraph matching. Proc. VLDB Endow. 4(8), 482–493 (2011)CrossRef
Metadaten
Titel
GraSS: An Efficient Method for RDF Subgraph Matching
verfasst von
Xuedong Lyu
Xin Wang
Yuan-Fang Li
Zhiyong Feng
Junhu Wang
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-26190-4_8

Premium Partner