Published in: Knowledge and Information Systems 1/2017

20 April 2016 | Regular Paper

Handling failing RDF queries: from diagnosis to relaxation

Authors: Géraud Fokou, Stéphane Jean, Allel Hadjali, Mickael Baron


Abstract

Recent years have witnessed the development of large knowledge bases (KBs). Because users lack information about the content and schema semantics of KBs, they are often unable to formulate KB queries that return the intended result. In this paper, we consider the problem of failing RDF queries, i.e., queries that return an empty set of answers. Query relaxation is one cooperative technique proposed to solve this problem. In the context of RDF data, several works have proposed query relaxation operators and ranking models for relaxed queries. However, none of them attempted to identify the causes of an RDF query's failure, given by its Minimal Failing Subqueries (MFSs), or the successful subqueries that have a maximal number of triple patterns, called Ma\(\underline{x}\)imal Succeeding Subqueries (XSSs). Inspired by previous work in the context of relational databases and recommender systems, we propose two complementary approaches to fill this gap. The lattice-based approach (LBA) leverages the theoretical properties of MFSs and XSSs to efficiently explore the subquery lattice of the failing query. The matrix-based approach computes a matrix that records alternative answers to the failing query together with the triple patterns they satisfy. The skyline of this matrix directly gives the XSSs of the failing query. This matrix can also be used as an index to improve the performance of LBA. The practical interest of these two approaches is shown via a set of experiments conducted on the LUBM benchmark and a comparative study with baseline and related work algorithms.
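The notions of MFS, XSS, and matrix skyline used in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names and the toy matrix are hypothetical, and the sketch assumes, as a simplification, that a subquery succeeds exactly when at least one recorded row satisfies all of its triple patterns. Each row stands for one alternative answer and each column for one triple pattern of the failing query.

```python
from itertools import combinations


def satisfied_sets(matrix):
    """Turn each 0/1 row into the set of triple-pattern indices it satisfies."""
    return [frozenset(i for i, ok in enumerate(row) if ok) for row in matrix]


def skyline_xss(matrix):
    """XSSs as the skyline: pattern sets not strictly contained in another row's set."""
    sets = set(satisfied_sets(matrix))
    return [s for s in sets if not any(s < other for other in sets)]


def minimal_failing_subqueries(n_patterns, matrix):
    """Enumerate subqueries bottom-up by size and keep the minimal failing ones.

    Assumption (hypothetical simplification): a subquery succeeds iff some
    row of the matrix satisfies all of its triple patterns.
    """
    sets = satisfied_sets(matrix)
    mfss = []
    for size in range(1, n_patterns + 1):
        for combo in combinations(range(n_patterns), size):
            q = frozenset(combo)
            if any(m <= q for m in mfss):
                continue  # contains a smaller MFS, so it fails but is not minimal
            if not any(q <= s for s in sets):
                mfss.append(q)  # fails, and every proper subquery succeeds
    return mfss


# Toy matrix for a failing query with four triple patterns t0..t3.
matrix = [
    [1, 1, 0, 1],  # answer a1 satisfies t0, t1, t3
    [1, 0, 1, 0],  # answer a2 satisfies t0, t2
    [0, 1, 0, 0],  # answer a3 satisfies t1 only (dominated by a1)
]
xss = skyline_xss(matrix)
mfs = minimal_failing_subqueries(4, matrix)
```

On this toy matrix, the skyline keeps the pattern sets {t0, t1, t3} and {t0, t2}, while the minimal failing subqueries are {t1, t2} and {t2, t3}. The brute-force enumeration is exponential in the number of triple patterns; it is meant only to make the definitions concrete, not to reflect the lattice exploration strategy of LBA.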

Footnotes
1. Numbers current as of February 2015.
2. For readability, we use names instead of URIs to identify the query elements.
3. Given a set of objects described by a list of criteria, a skyline is the subset of objects that are not dominated (in the Pareto sense) by any other object with respect to some criteria of interest.
4. To ensure that the subqueries of an MFS are successful, we define \([[\emptyset ]]_{D} \ne \emptyset \).
5. As the semantics of SPARQL is never null-rejecting, contrary to the relational algebra, this expression is not equivalent to: https://static-content.springer.com/image/art%3A10.1007%2Fs10115-016-0941-0/MediaObjects/10115_2016_941_IEq242_HTML.gif.
6. The coalesce function returns the first non-null expression in its list of parameters.
7. For readability, we shorten the URIs.
8. MFSs that are only poorly satisfied, i.e., that do not return any answer with a satisfaction degree of at least \(\alpha \) (a user-defined threshold).
Metadata
Title
Handling failing RDF queries: from diagnosis to relaxation
Authors
Géraud Fokou
Stéphane Jean
Allel Hadjali
Mickael Baron
Publication date
20 April 2016
Publisher
Springer London
Published in
Knowledge and Information Systems / Issue 1/2017
Print ISSN: 0219-1377
Electronic ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-016-0941-0
