Skip to main content
Top
Published in: The VLDB Journal 3/2017

27-01-2017 | Regular Paper

AutoG: a visual query autocompletion framework for graph databases

Authors: Peipei Yi, Byron Choi, Sourav S. Bhowmick, Jianliang Xu

Published in: The VLDB Journal | Issue 3/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Composing queries is evidently a tedious task. This is particularly true of graph queries as they are typically complex and prone to errors, compounded by the fact that graph schemas can be missing or too loose to be helpful for query formulation. Despite the great success of query formulation aids, in particular, automatic query completion, graph query autocompletion has received much less research attention. In this paper, we propose a novel framework for subgraph query autocompletion (called AutoG). Given an initial query q and a user’s preference as input, AutoG returns ranked query suggestions \(Q'\) as output. Users may choose a query from \(Q'\) and iteratively apply AutoG to compose their queries. The novelties of AutoG are as follows: First, we formalize query composition. Second, we propose to increment a query with the logical units called c-prime features that are (i) frequent subgraphs and (ii) constructed from smaller c-prime features in no more than c ways. Third, we propose algorithms to rank candidate suggestions. Fourth, we propose a novel index called feature Dag (FDag) to optimize the ranking. We study the query suggestion quality with simulations and real users and conduct an extensive performance evaluation. The results show that the query suggestions are useful (saved roughly 40% of users’ mouse clicks), and AutoG returns suggestions shortly under a large variety of parameter settings.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
3
Ideally, the user interface may automatically show useful suggestions to users. The current gui (Fig. 1) provides an “Autocomplete” button for users to fetch the top-k suggestions to allow an explicit comparison of the experiences with and without suggestions for user tests.
 
4
Query composition refers to an intuitive step when users are composing their queries, which is obviously different from the compositions used in the functional programming literature.
 
5
We assume the non-feature parts of the queries are inputted by users. That is, they are not composed from AutoG.
 
6
In contrast, for keyword search (of strings), key phrases are simply composed by a union/concatenation of keywords.
 
8
Almost all query templates of PubChem on their user interface are also frequent subgraphs, returned by gSpan using its default parameters. This shows that the domain experts indeed set frequent subgraphs as query templates. Yet, we omit the advanced templates of PubChem after user’s click on the UI.
 
9
The automorphism of \(f_i\) is not used for pruning because \(f_i\) is embedded in q, and the automorphism of q is unknown offline.
 
10
The automorphism relation \(A_{\mathsf {cs}}\) of \(\mathsf {cs}\) can be retrieved from FDag.
 
11
Readers may find the full list of queries for investigating the suggestion quality from the project site https://​goo.​gl/​Xr9MRY. Further, a short video shows how users may interact with the AutoG prototype.
 
12
The questionnaire used in the tests can be found at http://​goo.​gl/​dFRdwj.
 
Literature
1.
go back to reference Abiteboul, S., Amsterdamer, Y., Milo, T., Senellart, P.: Auto-completion learning for xml. In: SIGMOD (2012) Abiteboul, S., Amsterdamer, Y., Milo, T., Senellart, P.: Auto-completion learning for xml. In: SIGMOD (2012)
2.
go back to reference Bast, H., Weber, I.: Type less, find more: fast autocompletion search with a succinct index. In: SIGIR (2006) Bast, H., Weber, I.: Type less, find more: fast autocompletion search with a succinct index. In: SIGIR (2006)
3.
go back to reference Bhowmick, S.S., Choi, B., Zhou, S.: VOGUE: towards a visual interaction-aware graph query processing framework. In: CIDR (2013) Bhowmick, S.S., Choi, B., Zhou, S.: VOGUE: towards a visual interaction-aware graph query processing framework. In: CIDR (2013)
4.
go back to reference Bhowmick, S.S., Chua, H.-E., Thian, B., Choi, B.: ViSual: An hci-inspired simulator for blending visual subgraph query construction and processing. In: ICDE (2015) Bhowmick, S.S., Chua, H.-E., Thian, B., Choi, B.: ViSual: An hci-inspired simulator for blending visual subgraph query construction and processing. In: ICDE (2015)
5.
go back to reference Bhowmick, S.S., Dyreson, C.E., Choi, B., Ang, M.-H.: Interruption-sensitive empty result feedback: Rethinking the visual query feedback paradigm for semistructured data. In: CIKM (2015) Bhowmick, S.S., Dyreson, C.E., Choi, B., Ang, M.-H.: Interruption-sensitive empty result feedback: Rethinking the visual query feedback paradigm for semistructured data. In: CIKM (2015)
6.
go back to reference Borodin, A., Lee, H.C., Ye, Y.: Max-sum diversification, monotone submodular functions and dynamic updates. In: PODS (2012) Borodin, A., Lee, H.C., Ye, Y.: Max-sum diversification, monotone submodular functions and dynamic updates. In: PODS (2012)
7.
go back to reference Braga, D., Campi, A., Ceri, S.: XQBE (XQuery By Example), A visual interface to the standard xml query language. In: TODS (2005) Braga, D., Campi, A., Ceri, S.: XQBE (XQuery By Example), A visual interface to the standard xml query language. In: TODS (2005)
8.
go back to reference Broder, A.Z.: On the resemblance and containment of documents. In: Compression and complexity of sequences (1997) Broder, A.Z.: On the resemblance and containment of documents. In: Compression and complexity of sequences (1997)
9.
go back to reference Bunke, H., Shearer, K.: A graph distance metric based on the maximal common subgraph. Pattern Recognit. Lett. 19(3), 255–259 (1998)CrossRefMATH Bunke, H., Shearer, K.: A graph distance metric based on the maximal common subgraph. Pattern Recognit. Lett. 19(3), 255–259 (1998)CrossRefMATH
10.
go back to reference Cheng, J., Ke, Y., Ng, W., Lu, A.: Fg-index: towards verification-free query processing on graph databases. In: SIGMOD (2007) Cheng, J., Ke, Y., Ng, W., Lu, A.: Fg-index: towards verification-free query processing on graph databases. In: SIGMOD (2007)
11.
go back to reference Comai, S., Damiani, E., Fraternali, P.: Computing graphical queries over xml data. In: TOIS (2001) Comai, S., Damiani, E., Fraternali, P.: Computing graphical queries over xml data. In: TOIS (2001)
12.
go back to reference Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A (sub)graph isomorphism algorithm for matching large graphs. In: PAMI (2004) Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A (sub)graph isomorphism algorithm for matching large graphs. In: PAMI (2004)
13.
go back to reference Fan, Z., Peng, Y., Choi, B., Xu, J., Bhowmick, S.S.: Towards efficient authenticated subgraph query service in outsourced graph databases. In: TSC (2014) Fan, Z., Peng, Y., Choi, B., Xu, J., Bhowmick, S.S.: Towards efficient authenticated subgraph query service in outsourced graph databases. In: TSC (2014)
14.
go back to reference Feng, J., Li, G.: Efficient fuzzy type-ahead search in xml data. In: TKDE, pp. 882–895 (2012) Feng, J., Li, G.: Efficient fuzzy type-ahead search in xml data. In: TKDE, pp. 882–895 (2012)
15.
go back to reference Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: WWW (2009) Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: WWW (2009)
16.
go back to reference Han, W.-S., Lee, J., Pham, M.-D., Yu, J.X.: iGraph: a framework for comparisons of disk-based graph indexing techniques. In: PVLDB, pp. 449–459 (2010) Han, W.-S., Lee, J., Pham, M.-D., Yu, J.X.: iGraph: a framework for comparisons of disk-based graph indexing techniques. In: PVLDB, pp. 449–459 (2010)
18.
go back to reference Hung, H.H., Bhowmick, S.S., Truong, B.Q., Choi, B., Zhou, S.: QUBLE: blending visual subgraph query formulation with query processing on large networks. In: SIGMOD, pp. 1097–1100 (2013) Hung, H.H., Bhowmick, S.S., Truong, B.Q., Choi, B., Zhou, S.: QUBLE: blending visual subgraph query formulation with query processing on large networks. In: SIGMOD, pp. 1097–1100 (2013)
19.
go back to reference Jayaram, N., Goyal, S., Li, C.: VIIQ: auto-suggestion enabled visual interface for interactive graph query formulation. In: PVLDB, pp. 1940–1951 (2015) Jayaram, N., Goyal, S., Li, C.: VIIQ: auto-suggestion enabled visual interface for interactive graph query formulation. In: PVLDB, pp. 1940–1951 (2015)
20.
go back to reference Jayaram, N., Gupta, M., Khan, A., Li, C., Yan, X., Elmasri, R.: GQBE: querying knowledge graphs by example entity tuples. In: ICDE (2014) Jayaram, N., Gupta, M., Khan, A., Li, C., Yan, X., Elmasri, R.: GQBE: querying knowledge graphs by example entity tuples. In: ICDE (2014)
21.
go back to reference Jin, C., Bhowmick, S.S., Xiao, X., Cheng, J., Choi, B.: GBLENDER: towards blending visual query formulation and query processing in graph databases. In: SIGMOD (2010) Jin, C., Bhowmick, S.S., Xiao, X., Cheng, J., Choi, B.: GBLENDER: towards blending visual query formulation and query processing in graph databases. In: SIGMOD (2010)
22.
go back to reference Kriege, N., Mutzel, P., Schäfer, T.: Practical sahn clustering for very large data sets and expensive distance metrics. J. Graph Algorithms Appl. 18, 577–602 (2014) Kriege, N., Mutzel, P., Schäfer, T.: Practical sahn clustering for very large data sets and expensive distance metrics. J. Graph Algorithms Appl. 18, 577–602 (2014)
23.
go back to reference Li, Y., Yu, C., Jagadish, H.V.: Enabling schema-free xquery with meaningful query focus. VLDB J. 17, 355–377 (2008) Li, Y., Yu, C., Jagadish, H.V.: Enabling schema-free xquery with meaningful query focus. VLDB J. 17, 355–377 (2008)
24.
go back to reference Lin, C., Lu, J., Ling, T.W., Cautis, B.: LotusX: a position-aware xml graphical search system with auto-completion. In: ICDE (2012) Lin, C., Lu, J., Ling, T.W., Cautis, B.: LotusX: a position-aware xml graphical search system with auto-completion. In: ICDE (2012)
25.
go back to reference Luks, E.M.: Isomorphism of graphs of bounded valence can be tested in polynomial time. J. Comput. Syst. Sci. 25, 42–65 (1982) Luks, E.M.: Isomorphism of graphs of bounded valence can be tested in polynomial time. J. Comput. Syst. Sci. 25, 42–65 (1982)
26.
go back to reference Marchionini, G.: Exploratory search: from finding to understanding. Commun. ACM 49, 41–46 (2006) Marchionini, G.: Exploratory search: from finding to understanding. Commun. ACM 49, 41–46 (2006)
27.
go back to reference McGregor, J.J.: Backtrack search algorithms and the maximal common subgraph problem. Softw. Pract. Exp. 12, 23–34 (1982) McGregor, J.J.: Backtrack search algorithms and the maximal common subgraph problem. Softw. Pract. Exp. 12, 23–34 (1982)
28.
go back to reference Mottin, D., Bonchi, F., Gullo, F.: Graph query reformulation with diversity. In: KDD, pp. 825–834 (2015) Mottin, D., Bonchi, F., Gullo, F.: Graph query reformulation with diversity. In: KDD, pp. 825–834 (2015)
29.
go back to reference Nandi, A., Jagadish, H.V.: Effective phrase prediction. In: VLDB, pp. 219–230 (2007) Nandi, A., Jagadish, H.V.: Effective phrase prediction. In: VLDB, pp. 219–230 (2007)
32.
go back to reference Pandey, S., Punera, K.: Unsupervised extraction of template structure in web search queries. In: WWW, pp. 409–418 (2012) Pandey, S., Punera, K.: Unsupervised extraction of template structure in web search queries. In: WWW, pp. 409–418 (2012)
33.
go back to reference Papakonstantinou, Y., Petropoulos, M., Vassalos, V.: QURSED: querying and reporting semistructured data. In: SIGMOD (2002) Papakonstantinou, Y., Petropoulos, M., Vassalos, V.: QURSED: querying and reporting semistructured data. In: SIGMOD (2002)
35.
go back to reference Shasha, D., Wang, J.T.-L., Giugno, R.: Algorithmics and applications of tree and graph searching. In: PODS (2002) Shasha, D., Wang, J.T.-L., Giugno, R.: Algorithmics and applications of tree and graph searching. In: PODS (2002)
36.
go back to reference Venero, M.L.F., Valiente, G.: A graph distance metric combining maximum common subgraph and minimum common supergraph. Pattern Recognit. Lett. 22, 753–758 (2001)CrossRefMATH Venero, M.L.F., Valiente, G.: A graph distance metric combining maximum common subgraph and minimum common supergraph. Pattern Recognit. Lett. 22, 753–758 (2001)CrossRefMATH
37.
go back to reference Vieira, M.R., Razente, H.L., Barioni, M.C.N., Hadjieleftheriou, M., Srivastava, D., Traina, C., Tsotras, V.J.: On query result diversification. In: ICDE (2011) Vieira, M.R., Razente, H.L., Barioni, M.C.N., Hadjieleftheriou, M., Srivastava, D., Traina, C., Tsotras, V.J.: On query result diversification. In: ICDE (2011)
38.
go back to reference Wallis, W.D., Shoubridge, P., Kraetzl, M., Ray, D.: Graph distances using graph union. Pattern Recognit. Lett. 22, 701–704 (2001)CrossRefMATH Wallis, W.D., Shoubridge, P., Kraetzl, M., Ray, D.: Graph distances using graph union. Pattern Recognit. Lett. 22, 701–704 (2001)CrossRefMATH
39.
go back to reference Xiao, C., Qin, J., Wang, W., Ishikawa, Y., Tsuda, K., Sadakane, K.: Efficient error-tolerant query autocompletion. In: PVLDB (2013) Xiao, C., Qin, J., Wang, W., Ishikawa, Y., Tsuda, K., Sadakane, K.: Efficient error-tolerant query autocompletion. In: PVLDB (2013)
40.
go back to reference Xie, X., Fan, Z., Choi, B., Yi, P., Bhowmick, S.S., Zhou, S.: PIGEON: Progress indicator for subgraph queries. In: ICDE (2015) Xie, X., Fan, Z., Choi, B., Yi, P., Bhowmick, S.S., Zhou, S.: PIGEON: Progress indicator for subgraph queries. In: ICDE (2015)
41.
go back to reference Yan, X., Han, J.: gSpan: Graph-based substructure pattern mining. In: ICDM, pp. 721–724, (2002) Yan, X., Han, J.: gSpan: Graph-based substructure pattern mining. In: ICDM, pp. 721–724, (2002)
42.
go back to reference Yan, X., Yu, P.S., Han, J.: Graph indexing: a frequent structure-based approach. In: SIGMOD (2004) Yan, X., Yu, P.S., Han, J.: Graph indexing: a frequent structure-based approach. In: SIGMOD (2004)
44.
go back to reference Yi, P., Choi, B., Bhowmick, S.S., Xu, J.: AutoG: a visual query autocompletion framework for graph databases [demo]. PVLDB 9, 1505–1508 (2016) Yi, P., Choi, B., Bhowmick, S.S., Xu, J.: AutoG: a visual query autocompletion framework for graph databases [demo]. PVLDB 9, 1505–1508 (2016)
45.
go back to reference Yuan, D., Mitra, P.: Lindex: a lattice-based index for graph databases. VLDB J. 22, 229–252 (2013)CrossRef Yuan, D., Mitra, P.: Lindex: a lattice-based index for graph databases. VLDB J. 22, 229–252 (2013)CrossRef
Metadata
Title
AutoG: a visual query autocompletion framework for graph databases
Authors
Peipei Yi
Byron Choi
Sourav S. Bhowmick
Jianliang Xu
Publication date
27-01-2017
Publisher
Springer Berlin Heidelberg
Published in
The VLDB Journal / Issue 3/2017
Print ISSN: 1066-8888
Electronic ISSN: 0949-877X
DOI
https://doi.org/10.1007/s00778-017-0454-9

Other articles of this Issue 3/2017

The VLDB Journal 3/2017 Go to the issue

Premium Partner