Skip to main content

2018 | OriginalPaper | Buchkapitel

Graph Pattern Matching Preserving Label-Repetition Constraints

verfasst von : Houari Mahfoud

Erschienen in: Model and Data Engineering

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Graph pattern matching is a routine process for a wide variety of applications such as social network analysis. It is typically defined in terms of subgraph isomorphism which is NP-Complete. To lower its complexity, many extensions of graph simulation have been proposed which focus on some topological constraints of pattern graphs that can be preserved in polynomial-time over data graphs. We discuss the satisfaction of a new topological constraint, called Label-Repetition constraint. To the best of our knowledge, existing polynomial approaches fail to preserve this constraint, and moreover, one can adopt only subgraph isomorphism for this end which is cost-prohibitive. We present first a necessary and sufficient condition that a data subgraph must satisfy to preserve the Label-Repetition constraints of the pattern graph. Furthermore, we define matching based on a notion of triple simulation, an extension of graph simulation by considering the new topological constraint. We show that with this extension, graph pattern matching can be performed in polynomial-time, by providing such an algorithm. We extend our solution to deal with edges that have counting quantifiers of the form “\(\ge p\)”.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Proofs and other details are available in the full version [16].
 
2
This match result can be defined similarly to graph (dual) simulation.
 
3
This procedure finds the maximum matching over \(BG_1\) (resp. \(BG_2\)), using the algorithm of Hopcroft et al. [13], and then checks whether the size of this maximum matching is equals to \(|X_1|\) (resp. \(|X_2|\)).
 
4
Given a graph G(VE), \(|G|=|V|+|E|\).
 
5
Not given here since its definition is trivial.
 
6
This result is a combination of our Theorem 2 and Theorem 4.1 of Ma et al. [14].
 
7
See [11] for the semantic.
 
Literatur
1.
Zurück zum Zitat Arias, M., Fernández, J.D., Martínez-Prieto, M.A., de la Fuente, P.: An empirical study of real-world SPARQL queries. CoRR (2011) Arias, M., Fernández, J.D., Martínez-Prieto, M.A., de la Fuente, P.: An empirical study of real-world SPARQL queries. CoRR (2011)
2.
Zurück zum Zitat Cho, J., Shivakumar, N., Garcia-Molina, H.: Finding replicated web collections. In: Proceedings of SIGMOD (2000) Cho, J., Shivakumar, N., Garcia-Molina, H.: Finding replicated web collections. In: Proceedings of SIGMOD (2000)
3.
Zurück zum Zitat Cong, G., Fan, W., Kementsietsidis, A.: Distributed query evaluation with performance guarantees. In: Proceedings of SIGMOD (2007) Cong, G., Fan, W., Kementsietsidis, A.: Distributed query evaluation with performance guarantees. In: Proceedings of SIGMOD (2007)
4.
Zurück zum Zitat Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A (Sub)Graph isomorphism algorithm for matching large graphs. IEEE Trans. Pattern Anal. Mach. Intell. 26, 1367–1372 (2004)CrossRef Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A (Sub)Graph isomorphism algorithm for matching large graphs. IEEE Trans. Pattern Anal. Mach. Intell. 26, 1367–1372 (2004)CrossRef
5.
Zurück zum Zitat Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms, 3rd edn. MIT Press, Cambridge (2009)MATH Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms, 3rd edn. MIT Press, Cambridge (2009)MATH
6.
Zurück zum Zitat Fan, W.: Graph pattern matching revised for social network analysis. In: Proceedings of ICDT (2012) Fan, W.: Graph pattern matching revised for social network analysis. In: Proceedings of ICDT (2012)
7.
Zurück zum Zitat Fan, W., Li, J., Ma, S., Tang, N., Wu, Y.: Adding regular expressions to graph reachability and pattern queries. In: Proceedings of ICDE (2011) Fan, W., Li, J., Ma, S., Tang, N., Wu, Y.: Adding regular expressions to graph reachability and pattern queries. In: Proceedings of ICDE (2011)
8.
Zurück zum Zitat Fan, W., Li, J., Ma, S., Tang, N., Wu, Y.: Graph pattern matching: from intractable to polynomial time. Proc. VLDB Endow. 3, 264–275 (2010)CrossRef Fan, W., Li, J., Ma, S., Tang, N., Wu, Y.: Graph pattern matching: from intractable to polynomial time. Proc. VLDB Endow. 3, 264–275 (2010)CrossRef
9.
Zurück zum Zitat Fan, W., Li, J., Wang, X., Wu, Y.: Query preserving graph compression. In: Proceedings of SIGMOD (2012) Fan, W., Li, J., Wang, X., Wu, Y.: Query preserving graph compression. In: Proceedings of SIGMOD (2012)
10.
Zurück zum Zitat Fan, W., Wang, X., Wu, Y.: Incremental graph pattern matching. ACM Trans. Database Syst. 38 (2013) Fan, W., Wang, X., Wu, Y.: Incremental graph pattern matching. ACM Trans. Database Syst. 38 (2013)
11.
Zurück zum Zitat Fan, W., Wu, Y., Xu, J.: Adding counting quantifiers to graph patterns. In: Proceedings of SIGMOD (2016) Fan, W., Wu, Y., Xu, J.: Adding counting quantifiers to graph patterns. In: Proceedings of SIGMOD (2016)
12.
Zurück zum Zitat Grujic, I., Dinic, S.B., Stoimenov, L.: Collecting and analyzing data from e-government facebook pages (2014) Grujic, I., Dinic, S.B., Stoimenov, L.: Collecting and analyzing data from e-government facebook pages (2014)
13.
Zurück zum Zitat Hopcroft, J.E., Karp, R.M.: An n5/2 algorithm for maximum matchings in bipartite graphs. SIAM J. Comput. 2, 225–231 (1973)MathSciNetCrossRef Hopcroft, J.E., Karp, R.M.: An n5/2 algorithm for maximum matchings in bipartite graphs. SIAM J. Comput. 2, 225–231 (1973)MathSciNetCrossRef
14.
Zurück zum Zitat Ma, S., Cao, Y., Fan, W., Huai, J., Wo, T.: Strong simulation: capturing topology in graph pattern matching. ACM Trans. Database Syst. 39 (2014) Ma, S., Cao, Y., Fan, W., Huai, J., Wo, T.: Strong simulation: capturing topology in graph pattern matching. ACM Trans. Database Syst. 39 (2014)
15.
Zurück zum Zitat Maccioni, A., Abadi, D.J.: Scalable pattern matching over compressed graphs via dedensification. In: Proceedings of SIGKDD, pp. 1755–1764 (2016) Maccioni, A., Abadi, D.J.: Scalable pattern matching over compressed graphs via dedensification. In: Proceedings of SIGKDD, pp. 1755–1764 (2016)
16.
Zurück zum Zitat Mahfoud, H.: Graph pattern matching preserving label-repetition constraints. CoRR (2018) Mahfoud, H.: Graph pattern matching preserving label-repetition constraints. CoRR (2018)
17.
Zurück zum Zitat Milner, R.: Communication and Concurrency (1989) Milner, R.: Communication and Concurrency (1989)
18.
Zurück zum Zitat Shemshadi, A., Sheng, Q.Z., Qin, Y.: Efficient pattern matching for graphs with multi-labeled nodes. Knowl. Based Syst. 109, 256–265 (2016)CrossRef Shemshadi, A., Sheng, Q.Z., Qin, Y.: Efficient pattern matching for graphs with multi-labeled nodes. Knowl. Based Syst. 109, 256–265 (2016)CrossRef
19.
Zurück zum Zitat Tung, L.-D., Nguyen-Van, Q., Hu, Z.: Efficient query evaluation on distributed graphs with hadoop environment. In: Proceedings of SoICT (2013) Tung, L.-D., Nguyen-Van, Q., Hu, Z.: Efficient query evaluation on distributed graphs with hadoop environment. In: Proceedings of SoICT (2013)
Metadaten
Titel
Graph Pattern Matching Preserving Label-Repetition Constraints
verfasst von
Houari Mahfoud
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-00856-7_17