research-article

Taming verification hardness: an efficient algorithm for testing subgraph isomorphism

Authors:
Haichuan Shang

University of New South Wales & NICTA, Sydney, Australia

University of New South Wales & NICTA, Sydney, Australia
View Profile

,
Ying Zhang

University of New South Wales & NICTA, Sydney, Australia

University of New South Wales & NICTA, Sydney, Australia
View Profile

,
Xuemin Lin

University of New South Wales & NICTA, Sydney, Australia

University of New South Wales & NICTA, Sydney, Australia
View Profile

,
Jeffrey Xu Yu

Chinese university of Hong Kong, Hong Kong, China

Chinese university of Hong Kong, Hong Kong, China
View Profile

Proceedings of the VLDB Endowment Volume 1 Issue 1pp 364–375https://doi.org/10.14778/1453856.1453899

Published:01 August 2008Publication History

Proceedings of the VLDB Endowment

Abstract

Graphs are widely used to model complicated data semantics in many applications. In this paper, we aim to develop efficient techniques to retrieve graphs, containing a given query graph, from a large set of graphs. Considering the problem of testing subgraph isomorphism is generally NP-hard, most of the existing techniques are based on the framework of filtering-and-verification to reduce the precise computation costs; consequently various novel feature-based indexes have been developed. While the existing techniques work well for small query graphs, the verification phase becomes a bottleneck when the query graph size increases. Motivated by this, in the paper we firstly propose a novel and efficient algorithm for testing subgraph isomorphism, QuickSI. Secondly, we develop a new feature-based index technique to accommodate QuickSI in the filtering phase. Our extensive experiments on real and synthetic data demonstrate the efficiency and scalability of the proposed techniques, which significantly improve the existing techniques.

References

A. V. Aho, J. E. Hopcroft, and J. Ullman. Data Structures and Algorithms. 1983. Google ScholarDigital Library
V. Arvind and J. Köbler. Graph isomorphism is low for zpp(np) and other lowness results. In STACS, pages 431--442, 2000. Google ScholarDigital Library
S. Berretti, A. D. Bimbo, and E. Vicario. Efficient matching and indexing of graph models in content-based retrieval. IEEE Trans. Pattern Anal. Mach. Intell., 23 (10):1089--1105, 2001. Google ScholarDigital Library
J. Cheng, Y. Ke, W. Ng, and A. Lu. Fg-index: towards verification-free query processing on graph databases. In Proceedings of the ACM SIGMOD international conference on Management of data, pages 857--872, 2007. Google ScholarDigital Library
Y. Chi, Y. Yang, and R. R. Muntz. Canonical forms for labelled trees and their applications in frequent subtree mining. Knowl. Inf. Syst., 8(2):203--234, 2005.Google ScholarCross Ref
T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to Algorithms. 2001. Google ScholarDigital Library
M. R. Garey and D. S. Johnson. Computers and Intractability; A Guide to the Theory of NP-Completeness. New York, NY, USA, 1990. Google ScholarDigital Library
R. Giugno and D. Shasha. Graphgrep: A fast and universal method for querying graphs. Proceedings of the International Conference on Pattern Recognition, 2:112--115 vol.2, 2002.Google ScholarCross Ref
H. He and A. K. Singh. Closure-tree: An index structure for graph queries. In Proceedings of the International Conference on Data Engineering, page 38, 2006. Google ScholarDigital Library
H. Jiang, H. Wang, P. S. Yu, and S. Zhou. Gstring: A novel approach for efficient search in graph databases. In Proceedings of the International Conference on Data Engineering, pages 566--575, 2007.Google ScholarCross Ref
A. H. Land and A. G. Doig. An automatic method of solving discrete programming problems. Econometrica, 28(3):497--520.Google ScholarCross Ref
B. T. Messmer and H. Bunke. A decision tree approach to graph and subgraph isomorphism detection. Pattern Recognition, 32(12):1979--1998, 1999.Google ScholarCross Ref
J. R. Ullmann. An algorithm for subgraph isomorphism. J. ACM, 23(1):31--42, 1976. Google ScholarDigital Library
D. W. Williams, J. Huan, and W. Wang. Graph database indexing using structured graph decomposition. In Proceedings of the International Conference on Data Engineering, pages 976--985, 2007.Google ScholarCross Ref
X. Yan and J. Han. gspan: Graph-based substructure pattern mining. In Proceedings of the IEEE International Conference on Data Mining, page 721, 2002. Google ScholarDigital Library
X. Yan, P. S. Yu, and J. Han. Graph indexing: a frequent structure-based approach. In Proceedings of the ACM SIGMOD international conference on Management of data, pages 335--346, 2004. Google ScholarDigital Library
X. Yan, P. S. Yu, and J. Han. Substructure similarity search in graph databases. In Proceedings of the ACM SIGMOD international conference on Management of data, pages 766--777, 2005. Google ScholarDigital Library
S. Zhang, M. Hu, and J. Yang. Treepi: A novel graph indexing method. In Proceedings of the International Conference on Data Engineering, pages 966--975, 2007.Google ScholarCross Ref
P. Zhao and J. X. Yu. Fast frequent free tree mining in graph databases. World Wide Web, 11(1):71--92, 2008. Google ScholarDigital Library
P. Zhao, J. X. Yu, and P. S. Yu. Graph indexing: tree + delta <= graph. In Proceedings of the International Conference on Very Large Data Bases, pages 938--949, 2007. Google ScholarDigital Library
L. Zou, L. Chen, J. X. Yu, and Y. Lu. A novel spectral coding in a large graph database. In Proceedings of the International Conference on Extending Database Technology, 2008. Google ScholarDigital Library

Index Terms

Taming verification hardness: an efficient algorithm for testing subgraph isomorphism
1. Information systems
  1. Information retrieval
2. Mathematics of computing
  1. Discrete mathematics
    1. Graph theory
      1. Graph algorithms

Recommendations

NP-hardness of multiple bondage in graphs

Let p 2 be a positive integer. The p -bondage number of a graph G is the minimum number of edges whose removal from G results in a graph with larger p -domination number. The p -total bondage number of a graph G with no isolated vertex is the minimum ...
Read More
Cosecure Domination: Hardness Results and Algorithms
Combinatorial Algorithms
Abstract
For a simple graph $G = (V, E)$ without any isolated vertex, a cosecure dominating set S of G satisfies two properties, (i) S is a dominating set of G, (ii) for every vertex $v \in S$ , there exists a vertex $u \in V \ S$ such that $u v \in E$ and $(S \ {v}) \cup {u}$ is a ... $_{}$ $_{}$
Read More
Hardness transitions and uniqueness of acyclic colouring
Abstract
For k ∈ N, a k-acyclic colouring of a graph G is a function f : V ( G ) → { 0 , 1 , … , k − 1 } such that (i) f ( u ) ≠ f ( v ) for every edge u v of G, and (ii) there is no cycle in G bicoloured by f. For k ∈ N, the problem k-Acyclic ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
Proceedings of the VLDB Endowment Volume 1, Issue 1
August 2008
1216 pages
ISSN:2150-8097
Editors:
Peter Buneman,
Beng Chin Ooi,
Kenneth Ross,
Gerald Weber
Issue’s Table of Contents
Sponsors
In-Cooperation
Publisher
VLDB Endowment
Publication History
- Published: 1 August 2008
Published in pvldb Volume 1, Issue 1
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 136
  Total Citations
  View Citations
- 1,006
  Total Downloads
- Downloads (Last 12 months)81
- Downloads (Last 6 weeks)13
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Taming verification hardness: an efficient algorithm for testing subgraph isomorphism

Proceedings of the VLDB Endowment

Abstract

References

Cited By

Index Terms

Recommendations

NP-hardness of multiple bondage in graphs

Cosecure Domination: Hardness Results and Algorithms

Hardness transitions and uniqueness of acyclic colouring

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Taming verification hardness: an efficient algorithm for testing subgraph isomorphism

Proceedings of the VLDB Endowment

Abstract

References

Cited By

Index Terms

Recommendations

NP-hardness of multiple bondage in graphs

Cosecure Domination: Hardness Results and Algorithms

Hardness transitions and uniqueness of acyclic colouring

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media