skip to main content
research-article

Taming verification hardness: an efficient algorithm for testing subgraph isomorphism

Authors Info & Claims
Published:01 August 2008Publication History
Skip Abstract Section

Abstract

Graphs are widely used to model complicated data semantics in many applications. In this paper, we aim to develop efficient techniques to retrieve graphs, containing a given query graph, from a large set of graphs. Considering the problem of testing subgraph isomorphism is generally NP-hard, most of the existing techniques are based on the framework of filtering-and-verification to reduce the precise computation costs; consequently various novel feature-based indexes have been developed. While the existing techniques work well for small query graphs, the verification phase becomes a bottleneck when the query graph size increases. Motivated by this, in the paper we firstly propose a novel and efficient algorithm for testing subgraph isomorphism, QuickSI. Secondly, we develop a new feature-based index technique to accommodate QuickSI in the filtering phase. Our extensive experiments on real and synthetic data demonstrate the efficiency and scalability of the proposed techniques, which significantly improve the existing techniques.

References

  1. A. V. Aho, J. E. Hopcroft, and J. Ullman. Data Structures and Algorithms. 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. V. Arvind and J. Köbler. Graph isomorphism is low for zpp(np) and other lowness results. In STACS, pages 431--442, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Berretti, A. D. Bimbo, and E. Vicario. Efficient matching and indexing of graph models in content-based retrieval. IEEE Trans. Pattern Anal. Mach. Intell., 23 (10):1089--1105, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Cheng, Y. Ke, W. Ng, and A. Lu. Fg-index: towards verification-free query processing on graph databases. In Proceedings of the ACM SIGMOD international conference on Management of data, pages 857--872, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Chi, Y. Yang, and R. R. Muntz. Canonical forms for labelled trees and their applications in frequent subtree mining. Knowl. Inf. Syst., 8(2):203--234, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  6. T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to Algorithms. 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. R. Garey and D. S. Johnson. Computers and Intractability; A Guide to the Theory of NP-Completeness. New York, NY, USA, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Giugno and D. Shasha. Graphgrep: A fast and universal method for querying graphs. Proceedings of the International Conference on Pattern Recognition, 2:112--115 vol.2, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  9. H. He and A. K. Singh. Closure-tree: An index structure for graph queries. In Proceedings of the International Conference on Data Engineering, page 38, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. H. Jiang, H. Wang, P. S. Yu, and S. Zhou. Gstring: A novel approach for efficient search in graph databases. In Proceedings of the International Conference on Data Engineering, pages 566--575, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  11. A. H. Land and A. G. Doig. An automatic method of solving discrete programming problems. Econometrica, 28(3):497--520.Google ScholarGoogle ScholarCross RefCross Ref
  12. B. T. Messmer and H. Bunke. A decision tree approach to graph and subgraph isomorphism detection. Pattern Recognition, 32(12):1979--1998, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  13. J. R. Ullmann. An algorithm for subgraph isomorphism. J. ACM, 23(1):31--42, 1976. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. W. Williams, J. Huan, and W. Wang. Graph database indexing using structured graph decomposition. In Proceedings of the International Conference on Data Engineering, pages 976--985, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  15. X. Yan and J. Han. gspan: Graph-based substructure pattern mining. In Proceedings of the IEEE International Conference on Data Mining, page 721, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. X. Yan, P. S. Yu, and J. Han. Graph indexing: a frequent structure-based approach. In Proceedings of the ACM SIGMOD international conference on Management of data, pages 335--346, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. X. Yan, P. S. Yu, and J. Han. Substructure similarity search in graph databases. In Proceedings of the ACM SIGMOD international conference on Management of data, pages 766--777, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Zhang, M. Hu, and J. Yang. Treepi: A novel graph indexing method. In Proceedings of the International Conference on Data Engineering, pages 966--975, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  19. P. Zhao and J. X. Yu. Fast frequent free tree mining in graph databases. World Wide Web, 11(1):71--92, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. P. Zhao, J. X. Yu, and P. S. Yu. Graph indexing: tree + delta <= graph. In Proceedings of the International Conference on Very Large Data Bases, pages 938--949, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. L. Zou, L. Chen, J. X. Yu, and Y. Lu. A novel spectral coding in a large graph database. In Proceedings of the International Conference on Extending Database Technology, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Taming verification hardness: an efficient algorithm for testing subgraph isomorphism

                  Recommendations

                  Comments

                  Login options

                  Check if you have access through your login credentials or your institution to get full access on this article.

                  Sign in

                  Full Access

                  PDF Format

                  View or Download as a PDF file.

                  PDF

                  eReader

                  View online with eReader.

                  eReader