skip to main content
10.1145/1353343.1353369acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article
Free Access

A novel spectral coding in a large graph database

Authors Info & Claims
Published:25 March 2008Publication History

ABSTRACT

Retrieving related graphs containing a query graph from a large graph database is a key issue in many graph-based applications, such as drug discovery and structural pattern recognition. Because sub-graph isomorphism is a NP-complete problem [4], we have to employ a filter-and-verification framework to speed up the search efficiency, that is, using an effective and efficient pruning strategy to filter out the false positives (graphs that are not possible in the results) as many as possible first, then validating the remaining candidates by subgraph isomorphism checking. In this paper, we propose a novel filtering method, a spectral encoding method, i.e. GCoding. Specifically, we assign a signature to each vertex based on its local structures. Then, we generate a spectral graph code by combining all vertex signatures in a graph. Based on spectral graph codes, we derive a necessary condition for sub-graph isomorphism. Then we propose two pruning rules for sub-graph search problem, and prove that they satisfy the no-false-negative requirement (no dismissal in answers). Since graph codes are in numerical space, we take this advantage and conduct efficient filtering over graph codes. Extensive experiments show that GCoding outperforms existing counterpart methods.

References

  1. D. Cai, Z. Shao, X. He, X. Yan, and J. Han. Community mining from multi-relational networks. In PKDD, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  2. J. Cheng, Y. Ke, W. Ng, and A. Lu. fg-index: Towards verification-free query processing on graph databases. In SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. H. D. W. Williams and W. Wang. Graph database indexing using structured graph decomposition. In ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  4. S. Fortin. The graph isomorphism problem. Department of Computing Science, University of Alberta, 1996.Google ScholarGoogle Scholar
  5. H. H. Goldstine, F. J. Murray, and J. von Neumann. The jacobi method for real symmetric matrices. J. ACM, 6(1):59--96, 1959. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P. Y. H. Jiang, H. Wang and S. Zhou. Gstring: A novel approach for efficient search in graph databases. In ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  7. H. He and A. K. Singh. Closure-tree: An index structure for graph queries. In ICDE, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Inokuchi, T. Washio, and H. Motoda. An apriori-based algorithm for mining frequent substructures from graph data. In PKDD, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. C. A. James, D. Weininger, and J. Delany. Daylight theory manual daylisght version 4.82. Daylight Chemical Information Systems, Inc., 2003.Google ScholarGoogle Scholar
  10. Y. Ke, J. Cheng, and W. Ng. Correlation search in graph databases. In SIGKDD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. H. Kitagawa and Y. Ishikawa. False drop analysis of set retrieval with signature files. Inf. Syst., 27(2), 2002.Google ScholarGoogle Scholar
  12. M. Kuramochi and G. Karypis. Frequent subgraph discovery. In ICDM, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. E. G. M. Petrakis and C. Faloutsos. Similarity searching in medical image databases. IEEE Transactions on Knowledge and Data Enginnering, 9(3), 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Shasha, J. T.-L. Wang, and R. Giugno. Algorithmics and applications of tree and graph searching. In PODS, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Shokoufandeh, S. J. Dickinson, K. Siddiqi, and S. W. Zucker. Indexing using a spectral encoding of topological structure. In CVPR, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  16. E. Tousidou, P. Bozanis, and Y. Manolopoulos. Signature-based structures for objects with set-valued attributes. Inf. Syst., 27(2), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. R. Ullmann. An algorithm for subgraph isomorphism. J. ACM, 23(1), 1976. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. D. B. West. Introduction to Graph Theory (2nd Edition). Prentice Hall, 2000.Google ScholarGoogle Scholar
  19. P. Willett. Chemical similarity searching. J. Chem. Inf. Comput. Sci, 38(6), 1998.Google ScholarGoogle ScholarCross RefCross Ref
  20. X. Yan and J. Han. gspan: Graph-based substructure pattern mining. In ICDM, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. X. Yan and J. Han. gspan: Graph-based substructure pattern mining. Proc. of Int. Conf. on Data Mining, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. X. Yan, P. S. Yu, and J. Han. Graph indexing: A frequent structure-based approach. In SIGMOD, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. N. Zhang, M. T. Özsu, I. F. Ilyas, and A. Aboulnaga. Fix: Feature-based indexing technique for xml documents. In VLDB, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Zhang, M. Hu, and J. Yang. Treepi: A novel graph indexing method. In ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  25. P. Zhao, J. X. Yu, and P. S. Yu. Graph indexing: Tree + delta >= graph. In VLDB, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A novel spectral coding in a large graph database

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        EDBT '08: Proceedings of the 11th international conference on Extending database technology: Advances in database technology
        March 2008
        762 pages
        ISBN:9781595939265
        DOI:10.1145/1353343

        Copyright © 2008 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 25 March 2008

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate7of10submissions,70%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader