skip to main content
10.1145/956750.956784acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

CloseGraph: mining closed frequent graph patterns

Authors Info & Claims
Published:24 August 2003Publication History

ABSTRACT

Recent research on pattern discovery has progressed form mining frequent itemsets and sequences to mining structured patterns including trees, lattices, and graphs. As a general data structure, graph can model complicated relations among data with wide applications in bioinformatics, Web exploration, and etc. However, mining large graph patterns in challenging due to the presence of an exponential number of frequent subgraphs. Instead of mining all the subgraphs, we propose to mine closed frequent graph patterns. A graph g is closed in a database if there exists no proper supergraph of g that has the same support as g. A closed graph pattern mining algorithm, CloseGraph, is developed by exploring several interesting pruning methods. Our performance study shows that CloseGraph not only dramatically reduces unnecessary subgraphs to be generated but also substantially increases the efficiency of mining, especially in the presence of large graph patterns.

References

  1. R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In VLDB'94.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. T. Asai, K. Abe, S. Kawasoe, H. Arimura, H. Satamoto, and S. Arikawa. Efficient substructure discovery from large semi-structured data. In SDM'02.]]Google ScholarGoogle Scholar
  3. C. Borgelt and M. R. Berthold. Mining molecular fragments: Finging relevant substructures of molecules. In ICDM'02.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Burdick, M. Calimlim, and J. Gehrke. MAFIA: A maximal frequent itemset algorithm for transactional databases. In ICDE'01.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. T. Cormen, C. Leiserson, R. Rivest, and C. Stein. Introduction to Algorithms, 2nd ed. The MIT Press, Cambridge, MA, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. L. Dehaspe, H. Toivonen, and R. King. Finding frequent substructures in chemical compounds. In KDD'98.]]Google ScholarGoogle Scholar
  7. I. Eidhammer, I. Jonassen, and W. R. Taylor Structure comparison and structure patterns. In J. of Comp. Bio. 7, 2000.]]Google ScholarGoogle Scholar
  8. J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In SIGMOD'00.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. L. B. Holder, D. J. Cook, and S. Djoko. Substructure discovery in the subdue system. In KDD'94.]]Google ScholarGoogle Scholar
  10. A. Inokuchi, T. Washio, and H. Motoda. An apriori-based algorithm for mining frequent substructures from graph data. In PKDD'00.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Kuramochi and G. Karypis. Frequent subgraph discovery. In ICDM'01.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Kuramochi and G. Karypis. Discovering frequent geometric subgraphs. In ICDM'02.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. N. Pasquier, Y. Bastide, R. Taouil, and L. Lakhal. Discovering frequent closed itemsets for association rules. In ICDT'99.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Pei, J. Han, and R. Mao. CLOSET: An efficient algorithm for mining frequent closed itemsets. In DMKD'00.]]Google ScholarGoogle Scholar
  15. N. Vanetik, E. Gudes, and S. E. Shimony. Computing frequent graph patterns from semistructured data. In ICDM'02.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. X. Yan and J. Han. gSpan: Graph-based substructure pattern mining. UIUC-CS Tech. Report: R-2002-2296, (a 4-page short version in ICDM'02).]]Google ScholarGoogle Scholar
  17. X. Yan, J. Han, and R. Afshar. CloSpan: Mining closed sequential patterns in large datasets. In SDM'03.]]Google ScholarGoogle Scholar
  18. M. J. Zaki and C. J. Hsiao. CHARM: An efficient algorithm for closed itemset mining. In SDM'02.]]Google ScholarGoogle Scholar
  19. M. J. Zaki. Efficiently mining frequent trees in a forest. In KDD'02.]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. CloseGraph: mining closed frequent graph patterns

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '03: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2003
      736 pages
      ISBN:1581137370
      DOI:10.1145/956750

      Copyright © 2003 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 August 2003

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      KDD '03 Paper Acceptance Rate46of298submissions,15%Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader