skip to main content
10.1145/1871437.1871457acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Path-hop: efficiently indexing large graphs for reachability queries

Published:26 October 2010Publication History

ABSTRACT

Graph reachability is a fundamental research problem that finds its use in many applications such as geographic navigation, bioinformatics, web ontologies and XML databases, etc. Given two vertices, u and v, in a directed graph, a reachability query asks if there is a directed path from u to v. Over the last two decades, many indexing schemes have been proposed to support reachability queries on large graphs. Typically, those schemes based on chain or tree covers work well when the graph is sparse. For dense graphs, they still have fast query time but require large storage for their indices. In contrast, the 2-Hop cover and its variations/extensions produce compact indices even for dense graphs but have slower query time than those chain/tree covers. In this paper, we propose a new indexing scheme, called Path-Hop, which is even more space-efficient than those schemes based on 2-Hop cover and yet has query processing speed comparable to those chain/tree covers. We conduct extensive experiments to illustrate the effectiveness of our approach relative to other state-of-the-art methods.

References

  1. R. Agrawal, A. Borgida, and H. V. Jagadish. Efficient management of transitive relationships in large data and knowledge bases. In Proc. of SIGMOD, pp 253--262, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Aho, M. Garey, J. Ullman. The transitive reduction of a directed graph. SIAM Journal on Computing, 1(2):131--137, 1972.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. L. Chen, A. Gupta, M. E. Kurul. Stack-based algorithms for pattern matching on DAGs. In Proc. of 31st VLDB, pp 493--504, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. V. Chvatal. A greedy heuristic for the set-covering problem. Math. Oper. Res, 4:233--235, 1979.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. E. Cohen, E. Halperin, H. Kaplan, and U. Zwick. Reachability and distance queries via 2-hop labels. In Proc. of 13th Annual ACM-SIAM Symposium on Discrete Algorithms, pp 937--946, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. V. Jagadish. A compression technique to materialize transitive closure. ACM Trans. on Database Systems, 15(4):558--598, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Jin, Y. Xiang, N. Ruan, and H. Wang. Efficiently answering reachability queries on very large directed graphs. In Proc. of SIGMOD, pp 595--608, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Jin, Y. Xiang, N. Ruan, and D. Fuhry. 3-HOP: a high-compression indexing scheme for reachability query. In Proc. of SIGMOD, pp 813--826, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. I. M. Keseler, J. Collado-Vides, S. Gama-Castro, J. Ingraham, S. Paley, I. T. Paulsen, M. Peralta-Gil, and P. D. Karp. Ecocyc: A comprehensive database resource for escherichia coli. In Nucleic Acids Research, 33 (D334-D337), 2005.Google ScholarGoogle ScholarCross RefCross Ref
  10. R. Paige and R. Tarjan. Three partition refinement algorithms. SIAM Journal on Computing, 16:973--988,1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. P. Romero, J. Wagg, M. L. Green, D. Kaiser, M. Krummenacker, and P. D Karp. Computational prediction of human metabolic pathways from the complete human genome. Genome Biology, 6(1):1--17,2004Google ScholarGoogle ScholarCross RefCross Ref
  12. R. Schenkel, A. Theobald, and G. Weikum. HOPI: an efficient connection index for complex XML document collections. In 7th EDBT, LNCS 2992, pp 237--255, 2004.Google ScholarGoogle Scholar
  13. S. Trißl and U. Leser. Fast and practical indexing and querying of very large graphs. In Proc. of SIGMOD, pp 845--856, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. H. Wang, H. He, J. Yang, P. S. Yu, and J. X. Yu. Dual labeling: answering graph reachability queries in constant time. In Proc. of 22nd ICDE, pp 75, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. E. Zimanyi and S. Gabouje. Semantic visualization of biochemical databases. In Proc. of 2004 Int'l Conf. on Semantics of a Networked World, pp 199--214, 2004.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Path-hop: efficiently indexing large graphs for reachability queries

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '10: Proceedings of the 19th ACM international conference on Information and knowledge management
          October 2010
          2036 pages
          ISBN:9781450300995
          DOI:10.1145/1871437

          Copyright © 2010 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 26 October 2010

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader