skip to main content
research-article

GRAIL: scalable reachability index for large graphs

Published:01 September 2010Publication History
Skip Abstract Section

Abstract

Given a large directed graph, rapidly answering reachability queries between source and target nodes is an important problem. Existing methods for reachability trade-off indexing time and space versus query time performance. However, the biggest limitation of existing methods is that they simply do not scale to very large real-world graphs. We present a very simple, but scalable reachability index, called GRAIL, that is based on the idea of randomized interval labeling, and that can effectively handle very large graphs. Based on an extensive set of experiments, we show that while more sophisticated methods work better on small graphs, GRAIL is the only index that can scale to millions of nodes and edges. GRAIL has linear indexing time and space, and the query time ranges from constant time to being linear in the graph order and size.

References

  1. R. Agrawal, A. Borgida, and H. V. Jagadish. Efficient management of transitive relationships in large data and knowledge bases. SIGMOD Rec., 18(2):253--262, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. P. Bouros, S. Skiadopoulos, T. Dalamagas, D. Sacharidis, and T. Sellis. Evaluating reachability queries over path collections. In SSDBM, page 416, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Bramandia, B. Choi, and W. K. Ng. On incremental maintenance of 2-hop labeling of graphs. In WWW, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Y. Chen. General spanning trees and reachability query evaluation. In Canadian Conference on Computer Science and Software Engineering, Montreal, Quebec, Canada, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Chen and Y. Chen. An efficient algorithm for answering graph reachability queries. In ICDE, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Cheng, J. X. Yu, X. Lin, H. Wang, and P. S. Yu. Fast computing reachability labelings for large graphs with high compression rate. In EBDT, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. E. Cohen, E. Halperin, H. Kaplan, and U. Zwick. Reachability and distance queries via 2-hop labels. SIAM Journal of Computing, 32(5):1335--1355, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to Algorithms. MIT Press, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. C. Demetrescu and G. Italiano. Fully Dynamic Transitive Closure: Breaking through the O(n 2) Barrier. In FOCS, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C. Demetrescu and G. Italiano. Dynamic shortest paths and transitive closure: Algorithmic techniques and data structures. Journal of Discrete Algorithms, 4(3):353--383, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  11. P. F. Dietz. Maintaining order in a linked list. In STOC, 1982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. H. He, H. Wang, J. Yang, and P. S. Yu. Compact reachability labeling for graph-structured data. In CIKM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. V. Jagadish. A compression technique to materialize transitive closure. ACM Trans. Database Syst., 15(4):558--598, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Jin, Y. Xiang, N. Ruan, and D. Fuhry. 3-hop: a high-compression indexing scheme for reachability query. In SIGMOD, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Jin, Y. Xiang, N. Ruan, and H. Wang. Efficient answering reachability queries on very large directed graphs. In SIGMOD, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. V. King and G. Sagert. A fully dynamic algorithm for maintaining the transitive closure. J. Comput. Syst. Sci., 65(1):150--167, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. I. Krommidas and C. Zaroliagis. An experimental study of algorithms for fully dynamic transitive closure. Journal of Experimental Algorithmics, 12:16, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. L. Roditty and U. Zwick. A fully dynamic reachability algorithm for directed graphs with an almost linear update time. In STOC, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R. Schenkel, A. Theobald, and G. Weikum. HOPI: an efficient connection index for complex XML document collections. In EBDT, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  20. R. Schenkel, A. Theobald, and G. Weikum. Efficient creation and incremental maintenance of the hopi index for complex xml document collections. In ICDE, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. Trissl and U. Leser. Fast and practical indexing and querying of very large graphs. In SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. H. Wang, H. He, J. Yang, P. Yu, and J. X. Yu. Dual labeling: Answering graph reachability queries in constant time. In ICDE, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. X. Yu, X. Lin, H. Wang, P. S. Yu, and J. Cheng. Fast computation of reachability labeling for large graphs. In EBDT, 2006.Google ScholarGoogle Scholar

Index Terms

  1. GRAIL: scalable reachability index for large graphs
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image Proceedings of the VLDB Endowment
      Proceedings of the VLDB Endowment  Volume 3, Issue 1-2
      September 2010
      1658 pages

      Publisher

      VLDB Endowment

      Publication History

      • Published: 1 September 2010
      Published in pvldb Volume 3, Issue 1-2

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader