skip to main content
research-article

Distance-constraint reachability computation in uncertain graphs

Published:01 June 2011Publication History
Skip Abstract Section

Abstract

Driven by the emerging network applications, querying and mining uncertain graphs has become increasingly important. In this paper, we investigate a fundamental problem concerning uncertain graphs, which we call the distance-constraint reachability (DCR) problem: Given two vertices s and t, what is the probability that the distance from s to t is less than or equal to a user-defined threshold d in the uncertain graph? Since this problem is #P-Complete, we focus on efficiently and accurately approximating DCR online. Our main results include two new estimators for the probabilistic reachability. One is a Horvitz-Thomson type estimator based on the unequal probabilistic sampling scheme, and the other is a novel recursive sampling estimator, which effectively combines a deterministic recursive computational procedure with a sampling process to boost the estimation accuracy. Both estimators can produce much smaller variance than the direct sampling estimator, which considers each trial to be either 1 or 0. We also present methods to make these estimators more computationally efficient. The comprehensive experiment evaluation on both real and synthetic datasets demonstrates the efficiency and accuracy of our new estimators.

References

  1. C. C. Aggarwal, editor. Managing and Mining Uncertain Data. Advances in Database Systems. Springer, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. O. Ball. Computational complexity of network reliability analysis: An overview. IEEE Transactions on Reliability, 35:230--239, 1986.Google ScholarGoogle ScholarCross RefCross Ref
  3. A. L. Barabasi and R. Albert. Emergence of scaling in random networks. Science, 286(5439):509--512, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  4. I. Clarke, O. Sandberg, B. Wiley, and T. W. Hong. Freenet: a distributed anonymous information storage and retrieval system. In International workshop on Designing privacy enhancing technologies: design issues in anonymity and unobservability, pages 46--66, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. J. Colbourn. The Combinatorics of Network Reliability. Oxford University Press, Inc., 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. G. S. Fishman. A comparison of four monte carlo methods for estimating the probability of s-t connectedness. IEEE Transactions on Reliability, 35(2):145--155, 1986.Google ScholarGoogle ScholarCross RefCross Ref
  7. M. Hua and J. Pei. Probabilistic path queries in road networks: traffic uncertainty aware path selection. In EDBT, pages 347--358, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Jin, Y. Xiang, N. Ruan, and D. Fuhry. 3-hop: a high-compression indexing scheme for reachability query. In SIGMOD Conference, pages 813--826, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. R. M. Karp and M. G. Luby. A new monte-carlo method for estimating the failure probability of an n-component system. Technical report, Berkeley, CA, USA, 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. H. Kumamoto, K. Tanaka, I. Koichi, and H. E. J. Dagger-sampling monte carlo for system unavailability evaluation. IEEE Transactions on Reliability, 29(2):122--125.Google ScholarGoogle ScholarCross RefCross Ref
  11. G. Pandurangan, P. Raghavan, and E. Upfal. Building low-diameter p2p networks. In FOCS, pages 492--499, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. L. Petingi. Combinatorial and computational properties of a diameter constrained network reliability model. In Proceedings of the WSEAS International Conference on Applied Computing Conference, ACC'08, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. Potamias, F. Bonchi, A. Gionis, and G. Kollios. k-nearest neighbors in uncertain graphs. PVLDB, 3(1):997--1008, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. G. Rubino. Network reliability evaluation, pages 275--302. Gordon and Breach Science Publishers, Inc., Newark, NJ, USA, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. J. Russell and P. Norvig. Artificial Intelligence: A Modern Approach. Pearson Education, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. Satyanarayana and R. K. Wood. A linear-time algorithm for computing k-terminal reliability in series-parallel networks. SIAM Journal on Computing, 14(4):818--832, 1985.Google ScholarGoogle ScholarCross RefCross Ref
  17. G. Swamynathan, C. Wilson, B. Boe, K. Almeroth, and B. Y. Zhao. Do social networks improve e-commerce?: a study on social marketplaces. In WOSP '08. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Thompson. Sampling, Second Edition. New York: Wiley., 2002.Google ScholarGoogle Scholar
  19. http://mips.helmholtz-muenchen.de/genre/proj/mpact/.Google ScholarGoogle Scholar
  20. http://thebiogrid.org/.Google ScholarGoogle Scholar
  21. L. G. Valiant. The complexity of enumeration and reliability problems. SIAM Journal on Computing, 8(3):410--421, 1979.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. H. Yildirim, V. Chaoji, and M. J. Zaki. Grail: Scalable reachability index for large graphs. PVLDB, 3(1):276--284, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Y. Yuan, L. Chen, and G. Wang. Efficiently answering probability threshold-based shortest path queries over uncertain graphs. In DASFAA (1) '10, pages 155--170, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Z. Zou, H. Gao, and J. Li. Discovering frequent subgraphs over uncertain graph databases under probabilistic semantics. KDD '10, pages 633--642, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Z. Zou, J. Li, H. Gao, and S. Zhang. Finding top-k maximal cliques in an uncertain graph. In ICDE, pages 649--652, 2010.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Distance-constraint reachability computation in uncertain graphs

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image Proceedings of the VLDB Endowment
        Proceedings of the VLDB Endowment  Volume 4, Issue 9
        June 2011
        70 pages

        Publisher

        VLDB Endowment

        Publication History

        • Published: 1 June 2011
        Published in pvldb Volume 4, Issue 9

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader