skip to main content
10.1145/2983323.2983726acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

GiraphAsync: Supporting Online and Offline Graph Processing via Adaptive Asynchronous Message Processing

Published:24 October 2016Publication History

ABSTRACT

It is highly desired for existing distributed graph processing systems to support both offline analytics and online queries adaptively. Existing offline graph analytics systems are mostly based on synchronous model. Although achieving high throughput, they suffer relatively high latency in answering simple queries due to synchronization overhead and slow convergence. On the other hand, online graph query systems adopting asynchronous model can response at any time, while incur overwhelmed messages and network packets, making them unable to meet the high throughput demand of offline analytics. In this work, we propose an adaptive asynchronous message processing (AAMP) method, which improves the efficiency of network communication while maintains low latency, to efficiently support offline analytics and online queries in one graph processing framework. We then design GiraphAsync, an implementation of AAMP on top of Apache Giraph, and evaluate it using several representative offline analytics and online queries on large graph datasets. Experimental results show that GiraphAsync gains an up to 10X improvement over synchronous model systems for graph analytics, while performs as well as specialized systems for online graph queries.

References

  1. Apache Giraph. http://incubator.apache.org/giraph/.Google ScholarGoogle Scholar
  2. Jena. http://jena.apache.org/.Google ScholarGoogle Scholar
  3. Neo4j. http://neo4j.com/.Google ScholarGoogle Scholar
  4. M. Atre, V. Chaoji, M. J. Zaki, and J. A. Hendler. Matrix "bit" loaded: a scalable lightweight join query processor for rdf data. In WWW, pages 41--50, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to Algorithms. The MIT Press, 3rd edition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Gao, C. Zhou, J. Zhou, and J. X. Yu. Continuous pattern detection over billion-edge graph using distributed framework. In ICDE, pages 556--567, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  7. G.Malewicz, M.H.Austern, A.J.C.Bik, J.C.Dehnert, I.Horn, N.Leiser, and G.Czajkowski. Pregel: a system for large-scale graph processing. In SIGMOD, pages 135--146, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Y. Guo, Z. Pan, and J. Heflin. Lubm: A benchmark for owl knowledge base systems. Journal of Web Semantics, 3(2--3):158--182, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Han and K. Daudjee. Giraph unchained: Barrierless asynchronous parallel execution in pregel-like graph processing systems. PVLDB, 8(9), 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Han, K. Daudjee, K. Ammar, M. Tamerozsu, X. Wang, and T. Jin. An experimental comparison of pregel-like graph processing systems. PVLDB, 7(12):1047--1058, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. B. Iordanov. Hypergraphdb: a generalized graph database. In WAIM, pages 25--36. Springer, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K.Haewoon, L.Changhyun, P.Hosung, and M.Sue. What is Twitter, a social network or a news media? In WWW, pages 591--600, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Kyrola, G. Blelloch, and C. Guestrin. Graphchi:large-scale graph computation on just a pc. OSDI, pages 31--46, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y. Low, D. Bickson, J. Gonzalez, C. Guestrin, A. Kyrola, and J. M. Hellerstein. Distributed graphlab: A framework for machine learning and data mining in the cloud. PVLDB, 5(8):716--727, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Y. Lu, J. Cheng, D. Yan, and H. Wu. Large-scale distributed graph computing systems: An experimental evaluation. PVLDB, 8(3):281--292, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. R. McCune, T. Weninger, and G. Madey. Thinking like a vertex: a survey of vertex-centric frameworks for large-scale distributed graph processing. ACM Comput. Surv., 48(2), 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. T. Neumann and G. Weikum. Rdf-3x: a risc-style engine for rdf. PVLDB, 1(1), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Sarwat, S. Elnikety, Y. He, and M. F. Mokbel. HortonGoogle ScholarGoogle Scholar
  19. : A distributed system for processing declarative reachability queries over partitioned graphs. PVLDB, 6(14), 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Shao, H. Wang, and Y. Li. Trinity: A distributed graph engine on a memory cloud. In SIGMOD, pages 505--516, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Y. Tian, A. Balmin, S. A. Corsten, S. Tatikonda, and J. McPherson. From "think like a vertex" to "think like a graph". PVLDB, 7(3):193--204, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. U.Kang, C.E.Tsourakakis, and C.Faloutsos. Pegasus: A peta-scale graph mining system. In ICDE, pages 229--238, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. L. G. Valiant. A bridging model for parallel computation. Commun. ACM, 33(8):103--111, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. G. Wang, W. Xie, A. Demers, and J. Gehrke. Asynchronous large-scale graph processing made easy. In CIDR, 2013.Google ScholarGoogle Scholar
  25. C. Xie, R. Chen, H. Guan, B. Zang, and H. Chen. ync or async: Time to fuse for distributed graph-parallel computation. PPoPP, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Y. Zhang, Q. Gao, L. Gao, and C. Wang. Accelerate large-scale iterative computation through asynchronous accumulative updates. In ScienceCloud '12, pages 13--22, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. C. Zhou, J. Gao, B. Sun, and J. X. Yu. Mocgraph: Scalable distributed graph processing using message online computing. PVLDB, 8(4):377--388, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. GiraphAsync: Supporting Online and Offline Graph Processing via Adaptive Asynchronous Message Processing

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
      October 2016
      2566 pages
      ISBN:9781450340731
      DOI:10.1145/2983323

      Copyright © 2016 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 October 2016

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      CIKM '16 Paper Acceptance Rate160of701submissions,23%Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader