skip to main content
10.1145/2619239.2626315acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article
Free Access

Efficient coflow scheduling with Varys

Published:17 August 2014Publication History

ABSTRACT

Communication in data-parallel applications often involves a collection of parallel flows. Traditional techniques to optimize flow-level metrics do not perform well in optimizing such collections, because the network is largely agnostic to application-level requirements. The recently proposed coflow abstraction bridges this gap and creates new opportunities for network scheduling. In this paper, we address inter-coflow scheduling for two different objectives: decreasing communication time of data-intensive jobs and guaranteeing predictable communication time. We introduce the concurrent open shop scheduling with coupled resources problem, analyze its complexity, and propose effective heuristics to optimize either objective. We present Varys, a system that enables data-intensive frameworks to use coflows and the proposed algorithms while maintaining high network utilization and guaranteeing starvation freedom. EC2 deployments and trace-driven simulations show that communication stages complete up to 3.16X faster on average and up to 2X more coflows meet their deadlines using Varys in comparison to per-flow mechanisms. Moreover, Varys outperforms non-preemptive coflow schedulers by more than 5X.

References

  1. Akka. http://akka.io.Google ScholarGoogle Scholar
  2. Amazon EC2. http://aws.amazon.com/ec2.Google ScholarGoogle Scholar
  3. Apache Hadoop. http://hadoop.apache.org.Google ScholarGoogle Scholar
  4. Apache Hive. http://hive.apache.org.Google ScholarGoogle Scholar
  5. Kryo serialization library. https://code.google.com/p/kryo.Google ScholarGoogle Scholar
  6. S. Agarwal et al. Reoptimizing data parallel computing. In NSDI'12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Al-Fares et al. Hedera: Dynamic flow scheduling for data center networks. In NSDI. 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Alizadeh et al. pFabric: Minimal near-optimal datacenter transport. In SIGCOMM. 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. G. Ananthanarayanan et al. Reining in the outliers in mapreduce clusters using Mantri. In OSDI. 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. G. Ananthanarayanan et al. PACMan: Coordinated memory caching for parallel jobs. In NSDI. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. H. Ballani et al. Towards predictable datacenter networks. In SIGCOMM. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. Benson et al. MicroTE: Fine grained traffic engineering for data centers. In CoNEXT. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. D. Borthakur. The Hadoop distributed file system: Architecture and design. Hadoop Project Website, 2007.Google ScholarGoogle Scholar
  14. R. Chaiken et al. SCOPE: Easy and efficient parallel processing of massive datasets. In VLDB. 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Chowdhury et al. Managing data transfers in computer clusters with Orchestra. In SIGCOMM. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Chowdhury et al. Coflow: A networking abstraction for cluster applications. In HotNets-XI, pages 31--36. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Chowdhury et al. Leveraging endpoint flexibility in data-intensive clusters. In SIGCOMM. 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Dean et al. MapReduce: Simplified data processing on large clusters. In OSDI, pages 137--150. 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. F. Dogar et al. Decentralized task-aware scheduling for data center networks. Technical Report MSR-TR-2013--96, 2013.Google ScholarGoogle Scholar
  20. A. Dragojevi et al. FaRM: Fast remote memory. In NSDI. 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A. D. Ferguson et al. Participatory networking: An API for application control of SDNs. In SIGCOMM. 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Greenberg et al. VL2: A scalable and flexible data center network. In SIGCOMM. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Z. Guo et al. Spotting code optimizations in data-parallel pipelines through PeriSCOPE. In OSDI. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. B. Hindman et al. Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center. In NSDI. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. C.-Y. Hong et al. Finishing flows quickly with preemptive scheduling. In SIGCOMM. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. M. Isard et al. Dryad: Distributed data-parallel programs from sequential building blocks. In EuroSys, pages 59--72. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. N. Kang et al. Optimizing the "One Big Switch" abstraction in Software-Defined Networks. In CoNEXT. 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. E. Kelley. Critical-path planning and scheduling: Mathematical basis. Operations Research, 9(3):296--320, 1961.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. G. Malewicz et al. Pregel: A system for large-scale graph processing. In SIGMOD. 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Mastrolilli et al. Minimizing the sum of weighted completion times in a concurrent open shop. Operations Research Letters, 38(5):390--395, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. N. McKeown et al. Achieving 100% throughput in an input-queued switch. IEEE Transactions on Communications, 47(8), 1999.Google ScholarGoogle ScholarCross RefCross Ref
  32. R. N. Mysore et al. PortLand: A scalable fault-tolerant layer 2 data center network fabric. In SIGCOMM, pages 39--50. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. L. Popa et al. FairCloud: Sharing the network in cloud computing. In SIGCOMM. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. T. A. Roemer. A note on the complexity of the concurrent open shop problem. Journal of Scheduling, 9(4):389--396, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. A. Shieh et al. Sharing the data center network. In NSDI. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. N. Tolia et al. An architecture for internet data transfer. In NSDI'06. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. L. G. Valiant. A bridging model for parallel computation. Communications of the ACM, 33(8):103--111, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. C. A. Waldspurger et al. Lottery scheduling: Flexible proportional-share resource management. In OSDI. 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. D. Xie et al. The only constant is change: Incorporating time-varying network reservations in data centers. In SIGCOMM. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. M. Zaharia et al. Delay scheduling: A simple technique for achieving locality and fairness in cluster scheduling. In EuroSys. 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. M. Zaharia et al. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In NSDI. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Efficient coflow scheduling with Varys

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SIGCOMM '14: Proceedings of the 2014 ACM conference on SIGCOMM
        August 2014
        662 pages
        ISBN:9781450328364
        DOI:10.1145/2619239

        Copyright © 2014 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 August 2014

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        SIGCOMM '14 Paper Acceptance Rate45of242submissions,19%Overall Acceptance Rate554of3,547submissions,16%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader