skip to main content
10.1145/1755913.1755940acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article

Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling

Published:13 April 2010Publication History

ABSTRACT

As organizations start to use data-intensive cluster computing systems like Hadoop and Dryad for more applications, there is a growing need to share clusters between users. However, there is a conflict between fairness in scheduling and data locality (placing tasks on nodes that contain their input data). We illustrate this problem through our experience designing a fair scheduler for a 600-node Hadoop cluster at Facebook. To address the conflict between locality and fairness, we propose a simple algorithm called delay scheduling: when the job that should be scheduled next according to fairness cannot launch a local task, it waits for a small amount of time, letting other jobs launch tasks instead. We find that delay scheduling achieves nearly optimal data locality in a variety of workloads and can increase throughput by up to 2x while preserving fairness. In addition, the simplicity of delay scheduling makes it applicable under a wide variety of scheduling policies beyond fair sharing.

References

  1. Amazon EC2. http://aws.amazon.com/ec2/.Google ScholarGoogle Scholar
  2. Apache Hadoop. http://hadoop.apache.org.Google ScholarGoogle Scholar
  3. Apache Hive. http://hadoop.apache.org/hive/.Google ScholarGoogle Scholar
  4. Hadoop Map/Reduce tutorial. http://hadoop.apache.org/common/docs/current/mapred_tutorial.html.Google ScholarGoogle Scholar
  5. Hive performance benchmarks. http://issues.apache.org/jira/browse/HIVE-396.Google ScholarGoogle Scholar
  6. HP Neoview Workload Management Services Guide. \smallurlhttp://www.docs.hp.com/en/544806-001/Neoview_WMS_Guide_R2.3.pdf.Google ScholarGoogle Scholar
  7. Max-Min Fairness (Wikipedia). http://en.wikipedia.org/wiki/Max-min_fairness.Google ScholarGoogle Scholar
  8. NSF Cluster Exploratory (CluE) Program Solicitation. http://www.nsf.gov/pubs/2008/nsf08560/nsf08560.htm.Google ScholarGoogle Scholar
  9. Official Google Blog: Sorting 1PB with MapReduce. \smallurlhttp://googleblog.blogspot.com/2008/11/sorting-1pb-with-mapreduce.%html.Google ScholarGoogle Scholar
  10. Open Cirrus. http://opencirrus.org/.Google ScholarGoogle Scholar
  11. Personal communication with Hans Zeller of HP.Google ScholarGoogle Scholar
  12. Personal communication with Owen O'Malley of the Yahoo! Hadoop team.Google ScholarGoogle Scholar
  13. TORQUE Resource Manager. http://www.clusterresources.com/pages/products/torque-resource-manager.php.Google ScholarGoogle Scholar
  14. Yahoo! Launches New Program to Advance Open-Source Software for Internet Computing. http://research.yahoo.com/node/1879.Google ScholarGoogle Scholar
  15. J. Bennett and H. Zhang. WF2Q): Worst-case fair weighted fair queueing. In IEEE INFOCOM'96, pages 120--128, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Bent, D. Thain, A. C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, and M. Livny. Explicit control in a batch--aware distributed file system. In NSDI'04, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A. Chervenak, E. Deelman, M. Livny, M.-H. Su, R. Schuler, S. Bharathi, G. Mehta, and K. Vahi. Data Placement for Scientific Applications in Distributed Environments. In Proc. 8th IEEE/ACM International Conference on Grid Computing (Grid 2007), September 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. Commun. ACM, 51(1):107--113, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Demers, S. Keshav, and S. Shenker. Analysis and simulation of a fair queueing algorithm. In Journal of Internetworking Research and Experience, pages 3--26, Oct. 1990.Google ScholarGoogle Scholar
  20. S. Floyd and V. Jacobson. Link-sharing and resource management models for packet networks. IEEE/ACM Transactions on Networking, 3(4):365--386, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. Ghemawat, H. Gobioff, and S.-T. Leung. The Google File System. In Proc. SOSP 2003, pages 29--43, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. B. Hindman, A. Konwinski, M. Zaharia, and I. Stoica. A common substrate for cluster computing. In Workshop on Hot Topics in Cloud Computing (HotCloud) 2009, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly. Dryad: Distributed data-parallel programs from sequential building blocks. In EuroSys 2007, pages 59--72, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. M. Isard, V. Prabhakaran, J. Currey, U. Wieder, K. Talwar, and A. Goldberg. Quincy: Fair scheduling for distributed computing clusters. In SOSP 2009, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. J. Nieh and M. S. Lam. A SMART scheduler for multimedia applications. ACM TOCS, 21(2):117--163, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. A. Pavlo, E. Paulson, A. Rasin, D. J. Abadi, D. J. DeWitt, S. Madden, and M. Stonebraker. A comparison of approaches to large-scale data analysis. In SIGMOD'09, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. I. Stoica, H. Zhang, and T. Ng. A hierarchical fair service curve algorithm for link-sharing, real-time and priority service. In SIGCOMM'97, pages 162--173, Sept. 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. D. Thain, T. Tannenbaum, and M. Livny. Distributed computing in practice: the Condor experience. Concurrency and Computation Practice and Experience, 17(2-4):323--356, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. C. A. Waldspurger. Lottery and Stride Scheduling: Flexible Proportional-Share Resource Management. PhD thesis, MIT, Laboratory of Computer Science, 1995. MIT/LCS/TR-667. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. C. A. Waldspurger and W. E. Weihl. Lottery scheduling: Flexible proportional-share resource management. In OSDI 94, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    EuroSys '10: Proceedings of the 5th European conference on Computer systems
    April 2010
    388 pages
    ISBN:9781605585772
    DOI:10.1145/1755913

    Copyright © 2010 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 13 April 2010

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    Overall Acceptance Rate241of1,308submissions,18%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader