skip to main content
research-article
Public Access

On Non-Preemptive VM Scheduling in the Cloud

Published:19 December 2017Publication History
Skip Abstract Section

Abstract

We study the problem of scheduling VMs (Virtual Machines) in a distributed server platform, motivated by cloud computing applications. The VMs arrive dynamically over time to the system, and require a certain amount of resources (e.g. memory, CPU, etc) for the duration of their service. To avoid costly preemptions, we consider non-preemptive scheduling: Each VM has to be assigned to a server which has enough residual capacity to accommodate it, and once a VM is assigned to a server, its service cannot be disrupted (preempted). Prior approaches to this problem either have high complexity, require synchronization among the servers, or yield queue sizes/delays which are excessively large. We propose a non-preemptive scheduling algorithm that resolves these issues. In general, given an approximation algorithm to Knapsack with approximation ratio r, our scheduling algorithm can provide fraction of the throughput region for β < r. In the special case of a greedy approximation algorithm to Knapsack, we further show that this condition can be relaxed to β<1. The parameters β and r can be tuned to provide a trade-off between achievable throughput, delay, and computational complexity of the scheduling algorithm. Finally extensive simulation results using both synthetic and real traffic traces are presented to verify the performance of our algorithm.

References

  1. AWS Pipeline 2017. AWS Data Pipeline. (2017). https://aws.amazon.com/datapipeline/.Google ScholarGoogle Scholar
  2. Theophilus Benson, Aditya Akella, and David A Maltz. 2010. Network traffic characteristics of data centers in the wild. In Proceedings of the 10th ACM SIGCOMM conference on Internet measurement. ACM, 267--280. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Thomas Bonald and Davide Cuda. 2012. Rate-Optimal scheduling schemes for asynchronous Input-Queued packet switches. ACM SIGMETRICS Performance Evaluation Review 40, 3 (2012), 95--97. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Christopher Clark, Keir Fraser, Steven Hand, Jacob Gorm Hansen, Eric Jul, Christian Limpach, Ian Pratt, and Andrew Warfield. 2005. Live migration of virtual machines. In Proceedings of the 2nd Conference on Symposium on Networked Systems Design & Implementation-Volume 2. USENIX Association, 273--286. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Waltenegus Dargie. 2014. Estimation of the cost of VM migration. Proceedings - International Conference on Computer Communications and Networks, ICCCN (2014).Google ScholarGoogle ScholarCross RefCross Ref
  6. EC2 2017. Elastic Compute Cloud (EC2) Cloud Server and Hosting - AWS. (2017). https://aws.amazon.com/ec2/Google ScholarGoogle Scholar
  7. Deniz Ersoz, Mazin S. Yousif, and Chita R. Das. 2007. Characterizing network traffic in a cluster-based, multi-tier data center. Proceedings - International Conference on Distributed Computing Systems 1 (2007). Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Expedia. 2017. http://www.expedia.com. (2017).Google ScholarGoogle Scholar
  9. Anja Feldmann and Ward Whitt. 1998. Fitting mixtures of exponentials to long-tail distributions to analyze network performance models. Performance Evaluation 31, 3--4 (1998), 245--279. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M R Garey and D S Johnson. 1979. Computers and Intractability: A Guide to the Theory of NP-Completeness (Series of Books in the Mathematical Sciences). Computers and Intractability (1979), 340. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Javad Ghaderi. 2016. Randomized algorithms for scheduling VMs in the cloud. In IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications. IEEE, 1--9.Google ScholarGoogle ScholarCross RefCross Ref
  12. Javad Ghaderi, Sanjay Shakkottai, and R Srikant. 2016. Scheduling Storms and Streams in the Cloud. ACM Transactions on Modeling and Performance Evaluation of Computing Systems 1, 4 (2016), 1--28. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Javad Ghaderi, Yuan Zhong, and R Srikant. 2014. Asymptotic optimality of BestFit for stochastic bin packing. ACM SIGMETRICS Performance Evaluation Review 42, 2 (2014), 64--66. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. James Glanz. 2012. Power, pollution and the internet. The New York Times 22 (2012).Google ScholarGoogle Scholar
  15. Robert Grandl, Ganesh Ananthanarayanan, Srikanth Kandula, Sriram Rao, and Aditya Akella. 2014. Multi-resource packing for cluster schedulers. In ACM SIGCOMM Computer Communication Review, Vol. 44. ACM, 455--466. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Joe Wenjie Jiang, Tian Lan, Sangtae Ha, Minghua Chen, and Mung Chiang. 2012. Joint VM placement and routing for data center traffic engineering. In Proceedings of IEEE INFOCOM. 2876--2880.Google ScholarGoogle ScholarCross RefCross Ref
  17. Wolfgang John, Kostas Pentikousis, George Agapiou, Eduardo Jacob, Mario Kind, Antonio Manzalini, Fulvio Risso, Dimitri Staessens, Rebecca Steinert, and Catalin Meirosu. 2013. Research directions in network service chaining. In Future Networks and Services (SDN4FNS), 2013 IEEE SDN for. IEEE, 1--7.Google ScholarGoogle Scholar
  18. Hans Kellerer, Ulrich Pferschy, and David Pisinger. 2004. Introduction to NP-Completeness of knapsack problems. Springer.Google ScholarGoogle Scholar
  19. Edward Yu-Hsien Lin. 1998. A Bibliographical Survey on Some Well-Known Non-Standard Knapsack Problems. Infor 36, 4 (1998), 274--317.Google ScholarGoogle Scholar
  20. Minghong Lin, Adam Wierman, Lachlan L H Andrew, and Eno Thereska. 2013. Dynamic right-sizing for powerproportional data centers. IEEE/ACM Transactions on Networking 21, 5 (2013), 1378--1391. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Siva Theja Maguluri and R Srikant. 2013. Scheduling jobs with unknown duration in clouds. In Proceedings 2013 IEEE INFOCOM. 1887--1895.Google ScholarGoogle ScholarCross RefCross Ref
  22. Siva Theja Maguluri and R Srikant. 2014. Scheduling jobs with unknown duration in clouds. IEEE/ACM Transactions on Networking 22, 6 (2014), 1938--1951. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Siva Theja Maguluri, R. Srikant, and Lei Ying. 2012. Stochastic models of load balancing and scheduling in cloud computing clusters. Proceedings - IEEE INFOCOM (2012), 702--710.Google ScholarGoogle ScholarCross RefCross Ref
  24. Marco Ajmone Marsan, Andrea Bianco, Paolo Giaccone, Emilio Leonardi, and Fabio Neri. 2002. Packet-mode scheduling in input-queued cell-based switches. IEEE/ACM Transactions on Networking (TON) 10, 5 (2002), 666--678. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Xiaoqiao Meng, Vasileios Pappas, and Li Zhang. 2010. Improving the scalability of data center networks with trafficaware virtual machine placement. In 2010 Proceedings of IEEE INFOCOM. 1--9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Sean P Meyn and Richard L Tweedie. 1993. Stability of markovian processes II: continuous time processes and sampled chains. Advances in Applied Probability (1993), 487--517.Google ScholarGoogle Scholar
  27. Paul Nash. 2015. Introducing Preemptible VMs. https://cloudplatform.googleblog.com/2015/05/Introducing- Preemptible-VMs-a-new-class-of-compute-available-at-70-off-standard-pricing.html. (2015).Google ScholarGoogle Scholar
  28. Bernt K. Oksendal. 2003. Stochastic Differential Equations: An Introduction with Applications (Sixth ed.). Springer.Google ScholarGoogle Scholar
  29. Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, and Michael a. Kozuch. 2012. Heterogeneity and dynamicity of clouds at scale : Google Trace Analysis. Proceedings of the Third ACM Symposium on Cloud Computing - SoCC '12 (2012), 1--13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Charles Reiss, Alexey Tumanov, Gregory R Ganger, Randy H Katz, and Michael A Kozuch. 2012. Heterogeneity and dynamicity of clouds at scale: Google trace analysis. In Proceedings of the Third ACM Symposium on Cloud Computing. ACM, 7. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Devavrat Shah and Jinwoo Shin. 2012. Randomized scheduling algorithm for queueing networks. The Annals of Applied Probability 22, 1 (2012), 128--171.Google ScholarGoogle ScholarCross RefCross Ref
  32. Mark Stillwell, Frédéric Vivien, and Henri Casanova. 2012. Virtual machine resource allocation for service hosting on heterogeneous distributed platforms. In Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International. IEEE, 786--797. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Alexander Stolyar and Yuan Zhong. 2013. Asymptotic optimality of a greedy randomized algorithm in a large-scale service system with general packing constraints. arXiv preprint arXiv:1306.4991 (2013).Google ScholarGoogle Scholar
  34. Alexander L Stolyar. 2013. An infinite server system with general packing constraints. Operations Research 61, 5 (2013), 1200--1217.Google ScholarGoogle ScholarCross RefCross Ref
  35. Alexander L Stolyar and Yuan Zhong. 2013. A large-scale service system with packing constraints: Minimizing the number of occupied servers. In Proceedings of the ACM SIGMETRICS/international conference on and modeling of computer systems. ACM, 41--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Leandros Tassiulas and Anthony Ephremides. 1992. Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks. Automatic Control, IEEE Transactions on 37, 12 (1992), 1936--1948.Google ScholarGoogle ScholarCross RefCross Ref
  37. Vijay V Vazirani. 2013. Approximation algorithms. Springer Science & Business Media. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Abhishek Verma, Luis Pedrosa, Madhukar Korupolu, David Oppenheimer, Eric Tune, and John Wilkes. 2015. Large-scale cluster management at Google with Borg. Proceedings of the Tenth European Conference on Computer Systems - EuroSys '15 (2015), 1--17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. John Wilkes. 2011. Google Cluster Data. https://github.com/google/cluster-data. (2011).Google ScholarGoogle Scholar
  40. Jing Xu and Jose AB Fortes. 2010. Multi-objective virtual machine placement in virtualized data center environments. In 2010 IEEE/ACM Int'l Conference on Green Computing and Communications (GreenCom), & Int'l Conference on Cyber, Physical and Social Computing (CPSCom). 179--188. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Yağiz Onat Yazir, Chris Matthews, Roozbeh Farahbod, Stephen Neville, Adel Guitouni, Sudhakar Ganti, and Yvonne Coady. 2010. Dynamic resource allocation in computing clouds using distributed multiple criteria decision analysis. In IEEE Conference on Cloud Computing (CLOUD). 91--98. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Shunyuan Ye, Yanming Shen, and Shivendra Panwar. 2010. An O(1) scheduling algorithm for variable-size packet switching systems. In Annual Allerton Conference on Communication, Control, and Computing. 1683 1690.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. On Non-Preemptive VM Scheduling in the Cloud

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image Proceedings of the ACM on Measurement and Analysis of Computing Systems
          Proceedings of the ACM on Measurement and Analysis of Computing Systems  Volume 1, Issue 2
          December 2017
          480 pages
          EISSN:2476-1249
          DOI:10.1145/3175501
          Issue’s Table of Contents

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 19 December 2017
          Published in pomacs Volume 1, Issue 2

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader