skip to main content
10.1145/3126908.3126953acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article
Public Access

Probabilistic guarantees of execution duration for Amazon spot instances

Published:12 November 2017Publication History

ABSTRACT

In this paper we propose DrAFTS - a methodology for implementing probabilistic guarantees of instance reliability in the Amazon Spot tier. Amazon offers "unreliable" virtual machine instances (ones that may be terminated at any time) at a potentially large discount relative to "reliable" On-demand and Reserved instances. Our method predicts the "bid values" that users can specify to provision Spot instances which ensure at least a fixed duration of execution with a given probability. We illustrate the method and test its validity using Spot pricing data post facto, both randomly and using real-world workload traces. We also test the efficacy of the method experimentally by using it to launch Spot instances and then observing the instance termination rate. Our results indicate that it is possible to obtain the same level of reliability from unreliable instances that the Amazon service level agreement guarantees for reliable instances with a greatly reduced cost.

References

  1. Orna Agmon Ben-Yehuda, Muli Ben-Yehuda, Assaf Schuster, and Dan Tsafrir. 2013. Deconstructing Amazon EC2 spot instance pricing. ACM Transactions on Economics and Computation 1, 3 (2013), 16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Amazon Web Services. 2016. Amazon EC2 Service Level Agreement. (2016). https://aws.amazon.com/ec2/sla/ accessed August 2016.Google ScholarGoogle Scholar
  3. Amazon Web Services. 2016. Amazon EC2 Spot Instances. (2016). http://aws.amazon.com/ec2/purchasing-options/spot-instances/ accessed April 2016.Google ScholarGoogle Scholar
  4. Amazon Web Services. 2016. Elastic Compute Cloud. (2016). https://aws.amazon.com/ec2/ accessed April 2016.Google ScholarGoogle Scholar
  5. Aristotle Cloud Federation. 2016. Aristotle Cloud Federation. (2016). https://federatedcloud.org accessed August 2016.Google ScholarGoogle Scholar
  6. George EP Box, Gwilym M Jenkins, Gregory C Reinsel, and Greta M Ljung. 2015. Time series analysis: forecasting and control. John Wiley & Sons.Google ScholarGoogle Scholar
  7. Ryan Chard, Kyle Chard, Kris Bubendorfer, Lukasz Lacinski, Ravi Madduri, and Ian Foster. 2015. Cost-Aware Cloud Provisioning. In Proceedings of the 11th IEEE International Conference on e-Science. 136--144. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Ryan Chard, Kyle Chard, Bryan Ng, Kris Bubendorfer, Alex Rodriguez, Ravi Madduri, and Ian Foster. 2016. An Automated Tool Profiling Service for the Cloud. In Proceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). 223--232.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Ryan Chard, Kyle Chard, Rich Wolski, Ravi Madduri, Bryan Ng, Kris Bubendorfer, and Ian Foster. 2017. Cost-Aware Cloud Profiling, Prediction, and Provisioning as a Service. IEEE Cloud Computing Magazine PP (2017).Google ScholarGoogle Scholar
  10. Roy T Fielding and Richard N Taylor. 2002. Principled design of the modern Web architecture. ACM Transactions on Internet Technology (TOIT) 2, 2 (2002), 115--150. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Google Cloud Platform. 2016. Google Preemptable Virtual Machines. (2016). https://cloud.google.com/preemptible-vms/ accessed April 2016.Google ScholarGoogle Scholar
  12. Weichao Guo, Kang Chen, Yongwei Wu, and Weimin Zheng. 2015. Bidding for Highly Available Services with Low Price in Spot Instance Market. In Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing. ACM, 191--202. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Xin He, Prashant Shenoy, Ramesh Sitaraman, and David Irwin. 2015. Cutting the Cost of Hosting Online Services Using Cloud Spot Markets. In Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Bahman Javadi, Ruppa K Thulasiram, and Rajkumar Buyya. 2011. Statistical modeling of spot instance prices in public cloud environments. In Utility and Cloud Computing (UCC), 2011 Fourth IEEE International Conference on. IEEE, 219--228. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Bahman Javadi, Ruppa K Thulasiram, and Rajkumar Buyya. 2013. Characterizing spot price dynamics in public cloud environments. Future Generation Computer Systems 29, 4 (2013), 988--999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Bogumił Kamiński and Przemysław Szufel. 2015. On optimization of simulation execution on Amazon EC2 spot market. Simulation Modelling Practice and Theory 58 (2015), 172--187.Google ScholarGoogle ScholarCross RefCross Ref
  17. Ravi Madduri, Kyle Chard, Ryan Chard, Lukasz Lacinski, Alex Rodriguez, Dinanath Sulakhe, David Kelly, Utpal Dave, and Ian Foster. 2015. The Globus Galaxies platform: delivering science gateways as a service. Concurrency and Computation: Practice and Experience 27, 16 (2015), 4344--4360. CPE-15-0040.Google ScholarGoogle ScholarCross RefCross Ref
  18. Ravi K. Madduri, Dinanath Sulakhe, Lukasz Lacinski, Bo Liu, Alex Rodriguez, Kyle Chard, Utpal J. Dave, and Ian T. Foster. 2014. Experiences building Globus Genomics: a next-generation sequencing analysis service using Galaxy, Globus, and Amazon Web Services. Concurrency and Computation: Practice and Experience 26, 13 (2014), 2266--2279. CPE-13-0338.R2. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. Nurmi, J. Brevik, and R. Wolski. 2005. Modeling Machine Availability in Enterprise and Wide-area Distributed Computing Environments. In Proceedings of Europar 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Daniel Nurmi, John Brevik, and Rich Wolski. 2008. QBETS: Queue bounds estimation from time series. In Job Scheduling Strategies for Parallel Processing. Springer, 76--101. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Daniel Nurmi, Rich Wolski, and John Brevik. 2008. Probabilistic advanced reservations for batch-scheduled parallel machines. In Proceedings of the 13th ACM SIGPLAN symposium on principles and practice of parallel programming. ACM, 289--290. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Prateek Sharma, Stephen Lee, Tian Guo, David Irwin, and Prashant Shenoy. 2015. Spotcheck: Designing a derivative iaas cloud on the spot market. In Proceedings of the Tenth European Conference on Computer Systems. ACM, 16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Supreeth Shastri, Amr Rizk, and David Irwin. 2016. Transient Guarantees: Maximizing the Value of Idle Cloud Capacity. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '16). IEEE Press, Piscataway, NJ, USA, Article 85, 11 pages. http://dl.acm.org/citation.cfm?id=3014904.3015019 Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Yang Song, Murtaza Zafer, and Kang-Won Lee. 2012. Optimal bidding in spot instance market. In INFOCOM, 2012 Proceedings IEEE. IEEE, 190--198.Google ScholarGoogle ScholarCross RefCross Ref
  25. Supreeth Subramanya, Tian Guo, Prateek Sharma, David Irwin, and Prashant Shenoy. 2015. SpotOn: a batch computing service for the spot market. In Proceedings of the Sixth ACM Symposium on Cloud Computing. ACM, 329--341. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. ShaoJie Tang, Jing Yuan, and Xiang-Yang Li. 2012. Towards optimal bidding strategy for Amazon EC2 cloud spot instance. In Cloud Computing (CLOUD), 2012 IEEE 5th International Conference on. IEEE, 91--98. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Robert M Wallace, Volodymyr Turchenko, Mehdi Sheikhalishahi, Iryna Turchenko, Vladyslav Shults, José Luis Vazquez-Poletti, and Lucio Grandinetti. 2013. Applications of neural-based spot market prediction for cloud computing. In Intelligent Data Acquisition and Advanced Computing Systems (IDAACS), 2013 IEEE 7th International Conference on, Vol. 2. IEEE, 710--716.Google ScholarGoogle ScholarCross RefCross Ref
  28. Rich Wolski and John Brevik. 2016. Probabilistic Guarantees of Execution Duration for Amazon Spot Instances. Technical Report 2016-05. University of California, Santa Barbara. https://www.cs.ucsb.edu/research/tech-reports/2016-05.Google ScholarGoogle Scholar
  29. Murtaza Zafer, Yang Song, and Kang-Won Lee. 2012. Optimal bids for spot vms in a cloud for deadline constrained jobs. In Cloud Computing (CLOUD), 2012 IEEE 5th International Conference on. IEEE, 75--82. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Liang Zheng, Carlee Joe-Wong, Chee Wei Tan, Mung Chiang, and Xinyu Wang. 2015. How to Bid the Cloud. In Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication. ACM, 71--84. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Probabilistic guarantees of execution duration for Amazon spot instances

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SC '17: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
          November 2017
          801 pages
          ISBN:9781450351140
          DOI:10.1145/3126908
          • General Chair:
          • Bernd Mohr,
          • Program Chair:
          • Padma Raghavan

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 12 November 2017

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          SC '17 Paper Acceptance Rate61of327submissions,19%Overall Acceptance Rate1,516of6,373submissions,24%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader