skip to main content
10.1145/2806777.2806851acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

SpotOn: a batch computing service for the spot market

Published:27 August 2015Publication History

ABSTRACT

Cloud spot markets enable users to bid for compute resources, such that the cloud platform may revoke them if the market price rises too high. Due to their increased risk, revocable resources in the spot market are often significantly cheaper (by as much as 10×) than the equivalent non-revocable on-demand resources. One way to mitigate spot market risk is to use various fault-tolerance mechanisms, such as checkpointing or replication, to limit the work lost on revocation. However, the additional performance overhead and cost for a particular fault-tolerance mechanism is a complex function of both an application's resource usage and the magnitude and volatility of spot market prices.

We present the design of a batch computing service for the spot market, called SpotOn, that automatically selects a spot market and fault-tolerance mechanism to mitigate the impact of spot revocations without requiring application modification. SpotOn's goal is to execute jobs with the performance of on-demand resources, but at a cost near that of the spot market. We implement and evaluate SpotOn in simulation and using a prototype on Amazon's EC2 that packages jobs in Linux Containers. Our simulation results using a job trace from a Google cluster indicate that SpotOn lowers costs by 91.9% compared to using on-demand resources with little impact on performance.

References

  1. Scientific Computing Using Spot Instances. http://aws.amazon.com/ec2/spot-and-science/, June 2013.Google ScholarGoogle Scholar
  2. ClusterK. https://clusterk.com/, July 10th 2015.Google ScholarGoogle Scholar
  3. Google preemptible instances. https://cloud.google.com/compute/docs/instances/preemptible, July 10th 2015.Google ScholarGoogle Scholar
  4. J. Barr. New - EC2 Spot Instance Termination Notices. https://aws.amazon.com/blogs/aws/new-ec2-spot-instance-termination-notices/, January 6th 2015.Google ScholarGoogle Scholar
  5. C. Clark, K. Fraser, S. Hand, J. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Waeldr. Live Migration of Virtual Machines. In NSDI, May 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. B. Cully, G. Lefebvre, D. Meyer, M. Feeley, N. Hutchinson, and A. Warfield. Remus: High Availability via Asynchronous Virtual Machine Replication. In NSDI, April 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Di, D. Kondo, and F. Cappello. Characterizing and Modeling Cloud Applications/Jobs on a Google Data Center. Supercomputing, 69(1), July 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. R. Hines, U. Deshpande, and K. Gopalan. Post-copy Live Migration of Virtual Machines. SIGOPS Operating Systems Review, 43(3), July 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Khatua and N. Mukherjee. Application-centric Resource Provisioning for Amazon EC2 Spot Instances. In EuroPar, August 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Mao and M. Humphrey. A Performance Study on VM Startup Time in the Cloud. In CLOUD, June 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Mattess, C. Vecchiola, and R. Buyya. Managing Peak Loads by Leasing Cloud Infrastructure Services from a Spot Market. In HPCC, September 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Mazzucco and M. Dumas. Achieving Performance and Availability Guarantees with Spot Instances. In HPCC, September 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. I. Menache, O. Shamir, and N. Jain. On-demand, Spot, or Both: Dynamic Resource Allocation for Executing Batch Jobs in the Cloud. In ICAC, 2014.Google ScholarGoogle Scholar
  14. A. Mu'alem and D. Feitelson. Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling. TPDS, 12(6), June 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Reiss, J. Wilkes, and J. L. Hellerstein. Google Cluster-usage Traces: Format + Schema. Technical report, Google Inc., November 2011.Google ScholarGoogle Scholar
  16. P. Sharma, S. Lee, T. Guo, D. Irwin, and P. Shenoy. SpotCheck: Designing a Derivative IaaS Cloud on the Spot Market. In EuroSys, April 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Singh, D. Irwin, P. Shenoy, and K. Ramakrishnan. Yank: Enabling Green Data Centers to Pull the Plug. In NSDI, April 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Y. Song, M. Zafer, and K. Lee. Optimal Bidding in Spot Instance Market. In Infocom, March 2012.Google ScholarGoogle ScholarCross RefCross Ref
  19. S. Tang, J. Yuan, and X. Li. Towards Optimal Bidding Strategy for Amazon EC2 Cloud Spot Instance. In CLOUD, June 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Tsafrir, Y. Etsion, and D. Feitelson. Modeling User Runtime Estimates. In JSSP, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. W. Voorsluys and R. Buyya. Reliable Provisioning of Spot Instances for Compute-Intensive Applications. In AINA, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. S. Yi, D. Kondo, and A. Andrzejak. Reducing Costs of Spot Instances via Checkpointing in the Amazon Elastic Compute Cloud. In CLOUD, July 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Zafer, Y. Song, and K. Lee. Optimal Bids for Spot VMs in a Cloud for Deadline Constrained Jobs. In CLOUD, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Zaman and D. Grosu. Efficient Bidding for Virtual Machine Instances in Clouds. In CLOUD, July 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Q. Zhang, E. Gürses, R. Boutaba, and J. Xiao. Dynamic Resource Allocation for Spot Markets in Clouds. In HotICE, March 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. SpotOn: a batch computing service for the spot market

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SoCC '15: Proceedings of the Sixth ACM Symposium on Cloud Computing
        August 2015
        446 pages
        ISBN:9781450336512
        DOI:10.1145/2806777

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 27 August 2015

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        SoCC '15 Paper Acceptance Rate34of157submissions,22%Overall Acceptance Rate169of722submissions,23%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader