ABSTRACT
Dynamic resource provisioning aims at maintaining the end-to-end response time of a web application within a pre-defined SLA. Although the topic has been well studied for monolithic applications, provisioning resources for applications composed of multiple services remains a challenge. When the SLA is violated, one must decide which service(s) should be reprovisioned for optimal effect. We propose to assign an SLA only to the front-end service. Other services are not given any particular response time objectives. Services are autonomously responsible for their own provisioning operations and collaboratively negotiate performance objectives with each other to decide the provisioning service(s). We demonstrate through extensive experiments that our system can add/remove/shift both servers and caches within an entire multi-service application under varying workloads to meet the SLA target and improve resource utilization.
- T. F. Abdelzaher, K. G. Shin, and N. Bhatti. Performance guarantees for Web server end-systems: a control-theoretical application. IEEE Transactions on Parallel and Distributed Systems, 13(1), 2002. Google ScholarDigital Library
- J. Almeida, V. Almeida, D. Ardagna, C. Francalanci, and M. Trubian. Resource management in the autonomic service-oriented architecture. In Proc. ICAC, 2006. Google ScholarDigital Library
- DAS3: The Distributed ASCI Supercomputer 3. http://www.cs.vu.nl/das3/.Google Scholar
- R. P. Doyle, J. S. Chase, O. M. Asad, W. Jin, and A. M. Vahdat. Model-based resource provisioning in a Web service utility. In Proc. USITS, 2003. Google ScholarDigital Library
- J. Gray and W. Vogels. A conversation with Werner Vogels. ACM Queue, 4(4), 2006. Google ScholarDigital Library
- N. J. Gunther. Analyzing Computer System Performance with Perl::PDQ. Springer, 2005. Google ScholarDigital Library
- A. Kamra, V. Misra, and E. M. Nahum. Yaksha: a self-tuning controller for managing the performance of 3-tiered Web sites. In Proc. Intl. Workshop on Quality of Service, 2004.Google ScholarCross Ref
- D. Karger, E. Lehman, T. Leighton, M. Levine, D. Lewin, and R. Panigrahy. Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web. In Proc. ACM Symposium on Theory of Computing, 1997. Google ScholarDigital Library
- H. A. Lagar-Cavilla, J. A. Whitney, A. Scannell, P. Patchin, S. M. Rumble, E. de Lara, M. Brudno, and M. Satyanarayanan. SnowFlock: rapid virtual machine cloning for cloud computing. In Proc. EuroSys, 2009. Google ScholarDigital Library
- S. Ranjan, J. Rolia, H. Fu, and E. Knightly. QoS-driven server migration for Internet data centers. In Proc. Intl. Workshop on Quality of Service, 2002.Google ScholarCross Ref
- R. Shoup. eBay's architectural principles. http://jaoo.dk/london-2008/file?path=/qcon-london-2008/slides/RandyShoup_eBaysArchitecturalPrinciples.pdf.Google Scholar
- S. Sivasubramanian. Scalable hosting of web applications. PhD thesis, VU University Amsterdam, Netherlands, 2007.Google Scholar
- K. S. Trivedi. Probability and Statistics with Reliability, Queuing and Computer Science Applications. John Wiley and Sons, 2001. Google ScholarDigital Library
- B. Urgaonkar, G. Pacifici, P. Shenoy, M. Spreitzer, and A. Tantawi. An analytical model for multi-tier Internet services and its applications. In Proc. SIGMETRICS, 2005. Google ScholarDigital Library
- B. Urgaonkar, P. Shenoy, A. Chandra, P. Goyal, and T. Wood. Agile dynamic provisioning of multi-tier Internet applications. ACM Transactions on Autonomous Adaptive Systems, 3(1), 2008. Google ScholarDigital Library
- T. Vercauteren, P. Aggarwal, X. Wang, and T.-H. Li. Hierarchical forecasting of Web server workload using sequential Monte Carlo training. In Proc. Conf. on Information Sciences and Systems, 2006.Google ScholarCross Ref
- D. Villela, P. Pradhan, and D. Rubenstein. Provisioning servers in the application tier for e-commerce systems. ACM Transactions on Internet Technology, 7(1), 2007. Google ScholarDigital Library
- B. Y. Wu, C. H. Chi, and Z. Chen. Resource allocation based on workflow for enhancing the performance of composite service. In Proc. SCC, 2007.Google ScholarCross Ref
Index Terms
- Autonomous resource provisioning for multi-service web applications
Recommendations
Edge Provisioning and Fairness in VPN-DiffServ Networks
Customers of Virtual Private Networks (VPNs) over Differentiated Services (DiffServ) infrastructure are most likely to demand not only security but also guaranteed Quality-of-Service (QoS) in pursuance of their desire to have leased-line-like services. ...
Proactive resource provisioning
Allocating resources in networks to QoS flows may require undesirable delays or costs. We consider a dynamic Service Level Agreement (SLA) negotiation scheme between peer autonomous systems (ASes) that implement DiffServ per domain behaviors. For ...
Resource provisioning in SLA-based cluster computing
JSSPP'10: Proceedings of the 15th international conference on Job scheduling strategies for parallel processingCluster computing is excellent for parallel computation. It has become increasingly popular. In cluster computing, a service level agreement (SLA) is a set of quality of services (QoS) and a fee agreed between a customer and an application service ...
Comments