Abstract
Internet hosting centers serve multiple service sites from a common hardware base. This paper presents the design and implementation of an architecture for resource management in a hosting center operating system, with an emphasis on energy as a driving resource management issue for large server clusters. The goals are to provision server resources for co-hosted services in a way that automatically adapts to offered load, improve the energy efficiency of server clusters by dynamically resizing the active server set, and respond to power supply disruptions or thermal events by degrading service in accordance with negotiated Service Level Agreements (SLAs).Our system is based on an economic approach to managing shared server resources, in which services "bid" for resources as a function of delivered performance. The system continuously monitors load and plans resource allotments by estimating the value of their effects on service performance. A greedy resource allocation algorithm adjusts resource prices to balance supply and demand, allocating resources to their most efficient use. A reconfigurable server switching infrastructure directs request traffic to the servers assigned to each service. Experimental results from a prototype confirm that the system adapts to offered load and resource availability, and can reduce server energy usage by 29% or more for a typical Web workload.
Supplemental Material
Available for Download
Software for Managing energy and server resources in hosting centers
- 1 Tarek F. Abdelzaher and Chenyang Lu. Modeling and Performance Control of Interact Servers. In 39th IEEE Conference on Decision and Control, December 2000.]]Google Scholar
- 2 Tarek F. Abdelzaher, Kang G. Shin, and Nina Bhatti. Performance Guarantees for Web Server End-Systems: A Control-Theoreticni Approach. IEEE Transactions on Parallel and Distributed Systems, June 2001.]] Google ScholarDigital Library
- 3 Jeffrey Aman, Catherine K. Eilert, David Emmes, Peter Yocom, and Donna Dillenberger. Adaptive AIgorithrus for Managing a Distrubuted Data Processing Workload. IBM Systems Journal, 36(2), 1997.]] Google ScholarDigital Library
- 4 Darrell C. Anderson, Jeffrey S. Chase, and Amin M. Vahdat. Interposed Request Routing for Scalable Network Storage. In Proceedings of the Fourth Symposium on Operating System Design and Implementation ( OSD1), October 2000.]] Google ScholarDigital Library
- 5 Karen Appleby, Sameh Fakhouri, Liana Fong, German Goldszmidt, Michael Kalantar, Srirama Krishnakumar, Donald Pazel, John Pershing, and Benny Rochwerger. Oceano - SLA Based Management of a Computing Utility. In Proceedings of the 7th IFIP/IEEE International Symposium on Integrated Network Management, May 2001.]]Google Scholar
- 6 Martin Arlitt and Tal Jin. Workload Characterization of the 1998 World Cup Web Site. Technical Report HPL-1999-35R1, HI" Laboratories, September 1999. The trace is available from the Internet Traffic Archive at ita.ee.lbl.gov.]]Google Scholar
- 7 Armando Fox and Steven D. Gribble and Yatin Chawathe and Eric A. Brewer and Paul Ganthier. Cluster-based scalable network services. In Proceedings of the Sixteenth ACM Symposium on Operating System Principles (SOSP), pages 78-91, October 1997.]] Google ScholarDigital Library
- 8 Mohit Aron. Differentiated and Predictable Quality of Service in Web Server Systems. Phl) thesis, Department of Computer Science, Rice University, October 2000.]] Google ScholarDigital Library
- 9 Mohit Aron, Peter Druschel, and Willy Zwaenepoel. Cluster Reserves: A Mechanism for Resource Management in Cluster-based Network Servers. In Proceedings of the Joint International Conference on Measurement and Modeling of Computer Systems (ACM SIGMETRICS 2000), pages 90-101, June 2000.]] Google ScholarDigital Library
- 10 Gaurav Banga, Peter Druschel, and Jeffrey C. Mogul. Resource Containers: A New Facility for Resource Management in Server Systems. In Proceedings of the Third Symposium on Operating Systems Design and Implementation ( OSDI), February 1999.]] Google ScholarDigital Library
- 11 Paul Bafford and Mark E. Crovella. Generating Representative Web Workloads for Network and Server Performance Evaluation. In Proceedings of the ACM Conference on Measurement and Modeling of Computer Systems ( SIGMETRICS "98), pages 151-160, June 1998.]] Google ScholarDigital Library
- 12 Pat Bohrer, Elmootazbellah N. Elnozahy, Tom Keller, Michael Kistler, Charles Lefurgy, and Ray Rajamony. The Case for Power Management in Web Servers. In Power-Aware Computing (Robert Graybill and Rami Melhem, editors). Kluwer/Plenum series in Computer Science, to appear, January 2002.]] Google ScholarDigital Library
- 13 David Brooks and Margaret Martonosi. Dynamic Thermal Management for High-Performance Microprocessors. In Proceedings of the Seventh International Symposium on High-Performance Computer Architecture (HPCA-7), January 2001.]] Google ScholarDigital Library
- 14 Jeffrey S. Chase, Henry M. Levy, Michael J. Feeley, and Edward D. Lazowska. Sharing and Protection in a Single Address Space Operating System. ACM Transactions on Computer Systems, 12(4), November 1994.]] Google ScholarDigital Library
- 15 Fred Douglis, P. Krishnan, and Brian Bershad. Adaptive Disk Spin-down Policies for Mobile Computers. In 2nd USENIX Symposium on Mobile and Location-Independent Computing, April 1995. Monterey CA.]] Google ScholarDigital Library
- 16 Donald E Ferguson, Christos Nikolaou, Jakka Sairamesh, and Yechiam Yemidi. Economic Models for Allocating Resources in Computer Systems. In Market-Based Control: A Paradigm for Distributed Resource Allocation (Scott H. Clearwater, editor). World Scientific, 1996.]] Google ScholarDigital Library
- 17 Jason Flinn and Mahadev Satyanarayanan. Energy-aware Adaptation for Mobile Applications. In Proceedings of the Seventeenth Symposium on Operating Systems Principles (SOSP), pages 48-63, December 1999.]] Google ScholarDigital Library
- 18 Syam Gadde. The Proxycizer Web Proxy Tool Suite. http : //www. cs. duke. edu/ari/Proxycizer/.]]Google Scholar
- 19 Toshihide Ibaraki and Naoki Katoh, editors. Resource Allocation Problems: Algorithmic Approaches. MIT Press, Cambridge, MA, 1988.]] Google ScholarDigital Library
- 20 Anm Iyengar, Jim Challenger, Daniel Dias, and Paul Dantzig. High-performance Web Site Design Techniques. IEEE Internet Computing, 4(2):17-26, March 2000.]] Google ScholarDigital Library
- 21 Van Jacobsen. Congestion Avoidance and Control. ACM Computer Communication Review: Proceedings of the SIGCOMM Symposium, 18(4):314-329, August 1988.]] Google ScholarDigital Library
- 22 Sugib Jamin, Peter B. Danzig, Scott J. Shanker, and Lixia Zhang. A Measurement-based Admission Control Algorithm for Integrated Services Packet Networks. IEEE,/ACM Transactions on Networking, 5(1):56-70, February 1997.]] Google ScholarDigital Library
- 23 Sugih Jamin, Scott J. Shenker, and Peter B. Danzig. Comparison of Measurement-based Admission Control Algorithms for Controlled-Load Service. In Proceedings of lEEE lnfocom 1997, April 1997.]] Google ScholarDigital Library
- 24 Michael B. Jones, Daniela Rosu, and Marcel-Catalin Rosu. CPU Reservations and Time Constraints: Efficient, Predictable Scheduling of Independent Activities. In Proceedings of the 16th ACM Symposium on Operating Systems Principles, pages 198-211, October 1997.]] Google ScholarDigital Library
- 25 Minkyong Kim and Brian Noble. Mobile Network Estimation. In Proceedings of the Seventh Annual Conference on Mobile Computing and Networking, July 2001.]] Google ScholarDigital Library
- 26 Alvin R. Lebeck, Xiaobo Fan, Heng Zeng, and Carla S. Ellis. Power-Aware Page Allocation. In Proceedings of the Ninth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS IX), 2000.]] Google ScholarDigital Library
- 27 Kelvin Li and Sugih Jamin. A Meaanrement-Based Admission-Controlled Web Server. In Proceedings oflEEE lnfocom 2000, March 2000.]]Google Scholar
- 28 Lee W. McKnight and Joseph P. Bailey, editors, lnternet Economics. MIT Press, Cambridge, MA, 1997.]] Google ScholarDigital Library
- 29 Jennifer D. Mitebell-Jacksun. Energy Needs in an Interuet Economy: A Closer Look at Data Centers. Master's thesis, Energy and Resources Group, University of California at Berkeley, July 2001.]]Google Scholar
- 30 Roll Neugebaner and Derek McAuley. Energy is Just Another Resource: Energy Accounting and Energy Pricing in the Nemesis OS. In Proceedings of the Eighth IEEE Workshop on Hot Topics in Operating Systems HerOS-VIII, pages 59--64, May 2001.]] Google ScholarDigital Library
- 31 Brian D. Noble, M. Satyanarayanan, Dushyanth Narayanan, James Eric Tiltun, Jason Flinn, and Kevin R. Walker. Agile Application-Aware Adaptation for Mobility. In Proceedings of the Sixteenth ACM Symposium on Operating Systems Principles ( SOSP ), pages 276--287, Saint Male, France, October 1997.]] Google ScholarDigital Library
- 32 Vivek S. Pal, Mohit Area, Gaurav Banga, Michael Svendsen, Peter Drnschel, Willy Zwaenopoel, and Erich Nahum. Locality-Aware Request Distribution in Clnster-based Network Servers. In Proceedings of the Eighth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS VIII), October 1998.]] Google ScholarDigital Library
- 33 Eduardo Pinheiro, Ricardo Bianchini, Eurique V. Carrera, and Taliver Heath. Load Balancing and Unbalancing for Power and Performance in Cluster-Based Systems. Technical Report DCS-TR-440, Department of Computer Science, Rutgers University, May 2001.]]Google Scholar
- 34 John Reumann, Ashish Mehra, Kang G. Shin, and Dilip Kandlur. Virtual Services: A New Abstraction for Server Consolidation. In Proceedings of the USENIX 2000 Technical Conference, June 2000.]] Google ScholarDigital Library
- 35 Timothy Roscoe and Prashant Shenoy. New Resource Control Issues in Shared Clusters. In Proceedings of the Eight International Workshop on Interactive Distributed Multimedia Systems ( IDMS'01), September 2001.]] Google ScholarDigital Library
- 36 Yasnshi Saito, Brian N. Bershad, and Henry M. Levy. Manageability, Availability and Performance in Porcupine: A Highly Scalable Cluster-Based Mail Service. In Proceedings of the 17th ACM Symposium on Operating Systems Principles (SOSP), pages 1-15, Kiawah Island, December 1999.]] Google ScholarDigital Library
- 37 Barry C. Smith, John F. Leimkuhler, and Ross M. Darrow. Yield Management at American Airlines. Interfaces, 22(1), January 1992.]]Google Scholar
- 38 David C. Steere, Ashvin Goel, Joshua Gmenberg, Dylan McNamee, Calton Pu, and Jonathan Walpole. A Feedback-driven Proportion Allocator for Real-Rate Scheduling. In Proceedings of the Third Symposium on Operating Systems Design and Implementation (OSDI), pages 145-158, February 1999.]] Google ScholarDigital Library
- 39 Michael Stonebraker, Paul M. Aoki, Witold Litwin, Avi Pfeffer, Adam Sah, Jeff Sidell, Carl Staelin, and Andrew Yu. Mariposa: A Wide-Area Distributed Database System. VLDB Journal: Very Large Data Bases, 5(1):48--63, 1996.]] Google ScholarDigital Library
- 40 David G. Sullivan and Margo I. Seltzer. Isolation with Flexibility: A Resource Management Framework for Central Servers. In Proceedings of the 2000 USENIX Annual Technical Conference, pages 337-350, June 2000.]] Google ScholarDigital Library
- 41 Kevin Thompson, Gregory J. Miller, and Rick Wilder. Wide-Area lnternet Traffic Patterns and Characteristics. In IEEENetwork, November 1997.]]Google Scholar
- 42 Amin Vahdat, Thomas Anderson, Michael Dahlia, Eshwar Belani, David Culler, Paul Eastham, and Chad Yoshikawa. WebOS: Operating System Services for Wide-Area Applications. In Proceedings of the Seventh IEEE Symposium on High Performance Distributed Computing (HPDC), Chicago, Illinois, July 1998.]] Google ScholarDigital Library
- 43 Amin Vahdat, Alvin R. Lebeek, and Carla S. Ellis. Every Joule is Precious: The Case for Revisiting Operating System Design for Energy Efficiency. In Proceedings of the 9th ACM SIGOPS European Workshop, September 2000.]] Google ScholarDigital Library
- 44 Ben Verghese, Anoop Gupta, and Mendel Rosenblum. Performance Isolation: Sharing and Isolation in Shared Memory Multiprocessors. In Proceedings of the 8th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), October 1998.]] Google ScholarDigital Library
- 45 Carl A. Waldspurger, Tad Hogg, Bernardo A. Huberman, Jeffrey O. Kephart, and W. Scott Stornetta. Spawn: A Distributed Computational Economy. IEEE Transactions on Software Engineering, 18(2): 103-117, February 1992.]] Google ScholarDigital Library
- 46 Carl A. Waldspurger and William E. Weihl. Lottery Scheduling: Flexible Proportional-Share Resource Management. In Proceedings of the First Symposium on Operating Systems Design and Implementation (OSDI), pages 1-11, November 1994.]] Google ScholarDigital Library
- 47 Huiean Zhu, Hung Tang, and Tao Yang. Demand-driven Service Differentiation in Cluster-Based Network Servers. In Proceedings of IEEE Infocom 2001, April 2001.]]Google Scholar
Index Terms
- Managing energy and server resources in hosting centers
Recommendations
Managing server energy and operational costs in hosting centers
SIGMETRICS '05: Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systemsThe growing cost of tuning and managing computer systems is leading to out-sourcing of commercial services to hosting centers. These centers provision thousands of dense servers within a relatively small real-estate in order to host the applications/...
Managing energy and server resources in hosting centers
SOSP '01: Proceedings of the eighteenth ACM symposium on Operating systems principlesInternet hosting centers serve multiple service sites from a common hardware base. This paper presents the design and implementation of an architecture for resource management in a hosting center operating system, with an emphasis on energy as a driving ...
On Honey Bees and Dynamic Server Allocation in Internet Hosting Centers
Internet centers host services for e-banks, e-auctions and other clients. Hosting centers then must allocate servers among clients to maximize revenue. The limited number of servers, costs of reallocating servers, and unpredictability of requests make ...
Comments