skip to main content
article

Mercury and freon: temperature emulation and management for server systems

Authors Info & Claims
Published:20 October 2006Publication History
Skip Abstract Section

Abstract

Power densities have been increasing rapidly at all levels of server systems. To counter the high temperatures resulting from these densities, systems researchers have recently started work on softwarebased thermal management. Unfortunately, research in this new area has been hindered by the limitations imposed by simulators and real measurements. In this paper, we introduce Mercury, a software suite that avoids these limitations by accurately emulating temperatures based on simple layout, hardware, and componentutilization data. Most importantly, Mercury runs the entire software stack natively, enables repeatable experiments, and allows the study of thermal emergencies without harming hardware reliability. We validate Mercury using real measurements and a widely used commercial simulator. We use Mercury to develop Freon, a system that manages thermal emergencies in a server cluster without unnecessary performance degradation. Mercury will soon become available from http://www.darklab.rutgers.edu.

References

  1. D. Anderson, J. Dykes, and E. Riedel. More than an Interface - SCSI vs. ATA. In Proceedings of FAST, March 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. F. Bellosa, S. Kellner, M.Waitz, and A.Weissel. Event-Driven Energy Accounting of Dynamic Thermal Management. In Proceedings of COLP, September 2003.]]Google ScholarGoogle Scholar
  3. D. Brooks and M. Martonosi. Dynamic Thermal Management for High-Performance Microprocessors. In Proceedings of HPCA, January 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Chase, D. Anderson, P. Thackar, A. Vahdat, and R. Boyle. Managing Energy and Server Resources in Hosting Centers. In Proceedings of SOSP, October 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Chen, A. Das, W. Qin, A. Sivasubramaniam, Q. Wang, and N. Gautam. Managing Server Energy and Operational Costs in Hosting Centers. In Proceedings of Sigmetrics, June 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. E.N. Elnozahy, M. Kistler, and R. Rajamony. Energy-Efficient Server Clusters. In Proceedings of PACS, February 2002.]]Google ScholarGoogle Scholar
  7. Ericsson Microelectronics. Reliability Aspects on Power Supplies. Technical Report Design Note 002, April 2000.]]Google ScholarGoogle Scholar
  8. W. Felter, K. Rajamani, T. Keller, and C. Rusu. A Performance-Conserving Approach for Reducing Peak Power Consumption in Server Systems. In Proceedings of ICS, June 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Fluent. Fluent: The Right Answer in CFD. http://www.fluent.com/-index.htm.]]Google ScholarGoogle Scholar
  10. E. Gansner, E. Koutsofois, and S. North. Drawing Graphs with Dot. http://www.graphviz.org/Documentation/dotguide.pdf.]]Google ScholarGoogle Scholar
  11. S. Gurumurthi, A. Sivasubramaniam, M. Kandemir, and H. Franke. DRPM: Dynamic Speed Control for Power Management in Server Class Disks. In Proceedings of ISCA, June 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Gurumurthi, A. Sivasubramaniam, and V. Natarajan. Disk Drive Roadmap from the Thermal Perspective: A Case for Dynamic Thermal Management. In Proceedings of ISCA, June 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. T. Heath, A.P. Centeno, P. George, L. Ramos, Y. Jaluria, and R. Bianchini. Mercury and Freon: Temperature Emulation and Management for Server Systems. Technical Report DCS-TR-596, Department of Computer Science, Rutgers University, January 2006, Revised July 2006.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. T. Heath, B. Diniz, E.V. Carrera, W. Meira Jr., and R. Bianchini. Energy Conservation in Heterogeneous Server Clusters. In Proceedings of PPoPP, June 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. Icoz and Y. Jaluria. Numerical Simulation of Boundary Conditions and the Onset of Instability in Natural Convection Due to Protruding Thermal Sources in an Open Rectangular Channel. Numerical Heat Transfer, 48:831--847, 2005.]]Google ScholarGoogle ScholarCross RefCross Ref
  16. Y. Kim, S. Gurumurthi, and A. Sivasubramaniam. Understanding the Performance-Temperature Interactions in Disk I/O of Server Workloads. In Proceedings of HPCA, February 2006.]]Google ScholarGoogle Scholar
  17. Y. Li, D. Brooks, Z. Hu, and K. Skadron. Performance, Energy, and Thermal Considerations for SMT and CMP Architectures. In Proceedings of HPCA, February 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Merkel and F. Bellosa. Balancing Power Consumption in Multiprocessor Systems. In Proceedings of Eurosys, April 2006.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Moore, J. Chase, and P. Ranganathan. ConSil: Low-Cost Thermal Mapping of Data Centers. In Proceedings of SysML, June 2006.]]Google ScholarGoogle Scholar
  20. J. Moore, J. Chase, and P. Ranganathan. Weatherman: Automated, Online, and Predictive Thermal Mapping and Management for Data Centers. In Proceedings of ICAC, June 2006.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Moore, J. Chase, P. Ranganathan, and R. Sharma. Making Scheduling Cool: Temperature-Aware Resource Assignment in Data Centers. In Proceedings of USENIX, April 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. E. Pinheiro, R. Bianchini, E. Carrera, and T. Heath. Dynamic Cluster Reconfiguration for Power and Performance. In L. Benini, M. Kandemir, and J. Ramanujam, editors, Compilers and Operating Systems for Low Power. Kluwer Academic Publishers, August 2003. Earlier version published in Proceedings of COLP 2001, September 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Powell, M. Gomaa, and T.N. Vijaykumar. Heat-and-Run: Leveraging SMT and CMP to Manage Power Density Through the Operating System. In Proceedings of ASPLOS, October 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. F.K. Price. Heat Transfer and Fluid Flow Data Books. Genium Publishing Corp., 1990.]]Google ScholarGoogle Scholar
  25. K. Rajamani and C. Lefurgy. On Evaluating Request-Distribution Schemes for Saving Energy in Server Clusters. In Proceedings of ISPASS, March 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. E. Rohou and M.D. Smith. Dynamically Managing Processor Temperature and Power. In Proceedings of FDO, November 1999.]]Google ScholarGoogle Scholar
  27. C. Rusu, A. Ferreira, C. Scordino, A. Watson, R. Melhem, and D. Mosse. Energy-Efficient Real-Time Heterogeneous Server Clusters. In Proceedings of RTAS, April 2006.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. L. Shang, L.-S. Peh, A. Kumar, and N. Jha. Characterization and Management of On-Chip Networks. In Proceedings of Micro, December 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. R. Sharma, C. Bash, C. Patel, R. Friedrich, and J. Chase. Balance of Power: Dynamic Thermal Management for Internet Data Centers. IEEE Internet Computing, 9(1), January 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. K. Skadron, M. Stan, W. Huang, S. Velusamy, K. Sankaranarayanan, and D. Tarjan. Temperature-Aware Microarchitecture. In Proceedings of ISCA, June 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. J. Srinivasan, S. Adve, P. Bose, and J. Rivers. The Case for Lifetime Reliability-Aware Microprocessors. In Proceedings of ISCA, June 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. A. Weissel and F. Bellosa. Dynamic Thermal Management for Distributed Systems. In Proceedings of TACS, June 2004.]]Google ScholarGoogle Scholar
  33. W. Zhang. Linux Virtual Server for Scalable Network Services. In Proceedings of the Linux Symposium, July 2000.]]Google ScholarGoogle Scholar

Index Terms

  1. Mercury and freon: temperature emulation and management for server systems

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM SIGOPS Operating Systems Review
      ACM SIGOPS Operating Systems Review  Volume 40, Issue 5
      Proceedings of the 2006 ASPLOS Conference
      December 2006
      425 pages
      ISSN:0163-5980
      DOI:10.1145/1168917
      Issue’s Table of Contents
      • cover image ACM Conferences
        ASPLOS XII: Proceedings of the 12th international conference on Architectural support for programming languages and operating systems
        October 2006
        440 pages
        ISBN:1595934510
        DOI:10.1145/1168857

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 October 2006

      Check for updates

      Qualifiers

      • article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader