skip to main content
10.1145/1810479.1810510acmconferencesArticle/Chapter ViewAbstractPublication PagesspaaConference Proceedingsconference-collections
research-article

Towards optimizing energy costs of algorithms for shared memory architectures

Published:13 June 2010Publication History

ABSTRACT

Energy consumption by computer systems has emerged as an important concern. However, the energy consumed in executing an algorithm cannot be inferred from its performance alone: it must be modeled explicitly. This paper analyzes energy consumption of parallel algorithms executed on shared memory multicore processors. Specifically, we develop a methodology to evaluate how energy consumption of a given parallel algorithm changes as the number of cores and their frequency is varied. We use this analysis to establish the optimal number of cores to minimize the energy consumed by the execution of a parallel algorithm for a specific problem size while satisfying a given performance requirement. We study the sensitivity of our analysis to changes in parameters such as the ratio of the power consumed by a computation step versus the power consumed in accessing memory. The results show that the relation between the problem size and the optimal number of cores is relatively unaffected for a wide range of these parameters.

References

  1. http://www.greenpeace.org/raw/content/ international/press/reports/make-it-green-cloud-computing.pdf. Greenpeace International, 2010.Google ScholarGoogle Scholar
  2. A. Aggarwal and S. Vitter, Jeffrey. The input/output complexity of sorting and related problems. Communications of ACM, 31(9):1116--1127, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. L. Arge, M. T. Goodrich, M. J. Nelson, and N. Sitchinava. Fundamental parallel algorithms for private-cache chip multiprocessors. In SPAA, pages 197--206, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. A. Bender, J. T. Fineman, S. Gilbert, and B. C. Kuszmaul. Concurrent cache-oblivious b-trees. In SPAA, pages 228--237, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. B. D. Bingham and M. R. Greenstreet. Computation with energy-time trade-offs: Models, algorithms and lower-bounds. In ISPA, pages 143--152, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. G. E. Blelloch, R. A. Chowdhury, P. B. Gibbons, V. Ramachandran, S. Chen, and M. Kozuch. Provably good multicore cache performance for divide-and-conquer algorithms. In SODA, pages 501--510, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Chandrakasan, S. Sheng, and R. Brodersen. Low-power cmos digital design. IEEE Journal of Solid-State Circuits, 27(4):473--484, 1992.Google ScholarGoogle ScholarCross RefCross Ref
  8. S. Cho and R. Melhem. Corollaries to Amdahl's Law for Energy. Computer Architecture Letters, 7(1):25--28, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Curtis-Maury, A. Shah, F. Blagojevic, D. Nikolopoulos, B. de Supinski, and M. Schulz. Prediction Models for Multi-dimensional Power-Performance Optimization on Many Cores. In International Conference on Parallel architectures and compilation techniques, pages 250--259, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. R. Ge, X. Feng, and K. Cameron. Performance-Constrained Distributed DVS Scheduling for Scientific Applications on Power-Aware Clusters. In International Conference on Supercomputing. IEEE Computer Society Washington, DC, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Gonzalez and M. A. Horowitz. Energy dissipation in general purpose microprocessors. IEEE Journal of Solid-State Circuits, 31:1277--1284, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  12. M. T. Goodrich. Communication-efficient parallel sorting. SIAM Journal of Computing, 29(2):416--432, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. C. Isci, A. Buyuktosunoglu, C. Cher, P. Bose, and M. Martonosi. An Analysis of Efficient Multi-Core Global Power Management Policies: Maximizing Performance for a Given Power Budget. In International Symposium on Microarchitecture, volume 9, pages 347--358, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. M. Karp and V. Ramachandran. Parallel algorithms for shared-memory machines. Handbook of Theoretical Computer Science (vol. A): Algorithms and Complexity, pages 869--941, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. V. A. Korthikanti and G. Agha. Analysis of parallel algorithms for energy conservation in scalable multicore architectures. In ICPP, pages 212--219, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. V. Kumar, A. Grama, A. Gupta, and G. Karypis. Introduction to Parallel Computing: Design and Analysis of Algorithms. Benjamin-Cummings Publishing Co., Inc. Redwood City, CA, USA, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. E. Ladner and M. J. Fischer. Parallel prefix computation. ACM Journal, 27(4):831--838, 1980. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Li and J. Martinez. Power-Performance Considerations of Parallel Computing on Chip Multiprocessors. ACM Transactions on Architecture and Code Optimization, 2(4):1--25, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Li and J. Martinez. Dynamic Power-Performance Adaptation of Parallel Computation on Chip Multiprocessors. In International Symposium on High-Performance Computer Architecture, pages 77--87, 2006.Google ScholarGoogle Scholar
  20. A. J. Martin. Towards an energy complexity of computation. Information Processing Letters, 39:181--187, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. P. Mills. The internet begins with coal. Green Earth Society, USA, 1999.Google ScholarGoogle Scholar
  22. R. Murphy. On the effects of memory latency and bandwidth on supercomputer application performance. IEEE International Symposium on Workload Characterization, pages 35--43, Sept. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. S. Park, W. Jiang, Y. Zhou, and S. Adve. Managing Energy-Performance Tradeoffs for Multithreaded Applications on Multiprocessor Architectures. In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, pages 169--180. ACM New York, NY, USA, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. X. Wang and S. Ziavras. Performance-Energy Tradeoffs for Matrix Multiplication on FPGA-Based Mixed-Mode Chip Multiprocessors. In International Symposium on Quality Electronic Design, pages 386--391, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Towards optimizing energy costs of algorithms for shared memory architectures

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SPAA '10: Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
        June 2010
        378 pages
        ISBN:9781450300797
        DOI:10.1145/1810479

        Copyright © 2010 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 13 June 2010

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate447of1,461submissions,31%

        Upcoming Conference

        SPAA '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader