Skip to main content
Top
Published in: SICS Software-Intensive Cyber-Physical Systems 3-4/2014

01-08-2014 | Special Issue Paper

Automatic detection of power bottlenecks in parallel scientific applications

Authors: María Barreda, Sandra Catalán, Manuel F. Dolz, Rafael Mayo, Enrique S. Quintana-Ortí

Published in: SICS Software-Intensive Cyber-Physical Systems | Issue 3-4/2014

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper we present an extension of the pmlib framework for power-performance analysis that permits a rapid and automatic detection of power sinks during the execution of concurrent scientific workloads. The extension is shaped in the form of a multithreaded Python module that offers high reliability and flexibility, rendering an overall inspection process that introduces low overhead. Additionally, we investigate the advantages and drawbacks of the RAPL power model, introduced in the Intel Xeon “Sandy-Bridge” CPU, versus a data acquisition system from National Instruments.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Computer Science - Research and Development

Computer Science – Research and Development (CSRD), formerly Informatik – Forschung und Entwicklung (IFE), is a quarterly international journal that publishes high-quality research and survey papers from the Software Engineering & Systems area.

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Show more products
Literature
1.
2.
go back to reference Aliaga JI, Bollhöfer M, Martín AF, Quintana-Ortí ES (2011) Exploiting thread-level parallelism in the iterative solution of sparse linear systems. Parallel Comput 37(3):183–202 CrossRefMATHMathSciNet Aliaga JI, Bollhöfer M, Martín AF, Quintana-Ortí ES (2011) Exploiting thread-level parallelism in the iterative solution of sparse linear systems. Parallel Comput 37(3):183–202 CrossRefMATHMathSciNet
3.
go back to reference Aliaga JI, Dolz MF, Martín AF, Mayo R, Quintana-Ortí ES (2012) Leveraging task-parallelism in energy-efficient ILU preconditioners. In: 2nd int con on ICT as key technology against global warming—ICT-GLOW. Lecture notes in computer science, vol 7453, pp 55–63 CrossRef Aliaga JI, Dolz MF, Martín AF, Mayo R, Quintana-Ortí ES (2012) Leveraging task-parallelism in energy-efficient ILU preconditioners. In: 2nd int con on ICT as key technology against global warming—ICT-GLOW. Lecture notes in computer science, vol 7453, pp 55–63 CrossRef
4.
go back to reference Alonso P, Badia RM, Labarta J, Barreda M, Dolz MF, Mayo R, Quintana-Ortí ES, Reyes R (2012) Tools for power-energy modelling and analysis of parallel scientific applications. In: 41st int conf on parallel processing—ICPP, pp 420–429 Alonso P, Badia RM, Labarta J, Barreda M, Dolz MF, Mayo R, Quintana-Ortí ES, Reyes R (2012) Tools for power-energy modelling and analysis of parallel scientific applications. In: 41st int conf on parallel processing—ICPP, pp 420–429
5.
go back to reference Alonso P, Dolz MF, Igual FD, Mayo R, Quintana-Ortí ES (2012) Reducing energy consumption of dense linear algebra operations on hybrid CPU-GPU platforms. In: Proc 10th IEEE int symp on parallel and distributed processing with applications—ISPA 2012, pp 56–62 CrossRef Alonso P, Dolz MF, Igual FD, Mayo R, Quintana-Ortí ES (2012) Reducing energy consumption of dense linear algebra operations on hybrid CPU-GPU platforms. In: Proc 10th IEEE int symp on parallel and distributed processing with applications—ISPA 2012, pp 56–62 CrossRef
6.
go back to reference Alonso P, Dolz MF, Igual FD, Quintana-Ortí ES, Mayo R (2013) Runtime scheduling of the LU factorization: performance and energy. In: Proc energy efficiency in large scale distributed systems conference—EE-LSDS 2013 (to appear) Alonso P, Dolz MF, Igual FD, Quintana-Ortí ES, Mayo R (2013) Runtime scheduling of the LU factorization: performance and energy. In: Proc energy efficiency in large scale distributed systems conference—EE-LSDS 2013 (to appear)
8.
go back to reference Barreda M, Barrachina S, Catalán S, Dolz MF, Fabregat G, Mayo R, Quintana ES (2013) A framework for power-performance analysis of parallel scientific applications. In: Third int conference on smart grids, green communications and IT energy-aware technologies—Energy 2013, pp 114–119 Barreda M, Barrachina S, Catalán S, Dolz MF, Fabregat G, Mayo R, Quintana ES (2013) A framework for power-performance analysis of parallel scientific applications. In: Third int conference on smart grids, green communications and IT energy-aware technologies—Energy 2013, pp 114–119
10.
go back to reference Castillo M, Fernández JC, Mayo R, Quintana-Ortí ES, Roca V (2012) Analysis of strategies to save energy for message-passing dense linear algebra kernels. In: Proc 20th euromicro conference on parallel, distributed and network based processing, pp 346–352 Castillo M, Fernández JC, Mayo R, Quintana-Ortí ES, Roca V (2012) Analysis of strategies to save energy for message-passing dense linear algebra kernels. In: Proc 20th euromicro conference on parallel, distributed and network based processing, pp 346–352
11.
go back to reference Dongarra J et al. (2011) The international Exascale software project roadmap. Int J High Perform Comput Appl 25(1):3–60 CrossRef Dongarra J et al. (2011) The international Exascale software project roadmap. Int J High Perform Comput Appl 25(1):3–60 CrossRef
13.
go back to reference El Mehdi Diouri M, Dolz MF, Glück O, Lefèvre L, Alonso P, Catalán S, Mayo R, Quintana-Ortí ES (2013) Solving some mysteries in power monitoring of servers: take care of your wattmeters! In: Proc energy efficiency in large scale distributed systems conference—EE-LSDS 2013 (to appear) El Mehdi Diouri M, Dolz MF, Glück O, Lefèvre L, Alonso P, Catalán S, Mayo R, Quintana-Ortí ES (2013) Solving some mysteries in power monitoring of servers: take care of your wattmeters! In: Proc energy efficiency in large scale distributed systems conference—EE-LSDS 2013 (to appear)
14.
go back to reference HP Corp, Intel Corp, Microsoft Corp, Phoenix Tech Ltd, Toshiba Corp (2011) Advanced configuration and power interface specification, revision 5.0 HP Corp, Intel Corp, Microsoft Corp, Phoenix Tech Ltd, Toshiba Corp (2011) Advanced configuration and power interface specification, revision 5.0
15.
go back to reference Intel Corp (2012) Intel 64 and IA-32 architectures software developer manual Intel Corp (2012) Intel 64 and IA-32 architectures software developer manual
18.
go back to reference Knüpfer A, Brunst H et al. (2008) The vampir performance analysis tool-set. In: Tools for high performance computing, pp 139–155 CrossRef Knüpfer A, Brunst H et al. (2008) The vampir performance analysis tool-set. In: Tools for high performance computing, pp 139–155 CrossRef
19.
go back to reference Kunkel J (2011) HDTrace—a tracing and simulation environment of application and system interaction. Tech Rep 2, Department of Informatics, Scientific Computing. Universität Hamburg Kunkel J (2011) HDTrace—a tracing and simulation environment of application and system interaction. Tech Rep 2, Department of Informatics, Scientific Computing. Universität Hamburg
21.
go back to reference NVIDIA Corporation (2009) NVIDIA CUDA compute unified device architecture programming guide, 2.3.1 edn. NVIDIA Corporation (2009) NVIDIA CUDA compute unified device architecture programming guide, 2.3.1 edn.
23.
go back to reference Pillet V, Labarta J, Cortes T, Girona S (1995) Paraver: a tool to visualize and analyze parallel code. In: 18th world OCCAM and transputer user group technical meeting Pillet V, Labarta J, Cortes T, Girona S (1995) Paraver: a tool to visualize and analyze parallel code. In: 18th world OCCAM and transputer user group technical meeting
24.
25.
go back to reference Quintana-Ortí G, Quintana-Ortí E, van de Geijn R, Zee FV, Chan E (2009) Programming matrix algorithms-by-blocks for thread-level parallelism. ACM Trans Math Softw 36(3):14:1–14:26 CrossRef Quintana-Ortí G, Quintana-Ortí E, van de Geijn R, Zee FV, Chan E (2009) Programming matrix algorithms-by-blocks for thread-level parallelism. ACM Trans Math Softw 36(3):14:1–14:26 CrossRef
26.
go back to reference Saxe E (2010) Power-efficient software. In: ACM queue Saxe E (2010) Power-efficient software. In: ACM queue
27.
go back to reference Servat H, Llort G Extrae user guide manual for version 2.1.1 Servat H, Llort G Extrae user guide manual for version 2.1.1
28.
go back to reference Shende SS, Malony AD (2006) The tau parallel performance system. Int J High Perform Comput Appl 20(2):287–311 CrossRef Shende SS, Malony AD (2006) The tau parallel performance system. Int J High Perform Comput Appl 20(2):287–311 CrossRef
Metadata
Title
Automatic detection of power bottlenecks in parallel scientific applications
Authors
María Barreda
Sandra Catalán
Manuel F. Dolz
Rafael Mayo
Enrique S. Quintana-Ortí
Publication date
01-08-2014
Publisher
Springer Berlin Heidelberg
Published in
SICS Software-Intensive Cyber-Physical Systems / Issue 3-4/2014
Print ISSN: 2524-8510
Electronic ISSN: 2524-8529
DOI
https://doi.org/10.1007/s00450-013-0242-8

Other articles of this Issue 3-4/2014

SICS Software-Intensive Cyber-Physical Systems 3-4/2014 Go to the issue

Premium Partner