Skip to main content
Top
Published in: Cluster Computing 4/2014

01-12-2014

Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems

Authors: José I. Aliaga, María Barreda, Manuel F. Dolz, Alberto F. Martín, Rafael Mayo, Enrique S. Quintana-Ortí

Published in: Cluster Computing | Issue 4/2014

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We investigate the benefits that an energy-aware implementation of the runtime in charge of the concurrent execution of ILUPACK—a sophisticated preconditioned iterative solver for sparse linear systems—produces on the time-power-energy balance of the application. Furthermore, to connect the experimental results with the theory, we propose several simple yet accurate power models that capture the variations of average power that result from the introduction of the energy-aware strategies as well as the impact of the P-states into ILUPACK’s runtime, at high accuracy, on two distinct platforms based on multicore technology from AMD and Intel.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
A ratio of 1 EXAFLOPS=\(10^{18}\) floating-point arithmetic operations, or flops, per second.
 
2
On a separate experiment [21], it was determined that the aggregate power supplied by the 3-Volt and 5-Volt lines during the execution of ILUPACK on these two platforms remains practically constant and, furthermore, it is negligible compared with that measured from the 12-Volt lines.
 
Literature
1.
go back to reference Duranton, M. et al.: The HiPEAC vision for advanced computing in horizon 2020. (2013). Duranton, M. et al.: The HiPEAC vision for advanced computing in horizon 2020. (2013).
2.
go back to reference Feng, W.C., Feng, X., Ge, R.: Green supercomputing comes of age. IT Professional, 10(1), 17–23 (2008). Feng, W.C., Feng, X., Ge, R.: Green supercomputing comes of age. IT Professional, 10(1), 17–23 (2008).
3.
go back to reference Dongarra, J., et al.: The international ExaScale software project roadmap. Int. J. High Perform. Comput. Appl. 25(1), 3–60 (2011). Dongarra, J., et al.: The international ExaScale software project roadmap. Int. J. High Perform. Comput. Appl. 25(1), 3–60 (2011).
6.
go back to reference Albers, Susanne: Energy-efficient algorithms. Commun. ACM 53, 86–96 (May 2010) Albers, Susanne: Energy-efficient algorithms. Commun. ACM 53, 86–96 (May 2010)
8.
go back to reference Aliaga, J.I., Bollhöfer, M., Martín, A.F., Quintana-Ortí, E.S.: Exploiting thread-level parallelism in the iterative solution of sparse linear systems. Parallel Comput. 37(3), 183–202 (2011)CrossRefMATHMathSciNet Aliaga, J.I., Bollhöfer, M., Martín, A.F., Quintana-Ortí, E.S.: Exploiting thread-level parallelism in the iterative solution of sparse linear systems. Parallel Comput. 37(3), 183–202 (2011)CrossRefMATHMathSciNet
9.
go back to reference Aliaga, J.I., Bollhöfer, M., Martín, A.F., Quintana-Ortí, E.S.: Parallelization of multilevel ILU preconditioners on distributed-memory multiprocessors. In: State of the Art in Scientific and Parallel Computing—PARA 2010. Lecture Notes in Computer Science, vol. 7133, pp. 162–172 (2012) Aliaga, J.I., Bollhöfer, M., Martín, A.F., Quintana-Ortí, E.S.: Parallelization of multilevel ILU preconditioners on distributed-memory multiprocessors. In: State of the Art in Scientific and Parallel Computing—PARA 2010. Lecture Notes in Computer Science, vol. 7133, pp. 162–172 (2012)
10.
go back to reference Aliaga, J.I., Dolz, M.F., Martín, A.F., Mayo, R. Quintana-Ortí, E.S.: Leveraging task-parallelism in energy-efficient ILU preconditioners. In: Proceedings of the Second International Conference on ICT as Key Technology Against Global Warming—ICT-GLOW. Lecture Notes in Computer Science, vol. 7453, pp. 55–63 (2012). Aliaga, J.I., Dolz, M.F., Martín, A.F., Mayo, R. Quintana-Ortí, E.S.: Leveraging task-parallelism in energy-efficient ILU preconditioners. In: Proceedings of the Second International Conference on ICT as Key Technology Against Global Warming—ICT-GLOW. Lecture Notes in Computer Science, vol. 7453, pp. 55–63 (2012).
11.
go back to reference HP Corp., Intel Corp., Microsoft Corp., Phoenix Tech. Ltd., and Toshiba Corp. Advanced configuration and power interface specification, revision 5.0 (2011). HP Corp., Intel Corp., Microsoft Corp., Phoenix Tech. Ltd., and Toshiba Corp. Advanced configuration and power interface specification, revision 5.0 (2011).
12.
go back to reference Valentini, G.L., Lassonde, W., Khan, S.U., Samee, U., Min-Allah, N., Madani, S.A., Li, J., Zhang, L., Wang, L., Ghani, N., Kolodziej, J., Li, H., Zomaya, A.Y., Xu, C., Balaji, P., Vishnu, A., Pinel, F., Pecero, J.E., Kliazovich, D., Bouvry, P.: An overview of energy efficiency techniques in cluster computing systems. Cluster Comput. 16(1), 3–15 (2013)CrossRef Valentini, G.L., Lassonde, W., Khan, S.U., Samee, U., Min-Allah, N., Madani, S.A., Li, J., Zhang, L., Wang, L., Ghani, N., Kolodziej, J., Li, H., Zomaya, A.Y., Xu, C., Balaji, P., Vishnu, A., Pinel, F., Pecero, J.E., Kliazovich, D., Bouvry, P.: An overview of energy efficiency techniques in cluster computing systems. Cluster Comput. 16(1), 3–15 (2013)CrossRef
13.
go back to reference Dolz, M.F., Fernández, J.C., Mayo, R., Quintana-Ortí, E.S.: Energy saving cluster roll: power saving system for clusters. In: Architecture of Computing Systems ARCS. Lecture Notes in Computer Science, vol. 5974, pp. 162–173 (2010). Dolz, M.F., Fernández, J.C., Mayo, R., Quintana-Ortí, E.S.: Energy saving cluster roll: power saving system for clusters. In: Architecture of Computing Systems ARCS. Lecture Notes in Computer Science, vol. 5974, pp. 162–173 (2010).
14.
go back to reference Pinheiro, E., Bianchini, R., Carrera, E.V., Heath, T.: Dynamic cluster recon guration for power and performance. In: Proceedings of the Workshop on Compilers and Operating Systems for Low Power, pp. 75–93 (2003). Pinheiro, E., Bianchini, R., Carrera, E.V., Heath, T.: Dynamic cluster recon guration for power and performance. In: Proceedings of the Workshop on Compilers and Operating Systems for Low Power, pp. 75–93 (2003).
15.
go back to reference Wang, L., Khan, S.U., Chen, D., Kolodziej, J., Ranjan, R., Xu, C., Zomaya, A.Y.: Energy-aware parallel task scheduling in a cluster. Future Generation Comp. Syst. 29(7), 1661–1670 (2013)CrossRef Wang, L., Khan, S.U., Chen, D., Kolodziej, J., Ranjan, R., Xu, C., Zomaya, A.Y.: Energy-aware parallel task scheduling in a cluster. Future Generation Comp. Syst. 29(7), 1661–1670 (2013)CrossRef
16.
go back to reference Alonso, P., Dolz, M.F., Mayo, R., Quintana-Ortí, E.S.: Energy-efficient execution of dense linear algebra algorithms on multi-core processors. Cluster Comput. 16(3), 497–509 (2013) Alonso, P., Dolz, M.F., Mayo, R., Quintana-Ortí, E.S.: Energy-efficient execution of dense linear algebra algorithms on multi-core processors. Cluster Comput. 16(3), 497–509 (2013)
17.
go back to reference Kolodziej, J., Khan, S.U., Wang, L., Byrski, A., Min-Allah, N., Madani, S.A.: Hierarchical genetic-based grid scheduling with energy optimization. Cluster Comput. 16(3), 591–609 (2013)CrossRef Kolodziej, J., Khan, S.U., Wang, L., Byrski, A., Min-Allah, N., Madani, S.A.: Hierarchical genetic-based grid scheduling with energy optimization. Cluster Comput. 16(3), 591–609 (2013)CrossRef
19.
go back to reference Gunther, S., Deval, A., Burton, T.: Energy-efficient computing: power-management system on the Intel Nehalem family of processors. Intel Technol. J. 15(1), 211 (2010). Gunther, S., Deval, A., Burton, T.: Energy-efficient computing: power-management system on the Intel Nehalem family of processors. Intel Technol. J. 15(1), 211 (2010).
20.
go back to reference Schöne, R., Hackenberg, D., Molka, D.: Memory performance at reduced CPU clock speeds: an analysis of current x86\_64 processors. In Proceedings of the 2012 USENIX Conference on Power-Aware Computing and Systems (2012). Schöne, R., Hackenberg, D., Molka, D.: Memory performance at reduced CPU clock speeds: an analysis of current x86\_64 processors. In Proceedings of the 2012 USENIX Conference on Power-Aware Computing and Systems (2012).
21.
go back to reference Diouri, M.E.M., Dolz, M.F., Glück, O., Lefèvre, L., Alonso, P., Catalán, S., Mayo, R., Quintana-Ortí, E.S.: Solving some mysteries in power monitoring of servers: take care of your wattmeters!. In: Proceedings of the Energy Efficiency in Large Scale Distributed Systems Conference—EE-LSDS (2013). Diouri, M.E.M., Dolz, M.F., Glück, O., Lefèvre, L., Alonso, P., Catalán, S., Mayo, R., Quintana-Ortí, E.S.: Solving some mysteries in power monitoring of servers: take care of your wattmeters!. In: Proceedings of the Energy Efficiency in Large Scale Distributed Systems Conference—EE-LSDS (2013).
22.
go back to reference Alonso, P., Badia, R.M., Labarta, J., Barreda, M., Dolz, M.F., Mayo, R., Quintana-Ortí, E.S., Reyes, R.: Tools for power-energy modelling and analysis of parallel scientific applications. In: Proceedings of the 2012 41st International Conference on Parallel Processing—ICPP, pp. 420–429 (2012). Alonso, P., Badia, R.M., Labarta, J., Barreda, M., Dolz, M.F., Mayo, R., Quintana-Ortí, E.S., Reyes, R.: Tools for power-energy modelling and analysis of parallel scientific applications. In: Proceedings of the 2012 41st International Conference on Parallel Processing—ICPP, pp. 420–429 (2012).
23.
go back to reference Barrachina, S., Barreda, M., Catalán, S., Dolz, M.F., Fabregat, G., Mayo, R., Quintana-Ortí, E.S.: An integrated framework for power-performance analysis of parallel scientific workloads. In: Proceedings of the Third International Conference on Smart Grids, Green Communications and IT Energy-aware Technologies (2013). Barrachina, S., Barreda, M., Catalán, S., Dolz, M.F., Fabregat, G., Mayo, R., Quintana-Ortí, E.S.: An integrated framework for power-performance analysis of parallel scientific workloads. In: Proceedings of the Third International Conference on Smart Grids, Green Communications and IT Energy-aware Technologies (2013).
25.
go back to reference Alonso, P., Dolz, M.F., Mayo, R., Quintana-Ortí, E.S.: Modeling power and energy of the task-parallel Cholesky factorization on multicore processors. Comput. Sci. Res. Develop., 29(2), 105–112 (2014) Alonso, P., Dolz, M.F., Mayo, R., Quintana-Ortí, E.S.: Modeling power and energy of the task-parallel Cholesky factorization on multicore processors. Comput. Sci. Res. Develop., 29(2), 105–112 (2014)
26.
go back to reference Hill, D.L., Huff, T., Kulick, S., Safranek, R.: The Uncore: a modular approach to feeding the high-performance cores. Intel Technol. J. 14(3), 30 (2010). Hill, D.L., Huff, T., Kulick, S., Safranek, R.: The Uncore: a modular approach to feeding the high-performance cores. Intel Technol. J. 14(3), 30 (2010).
30.
go back to reference NVIDIA Corporation. CUDA toolkit 4.0 readiness for CUDA applications, 4.0 edition, March (2011). NVIDIA Corporation. CUDA toolkit 4.0 readiness for CUDA applications, 4.0 edition, March (2011).
Metadata
Title
Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems
Authors
José I. Aliaga
María Barreda
Manuel F. Dolz
Alberto F. Martín
Rafael Mayo
Enrique S. Quintana-Ortí
Publication date
01-12-2014
Publisher
Springer US
Published in
Cluster Computing / Issue 4/2014
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-014-0402-z

Other articles of this Issue 4/2014

Cluster Computing 4/2014 Go to the issue

Premium Partner