Skip to main content

2015 | OriginalPaper | Buchkapitel

Energy-Performance Tradeoffs for HPC Applications on Low Power Processors

verfasst von : Enrico Calore, Sebastiano Fabio Schifano, Raffaele Tripiccione

Erschienen in: Euro-Par 2015: Parallel Processing Workshops

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Energy efficiency is becoming more and more important in the HPC field; high-end processors are quickly evolving towards more advanced power-saving and power-monitoring technologies. On the other hand, low-power processors, designed for the mobile market, attract interest in the HPC area for their increasing computing capabilities, competitive pricing and low power consumption. In this work we study energy and computing performances of a Tegra K1 mobile processor using an HPC Lattice Boltzmann application as a benchmark. We run this application on the ARM Cortex-A15 CPU and on the GK20A GPU, both available in this processor. Our analysis uses time-accurate measurements, obtained by a simple custom-developed current monitor. We discuss several energy and performance metrics, interesting per se and also in view of a prospective use of these processors in a HPC context.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
2.
Zurück zum Zitat Biferale, L., Mantovani, F., Sbragaglia, M., Scagliarini, A., Toschi, F., Tripiccione, R.: Second-order closure in stratified turbulence: simulations and modeling of bulk and entrainment regions. Phys. Rev. E 84(1), 016305 (2011). doi:10.1103/PhysRevE.84.016305 CrossRef Biferale, L., Mantovani, F., Sbragaglia, M., Scagliarini, A., Toschi, F., Tripiccione, R.: Second-order closure in stratified turbulence: simulations and modeling of bulk and entrainment regions. Phys. Rev. E 84(1), 016305 (2011). doi:10.​1103/​PhysRevE.​84.​016305 CrossRef
3.
Zurück zum Zitat Calore, E., Schifano, S.F., Tripiccione, R.: On portability, performance and scalability of an MPI OpenCL lattice boltzmann code. In: Lopes, L., et al. (eds.) Euro-Par 2014, Part II. LNCS, vol. 8806, pp. 438–449. Springer, Heidelberg (2014) Calore, E., Schifano, S.F., Tripiccione, R.: On portability, performance and scalability of an MPI OpenCL lattice boltzmann code. In: Lopes, L., et al. (eds.) Euro-Par 2014, Part II. LNCS, vol. 8806, pp. 438–449. Springer, Heidelberg (2014)
4.
Zurück zum Zitat Choi, J., Dukhan, M., Liu, X., Vuduc, R.: Algorithmic time, energy, and power on candidate HPC compute building blocks. In: IEEE 28th International Parallel and Distributed Processing Symposium, pp. 447–457 (2014). doi:10.1109/IPDPS.2014.54 Choi, J., Dukhan, M., Liu, X., Vuduc, R.: Algorithmic time, energy, and power on candidate HPC compute building blocks. In: IEEE 28th International Parallel and Distributed Processing Symposium, pp. 447–457 (2014). doi:10.​1109/​IPDPS.​2014.​54
5.
Zurück zum Zitat Coplin, J., Burtscher, M.: Effects of source-code optimizations on GPU performance and energy consumption. In: Proceedings of the 8th Workshop on General Purpose Processing Using GPUs, GPGPU 2015, pp. 48–58 (2015). doi:10.1145/2716282.2716292 Coplin, J., Burtscher, M.: Effects of source-code optimizations on GPU performance and energy consumption. In: Proceedings of the 8th Workshop on General Purpose Processing Using GPUs, GPGPU 2015, pp. 48–58 (2015). doi:10.​1145/​2716282.​2716292
6.
Zurück zum Zitat Crimi, G., Mantovani, F., Pivanti, M., Schifano, S.F., Tripiccione, R.: Early experience on porting and running a lattice boltzmann code on the xeon-phi co-processor. Procedia Comput. Sci. 18, 551–560 (2013). doi:10.1016/j.procs.2013.05.219 CrossRef Crimi, G., Mantovani, F., Pivanti, M., Schifano, S.F., Tripiccione, R.: Early experience on porting and running a lattice boltzmann code on the xeon-phi co-processor. Procedia Comput. Sci. 18, 551–560 (2013). doi:10.​1016/​j.​procs.​2013.​05.​219 CrossRef
7.
Zurück zum Zitat Hackenberg, D., Ilsche, T., Schone, R., Molka, D., Schmidt, M., Nagel, W.: Power measurement techniques on standard compute nodes: a quantitative comparison. In: 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 194–204 (2013). doi:10.1109/ISPASS.2013.6557170 Hackenberg, D., Ilsche, T., Schone, R., Molka, D., Schmidt, M., Nagel, W.: Power measurement techniques on standard compute nodes: a quantitative comparison. In: 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 194–204 (2013). doi:10.​1109/​ISPASS.​2013.​6557170
8.
Zurück zum Zitat Kim, N., Austin, T., Baauw, D., Mudge, T., Flautner, K., Hu, J., Irwin, M., Kandemir, M., Narayanan, V.: Leakage current: moore’s law meets static power. Computer 36(12), 68–75 (2003). doi:10.1109/MC.2003.1250885 CrossRef Kim, N., Austin, T., Baauw, D., Mudge, T., Flautner, K., Hu, J., Irwin, M., Kandemir, M., Narayanan, V.: Leakage current: moore’s law meets static power. Computer 36(12), 68–75 (2003). doi:10.​1109/​MC.​2003.​1250885 CrossRef
9.
Zurück zum Zitat Kraus, J., Pivanti, M., Schifano, S.F., Tripiccione, R., Zanella, M.: Benchmarking GPUs with a parallel lattice-boltzmann code. In: 25th Int. Symposiumon Computer Architecture and High Performance Computing (SBAC-PAD), pp. 160–167. IEEE (2013). doi:10.1109/SBAC-PAD.2013.37 Kraus, J., Pivanti, M., Schifano, S.F., Tripiccione, R., Zanella, M.: Benchmarking GPUs with a parallel lattice-boltzmann code. In: 25th Int. Symposiumon Computer Architecture and High Performance Computing (SBAC-PAD), pp. 160–167. IEEE (2013). doi:10.​1109/​SBAC-PAD.​2013.​37
10.
Zurück zum Zitat Laurenzano, M.A., Tiwari, A., Jundt, A., Peraza, J., Ward Jr, W.A., Campbell, R., Carrington, L.: Characterizing the performance-energy tradeoff of small ARM cores in HPC computation. In: Silva, F., Dutra, I., Santos Costa, V. (eds.) Euro-Par 2014 Parallel Processing. LNCS, vol. 8632, pp. 124–137. Springer, Heidelberg (2014) Laurenzano, M.A., Tiwari, A., Jundt, A., Peraza, J., Ward Jr, W.A., Campbell, R., Carrington, L.: Characterizing the performance-energy tradeoff of small ARM cores in HPC computation. In: Silva, F., Dutra, I., Santos Costa, V. (eds.) Euro-Par 2014 Parallel Processing. LNCS, vol. 8632, pp. 124–137. Springer, Heidelberg (2014)
12.
Zurück zum Zitat Mead, C., Conway, L.: Introduction to VLSI systems, vol. 802. Addison-Wesley, Reading (1980) Mead, C., Conway, L.: Introduction to VLSI systems, vol. 802. Addison-Wesley, Reading (1980)
13.
Zurück zum Zitat Rajovic, N., Rico, A., Puzovic, N., Adeniyi-Jones, C., Ramirez, A.: Tibidabo: making the case for an ARM-based HPC system. Future Generation Computer Systems 36, 322–334 (2014)CrossRef Rajovic, N., Rico, A., Puzovic, N., Adeniyi-Jones, C., Ramirez, A.: Tibidabo: making the case for an ARM-based HPC system. Future Generation Computer Systems 36, 322–334 (2014)CrossRef
15.
Zurück zum Zitat Scagliarini, A., Biferale, L., Sbragaglia, M., Sugiyama, K., Toschi, F.: Lattice boltzmann methods for thermal flows: continuum limit and applications to compressible Rayleigh-Taylor systems. Phys. Fluids (1994-present) 22(5), 055101 (2010). doi:10.1063/1.3392774 CrossRef Scagliarini, A., Biferale, L., Sbragaglia, M., Sugiyama, K., Toschi, F.: Lattice boltzmann methods for thermal flows: continuum limit and applications to compressible Rayleigh-Taylor systems. Phys. Fluids (1994-present) 22(5), 055101 (2010). doi:10.​1063/​1.​3392774 CrossRef
16.
Zurück zum Zitat Succi, S.: The Lattice-Boltzmann Equation. Oxford University Press, Oxford (2001)MATH Succi, S.: The Lattice-Boltzmann Equation. Oxford University Press, Oxford (2001)MATH
17.
Zurück zum Zitat Wittmann, M., Hager, G., Zeiser, T., Treibig, J., Wellein, G.: Chip-level and multi-node analysis of energy-optimized lattice Boltzmann CFD simulations. Concurr. Comput. Pract. Exp. (2015). doi:10.1002/cpe.3489. ISSN: 1532-0634 Wittmann, M., Hager, G., Zeiser, T., Treibig, J., Wellein, G.: Chip-level and multi-node analysis of energy-optimized lattice Boltzmann CFD simulations. Concurr. Comput. Pract. Exp. (2015). doi:10.​1002/​cpe.​3489. ISSN: 1532-0634
Metadaten
Titel
Energy-Performance Tradeoffs for HPC Applications on Low Power Processors
verfasst von
Enrico Calore
Sebastiano Fabio Schifano
Raffaele Tripiccione
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-27308-2_59