Published in: Cluster Computing 3/2013

1 September 2013

Evaluating application performance and energy consumption on hybrid CPU+GPU architecture

Written by: Edson Luiz Padoin, Laércio Lima Pilla, Francieli Zanon Boito, Rodrigo Virote Kassick, Pedro Velho, Philippe O. A. Navaux

Abstract

For many years, the High Performance Computing (HPC) community aimed to increase performance regardless of energy consumption. By the end of the decade, the next generation of HPC systems is expected to reach sustained performance on the order of exaflops. This requires many times the performance of today's fastest supercomputers. Achieving this goal is unthinkable with current technology because of strict constraints on the power that can be supplied. Finding ways to improve energy efficiency has therefore become a main challenge in state-of-the-art research. This paper investigates energy efficiency on heterogeneous CPU+GPU architectures, using a scientific application from the agroforestry domain as a case study. Unlike other works, it evaluates how the workload of the application may affect energy efficiency on hybrid architectures. The results indicate that power supply constraints also depend on the workload.

Metadata
Title
Evaluating application performance and energy consumption on hybrid CPU+GPU architecture
Written by
Edson Luiz Padoin
Laércio Lima Pilla
Francieli Zanon Boito
Rodrigo Virote Kassick
Pedro Velho
Philippe O. A. Navaux
Publication date
1 September 2013
Publisher
Springer US
Published in
Cluster Computing / Issue 3/2013
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-012-0219-6
