Skip to main content

2019 | OriginalPaper | Buchkapitel

Petaflop Seismic Simulations in the Public Cloud

verfasst von : Alexander Breuer, Yifeng Cui, Alexander Heinecke

Erschienen in: High Performance Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

During the last decade cloud services and infrastructure as a service became a popular solution for diverse applications. Additionally, hardware support for virtualization closed performance gaps, compared to on-premises, bare-metal systems. This development is driven by offloaded hypervisors and full CPU virtualization. Today’s cloud service providers, such as Amazon or Google, offer the ability to assemble application-tailored clusters to maximize performance. However, from an interconnect point of view, one has to tackle a 4–5\(\times \) slow-down in terms of bandwidth and 25\(\times \) in terms of latency, compared to latest high-speed and low-latency interconnects. Taking into account the high per-node and accelerator-driven performance of latest supercomputers, we observe that the network-bandwidth performance of recent cloud offerings is within 2\(\times \) of large supercomputers. In order to address these challenges, we present a comprehensive application-centric approach for high-order seismic simulations utilizing the ADER discontinuous Galerkin finite element method, which exhibits excellent communication characteristics. This covers the tuning of the operating system, normally not possible on supercomputers, micro-benchmarking, and finally, the efficient execution of our solver in the public cloud. Due to this performance-oriented end-to-end workflow, we were able to achieve 1.09 PFLOPS on 768 AWS c5.18xlarge instances, offering 27,648 cores with 5 PFLOPS of theoretical computational power. This correlates to an achieved peak efficiency of over 20% and a close-to 90% parallel efficiency in a weak scaling setup. In terms of strong scalability, we were able to strong-scale a science scenario from 2 to 64 instances with 60% parallel efficiency. This work is, to the best of our knowledge, the first of its kind at such a large scale.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
LIBXSMM is available from: https://​github.​com/​hfp/​libxsmm.
 
5
AWS ParallelCluster is available from: https://​aws-parallelcluster.​readthedocs.​io.
 
7
AWS ParallelCluster supports further submission systems, e.g., AWS Batch or SGE.
 
15
Verification of FP32 for ADER-DG seismic wave propagation is recent work (see http://​doi.​org/​10.​17605/​OSF.​IO/​H9G5N and http://​opt.​dial3343.​org).
 
Literatur
1.
Zurück zum Zitat Alliez, P., et al.: 3D mesh generation. In: CGAL User and Reference Manual (2018) Alliez, P., et al.: 3D mesh generation. In: CGAL User and Reference Manual (2018)
2.
Zurück zum Zitat Breuer, A., et al.: Petascale local time stepping for the ADER-DG finite element method. In: IPDPS 2016 (2016) Breuer, A., et al.: Petascale local time stepping for the ADER-DG finite element method. In: IPDPS 2016 (2016)
3.
6.
Zurück zum Zitat Custódio, S., et al.: The 2004 mw6.0 Parkfield, California, earthquake: inversion of near-source ground motion using multiple data sets. Geophys. Res. Lett. 32(23) (2005) Custódio, S., et al.: The 2004 mw6.0 Parkfield, California, earthquake: inversion of near-source ground motion using multiple data sets. Geophys. Res. Lett. 32(23) (2005)
7.
Zurück zum Zitat Deelman, E., et al.: The cost of doing science on the cloud: the montage example. In: SC 2008 (2008) Deelman, E., et al.: The cost of doing science on the cloud: the montage example. In: SC 2008 (2008)
8.
Zurück zum Zitat Evangelinos, C., et al.: Cloud computing for parallel scientific HPC applications: feasibility of running coupled atmosphere-applications (2008) Evangelinos, C., et al.: Cloud computing for parallel scientific HPC applications: feasibility of running coupled atmosphere-applications (2008)
9.
Zurück zum Zitat Geuzaine, C., et al.: Gmsh: a 3-d finite element mesh generator with built-in pre- and post-processing facilities. Numer. Methods Eng. 79(11), 1309 (2009)MathSciNetCrossRef Geuzaine, C., et al.: Gmsh: a 3-d finite element mesh generator with built-in pre- and post-processing facilities. Numer. Methods Eng. 79(11), 1309 (2009)MathSciNetCrossRef
10.
11.
Zurück zum Zitat Graves, R., et al.: Cybershake: a physics-based seismic hazard model for Southern California. Pure Appl. Geophys. 168(3), 367–381 (2011)CrossRef Graves, R., et al.: Cybershake: a physics-based seismic hazard model for Southern California. Pure Appl. Geophys. 168(3), 367–381 (2011)CrossRef
12.
Zurück zum Zitat Heinecke, A., et al.: Petascale high order dynamic rupture earthquake simulations on heterogeneous supercomputers. In: SC 2014 (2014) Heinecke, A., et al.: Petascale high order dynamic rupture earthquake simulations on heterogeneous supercomputers. In: SC 2014 (2014)
13.
Zurück zum Zitat Intel: Intel Xeon Processor Scalable Family Specification Update (2018) Intel: Intel Xeon Processor Scalable Family Specification Update (2018)
14.
Zurück zum Zitat Jackson, K.R., et al.: Performance analysis of high performance computing applications on the Amazon web services cloud. In: CCCTS 2010 (2010) Jackson, K.R., et al.: Performance analysis of high performance computing applications on the Amazon web services cloud. In: CCCTS 2010 (2010)
15.
Zurück zum Zitat Mauch, V., et al.: High performance cloud computing. Future Gener. Comput. Syst. 29, 1408 (2013)CrossRef Mauch, V., et al.: High performance cloud computing. Future Gener. Comput. Syst. 29, 1408 (2013)CrossRef
16.
Zurück zum Zitat McCalpin, J.D.: HPL and DGEMM performance variability on the Xeon Platinum 8160 processor. In: SC 2018, pp. 18:1–18:13. IEEE Press, Piscataway (2018) McCalpin, J.D.: HPL and DGEMM performance variability on the Xeon Platinum 8160 processor. In: SC 2018, pp. 18:1–18:13. IEEE Press, Piscataway (2018)
17.
Zurück zum Zitat Mohammadi, M., et al.: Comparative benchmarking of cloud computing vendors with high performance linpack. In: HPCCC 2018 (2018) Mohammadi, M., et al.: Comparative benchmarking of cloud computing vendors with high performance linpack. In: HPCCC 2018 (2018)
18.
Zurück zum Zitat Napper, J., et al.: Can cloud computing reach the top500? In: UCHPC-MAW 2009 (2009) Napper, J., et al.: Can cloud computing reach the top500? In: UCHPC-MAW 2009 (2009)
19.
Zurück zum Zitat Schoeder, S., et al.: Efficient explicit time stepping of high order discontinuous Galerkin schemes for waves. arXiv e-prints arXiv:1805.03981, May 2018 Schoeder, S., et al.: Efficient explicit time stepping of high order discontinuous Galerkin schemes for waves. arXiv e-prints arXiv:​1805.​03981, May 2018
20.
Zurück zum Zitat Small, P., et al.: The SCEC unified community velocity model software framework. Seismol. Res. Lett. 88(6), 1539 (2017)CrossRef Small, P., et al.: The SCEC unified community velocity model software framework. Seismol. Res. Lett. 88(6), 1539 (2017)CrossRef
21.
Zurück zum Zitat Top500 Authors: Top500 List, November 2013 Top500 Authors: Top500 List, November 2013
22.
Zurück zum Zitat Uphoff, C., et al.: Extreme scale multi-physics simulations of the tsunamigenic 2004 sumatra megathrust earthquake. In: SC 2017 (2017) Uphoff, C., et al.: Extreme scale multi-physics simulations of the tsunamigenic 2004 sumatra megathrust earthquake. In: SC 2017 (2017)
23.
Zurück zum Zitat Yvinec, M.: 2D triangulation. In: CGAL User and Reference Manual (2018) Yvinec, M.: 2D triangulation. In: CGAL User and Reference Manual (2018)
24.
Zurück zum Zitat Zhao, L., et al.: Strain green’s tensors, reciprocity, and their applications to seismic source and structure studies. Bull. Seismol. Soc. Am. 96(5), 1753 (2006)CrossRef Zhao, L., et al.: Strain green’s tensors, reciprocity, and their applications to seismic source and structure studies. Bull. Seismol. Soc. Am. 96(5), 1753 (2006)CrossRef
Metadaten
Titel
Petaflop Seismic Simulations in the Public Cloud
verfasst von
Alexander Breuer
Yifeng Cui
Alexander Heinecke
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-20656-7_9

Premium Partner