Skip to main content
Erschienen in: Computational Mechanics 5/2019

17.08.2018 | Original Paper

The spectral cell method for wave propagation in heterogeneous materials simulated on multiple GPUs and CPUs

verfasst von: Farshid Mossaiby, Meysam Joulaian, Alexander Düster

Erschienen in: Computational Mechanics | Ausgabe 5/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Efficient simulation of wave propagation in heterogeneous materials is still a challenging task. The spectral cell method, representing a combination of spectral elements with the fictitious domain concept, has proven to be an efficient approach for wave propagation analysis in materials with complicated microstructure. In this paper, we report details of parallel implementation of the spectral cell method using multi-core CPUs as well as GPUs. In our CPU implementation, we employ the OpenMP directives to parallelize the loops. On GPUs, however, we use the OpenCL framework to develop single- and multi-GPU versions of the code. In all of our implementations, the core operation is a sparse matrix-vector multiplication (SpMV) kernel. We analyze each implementation to determine its features and bottlenecks. The results show that speedups of up to 128 relative to serial CPU code can be achieved using multi-GPU code.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
3.
4.
Zurück zum Zitat Ashari A, Sedaghati N, Eisenlohr J, Parthasarath S, Sadayappan P (2014) Fast sparse matrix-vector multiplication on GPUs for graph applications. In: SC14: international conference for high performance computing, networking, storage and analysis, pp 781–792. https://doi.org/10.1109/SC.2014.69 Ashari A, Sedaghati N, Eisenlohr J, Parthasarath S, Sadayappan P (2014) Fast sparse matrix-vector multiplication on GPUs for graph applications. In: SC14: international conference for high performance computing, networking, storage and analysis, pp 781–792. https://​doi.​org/​10.​1109/​SC.​2014.​69
6.
Zurück zum Zitat Bathe KJ (1996) Finite element procedures. Prentice Hall, Upper Saddle RiverMATH Bathe KJ (1996) Finite element procedures. Prentice Hall, Upper Saddle RiverMATH
7.
Zurück zum Zitat Bell N, Garland M (2009) Implementing sparse matrix-vector multiplication on throughput-oriented processors. In: Proceedings of the conference on high performance computing networking, storage and analysis, SC ’09, pp 18:1–18:11. ACM, New York, NY, USA. https://doi.org/10.1145/1654059.1654078 Bell N, Garland M (2009) Implementing sparse matrix-vector multiplication on throughput-oriented processors. In: Proceedings of the conference on high performance computing networking, storage and analysis, SC ’09, pp 18:1–18:11. ACM, New York, NY, USA. https://​doi.​org/​10.​1145/​1654059.​1654078
8.
Zurück zum Zitat Choi JW, Singh A, Vuduc RW (2010) Model-driven autotuning of sparse matrix-vector multiply on GPUs. In: Proceedings of the 15th ACM SIGPLAN symposium on principles and practice of parallel programming, PPoPP ’10, pp 115–126. ACM, New York, NY, USA. https://doi.org/10.1145/1693453.1693471 Choi JW, Singh A, Vuduc RW (2010) Model-driven autotuning of sparse matrix-vector multiply on GPUs. In: Proceedings of the 15th ACM SIGPLAN symposium on principles and practice of parallel programming, PPoPP ’10, pp 115–126. ACM, New York, NY, USA. https://​doi.​org/​10.​1145/​1693453.​1693471
9.
Zurück zum Zitat Cohen G (2002) Higher-order numerical methods for transient wave equations. Springer, BerlinCrossRefMATH Cohen G (2002) Higher-order numerical methods for transient wave equations. Springer, BerlinCrossRefMATH
12.
Zurück zum Zitat Düster A, Parvizian J, Yang Z, Rank E (2008) The finite cell method for three-dimensional problems of solid mechanics. Comput Methods Appl Mech Eng 197:3768–3782MathSciNetCrossRefMATH Düster A, Parvizian J, Yang Z, Rank E (2008) The finite cell method for three-dimensional problems of solid mechanics. Comput Methods Appl Mech Eng 197:3768–3782MathSciNetCrossRefMATH
13.
Zurück zum Zitat Düster A, Rank E, Szabó B (2017) The p-version of the finite element and finite cell methods. In: Stein E, de Borst R, Hughes TJR (eds) Encyclopedia of computational mechanics, 2nd edn. Wiley, Hoboken, pp 137–171. https://doi.org/10.1002/9781119176817.ecm2003g vol. Part 1. Solids and Structures, chap. 4 Düster A, Rank E, Szabó B (2017) The p-version of the finite element and finite cell methods. In: Stein E, de Borst R, Hughes TJR (eds) Encyclopedia of computational mechanics, 2nd edn. Wiley, Hoboken, pp 137–171. https://​doi.​org/​10.​1002/​9781119176817.​ecm2003g vol. Part 1. Solids and Structures, chap. 4
16.
Zurück zum Zitat Fries TP, Omerović S (2016) Higher-order accurate integration of implicit geometries. Int J Numer Methods Eng 106(5):323–371MathSciNetCrossRefMATH Fries TP, Omerović S (2016) Higher-order accurate integration of implicit geometries. Int J Numer Methods Eng 106(5):323–371MathSciNetCrossRefMATH
18.
Zurück zum Zitat Godwin J, Holewinski J, Sadayappan P (2012) High-performance sparse matrix-vector multiplication on GPUs for structured grid computations. In: Proceedings of the 5th annual workshop on general purpose processing with graphics processing units, GPGPU-5, pp 47–56. ACM, New York, NY, USA. https://doi.org/10.1145/2159430.2159436 Godwin J, Holewinski J, Sadayappan P (2012) High-performance sparse matrix-vector multiplication on GPUs for structured grid computations. In: Proceedings of the 5th annual workshop on general purpose processing with graphics processing units, GPGPU-5, pp 47–56. ACM, New York, NY, USA. https://​doi.​org/​10.​1145/​2159430.​2159436
19.
Zurück zum Zitat Gopalakrishnan S, Chakraborty A, Roy Mahapatra D (2008) Spectral finite element method—wave propagation, diagnostics and control in anisotropic and inhomogeneous structuresa. Springer, London (Computational Fluid and Solid Mechanics)MATH Gopalakrishnan S, Chakraborty A, Roy Mahapatra D (2008) Spectral finite element method—wave propagation, diagnostics and control in anisotropic and inhomogeneous structuresa. Springer, London (Computational Fluid and Solid Mechanics)MATH
20.
Zurück zum Zitat Gopalakrishnan S, Ruzzene M, Hanagud S (2011) Computational techniques for structural health monitoring. Springer, LondonCrossRef Gopalakrishnan S, Ruzzene M, Hanagud S (2011) Computational techniques for structural health monitoring. Springer, LondonCrossRef
22.
Zurück zum Zitat Hinton E, Rock T, Zienkiewicz OC (1976) A note on mass lumping and related processes in the finite element method. Earthq Eng Struct Dyn 4:245–249CrossRef Hinton E, Rock T, Zienkiewicz OC (1976) A note on mass lumping and related processes in the finite element method. Earthq Eng Struct Dyn 4:245–249CrossRef
24.
Zurück zum Zitat Joulaian M (2017) The hierarchical finite cell method for problems in structural mechanics. Ph.D. thesis, Hamburg University of Technology Joulaian M (2017) The hierarchical finite cell method for problems in structural mechanics. Ph.D. thesis, Hamburg University of Technology
27.
Zurück zum Zitat Karwacki M, Bylina B, Bylina J (2012) Multi-GPU implementation of the uniformization method for solving markov models. In: 2012 Federated conference on computer science and information systems (FedCSIS), pp 533–537 Karwacki M, Bylina B, Bylina J (2012) Multi-GPU implementation of the uniformization method for solving markov models. In: 2012 Federated conference on computer science and information systems (FedCSIS), pp 533–537
28.
Zurück zum Zitat Komatitsch D, Vilotte JP, Vai R, Castillo-Covarrubias J, Sanchez-Sesma F (1999) The spectral element method for elastic wave equations—application to 2-D and 3-D seismic problems. Int J Numer Methods Eng 45:1139–1164CrossRefMATH Komatitsch D, Vilotte JP, Vai R, Castillo-Covarrubias J, Sanchez-Sesma F (1999) The spectral element method for elastic wave equations—application to 2-D and 3-D seismic problems. Int J Numer Methods Eng 45:1139–1164CrossRefMATH
29.
Zurück zum Zitat Kreutzer M, Hager G, Wellein G, Fehske H, Basermann A, Bishop A.R (2012) Sparse matrix-vector multiplication on GPGPU clusters: a new storage format and a scalable implementation. In: 2012 IEEE 26th international parallel and distributed processing symposium workshops Ph.D. Forum, pp 1696–1702. https://doi.org/10.1109/IPDPSW.2012.211 Kreutzer M, Hager G, Wellein G, Fehske H, Basermann A, Bishop A.R (2012) Sparse matrix-vector multiplication on GPGPU clusters: a new storage format and a scalable implementation. In: 2012 IEEE 26th international parallel and distributed processing symposium workshops Ph.D. Forum, pp 1696–1702. https://​doi.​org/​10.​1109/​IPDPSW.​2012.​211
30.
Zurück zum Zitat Laugier P, Haïat G (2010) Bone quantitative ultrasound. Springer, Dordrecht Laugier P, Haïat G (2010) Bone quantitative ultrasound. Springer, Dordrecht
31.
Zurück zum Zitat McCalpin JD (1995) Memory bandwidth and machine balance in current high performance computers. IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter pp 19–25 McCalpin JD (1995) Memory bandwidth and machine balance in current high performance computers. IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter pp 19–25
35.
Zurück zum Zitat Ostachowicz W, Kudela P, Krawczuk M, Zak A (2012) Guided waves in structures for SHM. Wiley, ChichesterCrossRefMATH Ostachowicz W, Kudela P, Krawczuk M, Zak A (2012) Guided waves in structures for SHM. Wiley, ChichesterCrossRefMATH
36.
Zurück zum Zitat Parvizian J, Düster A, Rank E (2007) Finite cell method - h- and p-extension for embedded domain problems in solid mechanics. Comput Mech 41:121–133MathSciNetCrossRefMATH Parvizian J, Düster A, Rank E (2007) Finite cell method - h- and p-extension for embedded domain problems in solid mechanics. Comput Mech 41:121–133MathSciNetCrossRefMATH
37.
Zurück zum Zitat Patera AT (1984) A spectral element method for fluid dynamics: Laminar flow in a channel expansion. J Comput Phys 54:468–488CrossRefMATH Patera AT (1984) A spectral element method for fluid dynamics: Laminar flow in a channel expansion. J Comput Phys 54:468–488CrossRefMATH
41.
Zurück zum Zitat Rul S, Vandierendonck H, D’Haene J, De Bosschere K (2010) An experimental study on performance portability of OpenCL kernels. In: 2010 Symposium on application accelerators in high performance computing (SAAHPC ’10). biblio.ugent.be Rul S, Vandierendonck H, D’Haene J, De Bosschere K (2010) An experimental study on performance portability of OpenCL kernels. In: 2010 Symposium on application accelerators in high performance computing (SAAHPC ’10). biblio.ugent.be
42.
Zurück zum Zitat Staszewski WJ (2003) Health monitoring for aerospace structures. Wiley, ChichesterCrossRef Staszewski WJ (2003) Health monitoring for aerospace structures. Wiley, ChichesterCrossRef
44.
Zurück zum Zitat Willberg C, Duczek S, Vivar Perez JM, Schmicker D, Gabbert U (2012) Comparison of different higher order finite element schemes for the simulation of Lamb waves. Comput Methods Appl Mech Eng 241–244:246–261CrossRefMATH Willberg C, Duczek S, Vivar Perez JM, Schmicker D, Gabbert U (2012) Comparison of different higher order finite element schemes for the simulation of Lamb waves. Comput Methods Appl Mech Eng 241–244:246–261CrossRefMATH
Metadaten
Titel
The spectral cell method for wave propagation in heterogeneous materials simulated on multiple GPUs and CPUs
verfasst von
Farshid Mossaiby
Meysam Joulaian
Alexander Düster
Publikationsdatum
17.08.2018
Verlag
Springer Berlin Heidelberg
Erschienen in
Computational Mechanics / Ausgabe 5/2019
Print ISSN: 0178-7675
Elektronische ISSN: 1432-0924
DOI
https://doi.org/10.1007/s00466-018-1623-4

Weitere Artikel der Ausgabe 5/2019

Computational Mechanics 5/2019 Zur Ausgabe

Neuer Inhalt