Skip to main content
Erschienen in: The Journal of Supercomputing 1/2017

16.01.2016

Using heterogeneous computing for scattering prediction in scenarios with several source configurations

verfasst von: M. López-Portugués, J.  A. López-Fernández, José Ranilla, R. G. Ayestarán, F. Las-Heras

Erschienen in: The Journal of Supercomputing | Ausgabe 1/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this work, we present a tool for solving large scattering problems with several acoustic source configurations. These problems entail a large matrix multiplication where the matrices must be generated on demand so that problems can be solved using systems with less memory than that required to store the whole matrices. We have analysed and developed different versions: one based on multiple matrix-vector products, two different approaches built on tiled matrix multiplication, and one heterogeneous implementation for using a GPU and a Xeon Phi simultaneously. To test these implementations, we have used different devices: multicore CPUs, a Xeon Phi accelerator, and a Tesla GPU. When compared to our initial work, the peak speedup of the new solutions is \(25\times \) for CPU, \(17\times \) for Phi, \(20\times \) for GPU, and \(20\times \) for the heterogeneous GPU + Phi implementation. Finally, it is worth mentioning that the tool presented in this work can be adapted and applied to other fields whenever the problem to solve requires a large matrix multiplication where the elements must be generated on demand (e.g. the inverse scattering problem in electromagnetics).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Fußnoten
1
8 cores at 2.0 GHz (Hyper-Threading and Turbo Boost disabled).
 
2
2,496 CUDA cores at 706 MHz and 5 GB of device memory.
 
3
60 cores at 1.053 GHz (4 threads/core) and 8 GB of RAM.
 
Literatur
5.
Zurück zum Zitat Quintana-Ortí G et al (2012) A runtime system for programming out-of-core matrix algorithms-by-tiles on multithreaded architectures. ACM Trans Math Softw 38(4):1–25MathSciNetCrossRef Quintana-Ortí G et al (2012) A runtime system for programming out-of-core matrix algorithms-by-tiles on multithreaded architectures. ACM Trans Math Softw 38(4):1–25MathSciNetCrossRef
6.
Zurück zum Zitat Hu FQ (2013) An efficient solution of time domain boundary integral equations for acoustic scattering and its acceleration by Graphics Processing Units. In: 19th AIAA/CEAS Aeroacoustics Conference. American Institute of Aeronautics and Astronautics Hu FQ (2013) An efficient solution of time domain boundary integral equations for acoustic scattering and its acceleration by Graphics Processing Units. In: 19th AIAA/CEAS Aeroacoustics Conference. American Institute of Aeronautics and Astronautics
7.
Zurück zum Zitat López-Portugués M et al (2014) Aircraft noise scattering prediction using different accelerator architectures. J Supercomp 70(2):612–622CrossRef López-Portugués M et al (2014) Aircraft noise scattering prediction using different accelerator architectures. J Supercomp 70(2):612–622CrossRef
8.
Zurück zum Zitat El-Shenawee M, Miller EL (2004) Multiple-incidence and multifrequency for profile reconstruction of random rough surfaces using the 3-D electromagnetic fast multipole model. IEEE Trans Geosci Remote Sens 42(11):2499–2510CrossRef El-Shenawee M, Miller EL (2004) Multiple-incidence and multifrequency for profile reconstruction of random rough surfaces using the 3-D electromagnetic fast multipole model. IEEE Trans Geosci Remote Sens 42(11):2499–2510CrossRef
9.
Zurück zum Zitat Álvarez-López Y et al (2010) Geometry reconstruction of metallic bodies using the sources reconstruction method. IEEE Antennas Wirel Propag Lett 9:1197–1200CrossRef Álvarez-López Y et al (2010) Geometry reconstruction of metallic bodies using the sources reconstruction method. IEEE Antennas Wirel Propag Lett 9:1197–1200CrossRef
10.
Zurück zum Zitat Guan J, Yan S, Jin JM (2013) An OpenMP-CUDA implementation of multilevel fast multipole algorithm for electromagnetic simulation on multi-GPU computing systems. IEEE Trans Antennas Propag 61(7):3607–3616MathSciNetCrossRef Guan J, Yan S, Jin JM (2013) An OpenMP-CUDA implementation of multilevel fast multipole algorithm for electromagnetic simulation on multi-GPU computing systems. IEEE Trans Antennas Propag 61(7):3607–3616MathSciNetCrossRef
11.
Zurück zum Zitat Nguyen QM et al (2013) Parallelizing fast multipole method for large-scale electromagnetic problems using GPU clusters. IEEE Antennas Wirel Propag Lett 12:868–871CrossRef Nguyen QM et al (2013) Parallelizing fast multipole method for large-scale electromagnetic problems using GPU clusters. IEEE Antennas Wirel Propag Lett 12:868–871CrossRef
12.
Zurück zum Zitat Dang V, Nguyen Q, Kilic O (2013) Fast multipole method for large-scale electromagnetic scattering problems on GPU cluster and FPGA-accelerated platforms. Appl Comput Electromagn Soc J 28(12):1187–1198 Dang V, Nguyen Q, Kilic O (2013) Fast multipole method for large-scale electromagnetic scattering problems on GPU cluster and FPGA-accelerated platforms. Appl Comput Electromagn Soc J 28(12):1187–1198
13.
Zurück zum Zitat López-Portugués M et al (2012) Acoustic scattering solver based on single level FMM for multi-GPU systems. J Parallel Distrib Comp 72(9):1057–1064CrossRef López-Portugués M et al (2012) Acoustic scattering solver based on single level FMM for multi-GPU systems. J Parallel Distrib Comp 72(9):1057–1064CrossRef
14.
Zurück zum Zitat López-Portugués M et al (2013) Parallelization of the FMM on distributed-memory GPGPU. J Supercomp 64(1):17–27CrossRef López-Portugués M et al (2013) Parallelization of the FMM on distributed-memory GPGPU. J Supercomp 64(1):17–27CrossRef
16.
Zurück zum Zitat Wu TW (2000) Boundary element acoustics: fundamentals and computer codes. WIT Press, SouthamptonMATH Wu TW (2000) Boundary element acoustics: fundamentals and computer codes. WIT Press, SouthamptonMATH
17.
Zurück zum Zitat Anderson E et al (1995) LAPACK users’ guide. second. Society for Industrial and Applied Mathematics, Philadelphia Anderson E et al (1995) LAPACK users’ guide. second. Society for Industrial and Applied Mathematics, Philadelphia
21.
Zurück zum Zitat Gannon D, Jalby W, Gallivan K (1988) Strategies for cache and local memory management by global program transformation. J Parallel Distrib Comp 5(5):587–616CrossRef Gannon D, Jalby W, Gallivan K (1988) Strategies for cache and local memory management by global program transformation. J Parallel Distrib Comp 5(5):587–616CrossRef
22.
Zurück zum Zitat Lebeck AR, Wood DA (1994) Cache profiling and the SPEC benchmarks: a case study. IEEE Comp 27(10):15–26CrossRef Lebeck AR, Wood DA (1994) Cache profiling and the SPEC benchmarks: a case study. IEEE Comp 27(10):15–26CrossRef
Metadaten
Titel
Using heterogeneous computing for scattering prediction in scenarios with several source configurations
verfasst von
M. López-Portugués
J.  A. López-Fernández
José Ranilla
R. G. Ayestarán
F. Las-Heras
Publikationsdatum
16.01.2016
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 1/2017
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-015-1618-2

Weitere Artikel der Ausgabe 1/2017

The Journal of Supercomputing 1/2017 Zur Ausgabe