Skip to main content
Top
Published in: The Journal of Supercomputing 8/2021

20-01-2021

Evaluating the performance of FFT library implementations on modern hybrid computing systems

Authors: Sergey I. Malkovsky, Aleksei A. Sorokin, Georgiy I. Tsoy, Sergey P. Korolev, Sergey I. Smagin, Vadim A. Kondrashev

Published in: The Journal of Supercomputing | Issue 8/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Fast Fourier transform is widely used to solve numerous scientific and engineering problems. In particular, this transform is behind the software dealing with speech and image recognition, signal analysis, modeling of properties of new materials and substances, etc. Newly emerging high-performance hybrid computing systems, as well as systems with alternative architectures, require research on discrete Fourier transform computation efficiency on these new platforms. The results of such research allow assessing the feasibility of certain solutions for building modern computing and data processing centers. This paper presents the results of such research covering modern hybrid computing systems based on the IBM POWER and Intel Xeon processors, as well as on NVIDIA Tesla co-processors. The analysis is carried out, and conclusions are presented on their performance when executing fast Fourier transforms. The impact of the existing architectural aspects of the hardware (CPU simultaneous multithreading mode, GPU data transfer bus, etc.) on the transform performance efficiency is assessed. The obtained results are used to provide recommendations on the optimal operation modes and settings of the considered mathematical libraries.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Cooley JW, Tukey JW (1965) An algorithm for the machine calculation of complex Fourier series. Math Comput 19(90):297–301MathSciNetCrossRef Cooley JW, Tukey JW (1965) An algorithm for the machine calculation of complex Fourier series. Math Comput 19(90):297–301MathSciNetCrossRef
3.
go back to reference Stanković D, Jovanović P, Jović A, Slavnić V, Vudragović D, Balaž A (2014) Implementation and Benchmarking of New FFT Libraries in Quantum ESPRESSO. In: Dulea M, Karaivanova A, Oulas A, Liabotis I, Stojiljkovic D, Prnjat O (eds) High-Performance Computing Infrastructure for South East Europe’s Research Communities, Modeling and Optimization in Science and Technologies, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-319-01520-0_19 Stanković D, Jovanović P, Jović A, Slavnić V, Vudragović D, Balaž A (2014) Implementation and Benchmarking of New FFT Libraries in Quantum ESPRESSO. In: Dulea M, Karaivanova A, Oulas A, Liabotis I, Stojiljkovic D, Prnjat O (eds) High-Performance Computing Infrastructure for South East Europe’s Research Communities, Modeling and Optimization in Science and Technologies, vol 2. Springer, Cham. https://​doi.​org/​10.​1007/​978-3-319-01520-0_​19
4.
go back to reference Wende F, Marsman M, Steinke T (2016) On Enhancing 3D-FFT Performance in VASP. In: CUG proceedings Wende F, Marsman M, Steinke T (2016) On Enhancing 3D-FFT Performance in VASP. In: CUG proceedings
6.
go back to reference Luszczek P, Dongarra J, Koester D, Rabensiefner R, Lucas B, Kepner J, McCalpin J, Bailey D, Takahashi D (2005) Introduction to the HPC Challenge Benchmark Suite. Lawrence Berkeley National Laboratory. Paper LBNL-57493, 12p Luszczek P, Dongarra J, Koester D, Rabensiefner R, Lucas B, Kepner J, McCalpin J, Bailey D, Takahashi D (2005) Introduction to the HPC Challenge Benchmark Suite. Lawrence Berkeley National Laboratory. Paper LBNL-57493, 12p
8.
go back to reference Jodra JL, Gurrutxaga I, Muguerza J (2015) A study of memory consumption and execution performance of the cufft library. In: 2015 10th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC). IEEE, pp 323–327. https://doi.org/10.1109/3PGCIC.2015.66 Jodra JL, Gurrutxaga I, Muguerza J (2015) A study of memory consumption and execution performance of the cufft library. In: 2015 10th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC). IEEE, pp 323–327. https://​doi.​org/​10.​1109/​3PGCIC.​2015.​66
9.
go back to reference Střelák D, Filipovič J (2018) Performance analysis and autotuning setup of the cuFFT library. In: Proceedings of the 2nd Workshop on Autotuning and Adaptivity Approaches for Energy Efficient HPC Systems—ANDARE ’18. ACM Press, New York, pp 1–6. https://doi.org/10.1145/3295816.3295817 Střelák D, Filipovič J (2018) Performance analysis and autotuning setup of the cuFFT library. In: Proceedings of the 2nd Workshop on Autotuning and Adaptivity Approaches for Energy Efficient HPC Systems—ANDARE ’18. ACM Press, New York, pp 1–6. https://​doi.​org/​10.​1145/​3295816.​3295817
10.
go back to reference Govindaraju NK, Lloyd B, Dotsenko Y, Smith B, Manferdelli J (2008) High performance discrete Fourier transforms on graphics processors. In: 2008 SC—International Conference for High Performance Computing, Networking, Storage and Analysis IEEE, pp 1-12. https://doi.org/10.1109/SC.2008.5213922 Govindaraju NK, Lloyd B, Dotsenko Y, Smith B, Manferdelli J (2008) High performance discrete Fourier transforms on graphics processors. In: 2008 SC—International Conference for High Performance Computing, Networking, Storage and Analysis IEEE, pp 1-12. https://​doi.​org/​10.​1109/​SC.​2008.​5213922
16.
go back to reference Sinharoy B, Van Norstrand JA, Eickemeyer RJ, Le HQ, Leenstra J, Nguyen DQ, Konigsburg B, Ward K, Brown MD, Moreira JE, Levitan D, Tung S, Hrusecky D, Bishop JW, Gschwind M, Boersma M, Kroener M, Kaltenbach M, Karkhanis T, Fernsler KM (2015) IBM POWER8 processor core microarchitecture. IBM J Res Dev 59(1):2:1–2:21CrossRef Sinharoy B, Van Norstrand JA, Eickemeyer RJ, Le HQ, Leenstra J, Nguyen DQ, Konigsburg B, Ward K, Brown MD, Moreira JE, Levitan D, Tung S, Hrusecky D, Bishop JW, Gschwind M, Boersma M, Kroener M, Kaltenbach M, Karkhanis T, Fernsler KM (2015) IBM POWER8 processor core microarchitecture. IBM J Res Dev 59(1):2:1–2:21CrossRef
17.
go back to reference Sadasivam SK, Thompto BW, Kalla R, Starke WJ (2017) IBM Power9 processor architecture. IEEE Micro 37:40–51CrossRef Sadasivam SK, Thompto BW, Kalla R, Starke WJ (2017) IBM Power9 processor architecture. IEEE Micro 37:40–51CrossRef
26.
go back to reference Steinbach P, Werner M (2017) Gearshifft—the FFT benchmark suite for heterogeneous platforms. In: Kunkel J, Yokota R, Balaji P, Keyes D (eds) High performance computing. ISC 2017. Lecture notes in computer science, vol 10266. Springer, Cham, pp 199-216. https://doi.org/10.1007/978-3-319-58667-0_11 Steinbach P, Werner M (2017) Gearshifft—the FFT benchmark suite for heterogeneous platforms. In: Kunkel J, Yokota R, Balaji P, Keyes D (eds) High performance computing. ISC 2017. Lecture notes in computer science, vol 10266. Springer, Cham, pp 199-216. https://​doi.​org/​10.​1007/​978-3-319-58667-0_​11
Metadata
Title
Evaluating the performance of FFT library implementations on modern hybrid computing systems
Authors
Sergey I. Malkovsky
Aleksei A. Sorokin
Georgiy I. Tsoy
Sergey P. Korolev
Sergey I. Smagin
Vadim A. Kondrashev
Publication date
20-01-2021
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 8/2021
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03591-6

Other articles of this Issue 8/2021

The Journal of Supercomputing 8/2021 Go to the issue

Premium Partner