Skip to main content
Top

2019 | OriginalPaper | Chapter

7. Parallel FFT Algorithms for Shared-Memory Parallel Computers

Author : Daisuke Takahashi

Published in: Fast Fourier Transform Algorithms for Parallel Computers

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This chapter presents parallel FFT algorithms for shared-memory parallel computers. First, the implementation of parallel one-dimensional FFT on shared-memory parallel computers is described. Next, optimizing parallel FFTs for manycore processors and its performance are explained.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bailey, D.H.: FFTs in external or hierarchical memory. J. Supercomput. 4, 23–35 (1990)CrossRef Bailey, D.H.: FFTs in external or hierarchical memory. J. Supercomput. 4, 23–35 (1990)CrossRef
2.
go back to reference Bergland, G.D.: A fast Fourier transform algorithm using base 8 iterations. Math. Comput. 22, 275–279 (1968)MathSciNetMATH Bergland, G.D.: A fast Fourier transform algorithm using base 8 iterations. Math. Comput. 22, 275–279 (1968)MathSciNetMATH
3.
go back to reference Frigo, M., Johnson, S.G.: FFTW: an adaptive software architecture for the FFT. In: Proceedings of 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’98), vol. 3, pp. 1381–1384 (1998) Frigo, M., Johnson, S.G.: FFTW: an adaptive software architecture for the FFT. In: Proceedings of 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’98), vol. 3, pp. 1381–1384 (1998)
4.
go back to reference Frigo, M., Johnson, S.G.: The design and implementation of FFTW3. Proc. IEEE 93, 216–231 (2005)CrossRef Frigo, M., Johnson, S.G.: The design and implementation of FFTW3. Proc. IEEE 93, 216–231 (2005)CrossRef
5.
go back to reference Hascoet, J., Nezan, J.F., Ensor, A., de Dinechin, B.D.: Implementation of a fast Fourier transform algorithm onto a manycore processor. In: Proceedings of 2015 Conference on Design and Architectures for Signal and Image Processing (DASIP 2015) (2015) Hascoet, J., Nezan, J.F., Ensor, A., de Dinechin, B.D.: Implementation of a fast Fourier transform algorithm onto a manycore processor. In: Proceedings of 2015 Conference on Design and Architectures for Signal and Image Processing (DASIP 2015) (2015)
6.
go back to reference Hegland, M.: A self-sorting in-place fast Fourier transform algorithm suitable for vector and parallel processing. Numer. Math. 68, 507–547 (1994)MathSciNetCrossRef Hegland, M.: A self-sorting in-place fast Fourier transform algorithm suitable for vector and parallel processing. Numer. Math. 68, 507–547 (1994)MathSciNetCrossRef
8.
go back to reference Marr, D.T., Binns, F., Hill, D.L., Hinton, G., Koufaty, D.A., Miller, J.A., Upton, M.: Hyper-threading technology architecture and microarchitecture. Intel Technol. J. 6, 1–11 (2002) Marr, D.T., Binns, F., Hill, D.L., Hinton, G., Koufaty, D.A., Miller, J.A., Upton, M.: Hyper-threading technology architecture and microarchitecture. Intel Technol. J. 6, 1–11 (2002)
10.
go back to reference Panda, P.R., Nakamura, H., Dutt, N.D., Nicolau, A.: Augmenting loop tiling with data alignment for improved cache performance. IEEE Trans. Comput. 48, 142–149 (1999)CrossRef Panda, P.R., Nakamura, H., Dutt, N.D., Nicolau, A.: Augmenting loop tiling with data alignment for improved cache performance. IEEE Trans. Comput. 48, 142–149 (1999)CrossRef
11.
go back to reference Sodani, A., Gramunt, R., Corbal, J., Kim, H.S., Vinod, K., Chinthamani, S., Hutsell, S., Agarwal, R., Liu, Y.C.: Knights Landing: second-generation Intel Xeon Phi product. IEEE Micro 36, 34–46 (2016)CrossRef Sodani, A., Gramunt, R., Corbal, J., Kim, H.S., Vinod, K., Chinthamani, S., Hutsell, S., Agarwal, R., Liu, Y.C.: Knights Landing: second-generation Intel Xeon Phi product. IEEE Micro 36, 34–46 (2016)CrossRef
13.
go back to reference Takahashi, D.: An implementation of parallel 1-D real FFT on Intel Xeon Phi processors. In: Proceedings of 17th International Conference on Computational Science and Its Applications (ICCSA 2017), Part I, Lecture Notes in Computer Science, vol. 10404, pp. 401–410. Springer International Publishing (2017) Takahashi, D.: An implementation of parallel 1-D real FFT on Intel Xeon Phi processors. In: Proceedings of 17th International Conference on Computational Science and Its Applications (ICCSA 2017), Part I, Lecture Notes in Computer Science, vol. 10404, pp. 401–410. Springer International Publishing (2017)
14.
go back to reference Takahashi, D., Sato, M., Boku, T.: An OpenMP implementation of parallel FFT and its performance on IA-64 processors. In: Proceedings of International Workshop on OpenMP Applications and Tools (WOMPAT 2003), Lecture Notes in Computer Science, vol. 2716, pp. 99–108. Springer (2003) Takahashi, D., Sato, M., Boku, T.: An OpenMP implementation of parallel FFT and its performance on IA-64 processors. In: Proceedings of International Workshop on OpenMP Applications and Tools (WOMPAT 2003), Lecture Notes in Computer Science, vol. 2716, pp. 99–108. Springer (2003)
16.
go back to reference Van Loan, C.: Computational Frameworks for the Fast Fourier Transform. SIAM Press, Philadelphia, PA (1992)CrossRef Van Loan, C.: Computational Frameworks for the Fast Fourier Transform. SIAM Press, Philadelphia, PA (1992)CrossRef
17.
go back to reference Wadleigh, K.R.: High performance FFT algorithms for cache-coherent multiprocessors. Int. J. High Perform. Comput. Appl. 13, 163–171 (1999)CrossRef Wadleigh, K.R.: High performance FFT algorithms for cache-coherent multiprocessors. Int. J. High Perform. Comput. Appl. 13, 163–171 (1999)CrossRef
Metadata
Title
Parallel FFT Algorithms for Shared-Memory Parallel Computers
Author
Daisuke Takahashi
Copyright Year
2019
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-9965-7_7

Premium Partner