2021 | Original Paper | Book Chapter

Evaluation of the NEC Vector Engine for Legacy CFD Codes

Authors: Keith Obenschain, Yu Yu Khine, Raghunandan Mathur, Gopal Patnaik, Robert Rosenberg

Published in: High Performance Computing

Publisher: Springer International Publishing


Abstract

Many codes still in production use trace their origins to code developed during the vector supercomputing era, from the 1970s to the 1990s. The recently released NEC Vector Engine (VE) provides an opportunity to exploit this vector heritage. The VE can provide state-of-the-art performance without a complete rewrite of a well-validated codebase, and programs do not require an additional level of abstraction to use its capabilities. Given the time and cost required to port or rewrite codes, this is an attractive solution. Further tuning, as described in this paper, can realize maximum performance.
The goal was to assess how the NEC VE's performance and ease of use compare with those of existing CPU architectures (e.g., AMD, Intel) using a legacy Computational Fluid Dynamics (CFD) solver, FDL3DI, written in Fortran. FDL3DI was originally vectorized and optimized for efficient operation on vector processing machines. The NEC VE's architecture, high memory bandwidth, and ability to compile Fortran were the primary motivations for this evaluation.
After the key compute kernels were profiled and modified using typical vector and NEC VE-specific optimizations, the code was able to use the vector engine hardware with minimal modification. Scalar code developed later in FDL3DI's lifetime was replaced with vector-friendly implementations. With these optimizations, the vector architecture was found to be 3× faster than the CPU architectures for main-memory-bound problems, while the CPU architectures remained competitive for smaller problem sizes. Achieving this performance with standard, well-known techniques is considered a key benefit of this architecture.
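To give a sense of what replacing scalar code with a vector-friendly implementation can look like, the following is a minimal sketch in C. It is not taken from FDL3DI or from the paper. It assumes a pattern common in implicit finite-difference CFD solvers: many independent tridiagonal systems with constant coefficients, solved with the Thomas algorithm. The function names, the memory layout `d[i*nsys + s]`, and the constant-coefficient simplification are all assumptions made for illustration. In the first version the innermost loop runs along the recurrence, so it cannot be vectorized. In the second version the recurrence sits in the outer loop, and the innermost loop runs over independent systems with unit stride, which is the form vectorizing compilers generally handle well.

```c
#include <stddef.h>

/* Hypothetical layout: nsys independent tridiagonal systems of length n.
 * Right-hand sides are stored as d[i*nsys + s], so the system index s is
 * contiguous in memory. a, b, c are the constant sub-, main- and
 * super-diagonal coefficients. cp is a work array of length n.
 * On return, d holds the solutions. */

/* Scalar-style ordering: one system at a time. The inner loops over i
 * carry a recurrence (each step needs the previous one), so they do not
 * vectorize, and memory is accessed with stride nsys. */
void solve_system_outer(size_t n, size_t nsys, double a, double b, double c,
                        double *d, double *cp)
{
    for (size_t s = 0; s < nsys; ++s) {
        cp[0] = c / b;
        d[s] /= b;
        for (size_t i = 1; i < n; ++i) {          /* forward elimination */
            double m = b - a * cp[i - 1];
            cp[i] = c / m;
            d[i * nsys + s] = (d[i * nsys + s] - a * d[(i - 1) * nsys + s]) / m;
        }
        for (size_t i = n - 1; i-- > 0; ) {       /* back substitution */
            d[i * nsys + s] -= cp[i] * d[(i + 1) * nsys + s];
        }
    }
}

/* Vector-friendly ordering: the recurrence over i is the outer loop, and
 * the inner loop over s touches independent systems with unit stride. */
void solve_system_inner(size_t n, size_t nsys, double a, double b, double c,
                        double *d, double *cp)
{
    cp[0] = c / b;
    for (size_t s = 0; s < nsys; ++s) {
        d[s] /= b;
    }
    for (size_t i = 1; i < n; ++i) {              /* forward elimination */
        double m = b - a * cp[i - 1];
        double *row = d + i * nsys;
        const double *prev = d + (i - 1) * nsys;
        cp[i] = c / m;
        for (size_t s = 0; s < nsys; ++s) {       /* independent, stride 1 */
            row[s] = (row[s] - a * prev[s]) / m;
        }
    }
    for (size_t i = n - 1; i-- > 0; ) {           /* back substitution */
        double *row = d + i * nsys;
        const double *next = d + (i + 1) * nsys;
        for (size_t s = 0; s < nsys; ++s) {       /* independent, stride 1 */
            row[s] -= cp[i] * next[s];
        }
    }
}
```

Both functions perform the same arithmetic on each element; only the loop order differs. Whether a given compiler actually vectorizes the inner loops depends on the compiler and its flags, so the compiler's vectorization report is the place to confirm it.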

Metadata
Title
Evaluation of the NEC Vector Engine for Legacy CFD Codes
Authors
Keith Obenschain
Yu Yu Khine
Raghunandan Mathur
Gopal Patnaik
Robert Rosenberg
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-78713-4_14
