Skip to main content
Top

2024 | OriginalPaper | Chapter

Using MPIs Non-Blocking Allreduce for Health Checks in Dynamic Simulations

Authors : Jana Gericke, Harald Klimach, Neda Ebrahimi Pour, Sabine Roller

Published in: Parallel and Distributed Computing, Applications and Technologies

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Large-scale simulations often require frequent checks on global conditions that are not directly needed for the computation itself. While such health checks are not an integral part of the numerical algorithm, they serve an important role in controlling and coordinating the simulation. In distributed parallel computations their required communication may negatively impact the actual parallel computation and result in unnecessary synchronization points. We show that by using a non-blocking reduction these synchronization requirements can be loosened and the impact on the actual computation minimized. Further, it enables us to shift the communication into the background and progress it during the MPI calls that are done during the computation anyway. We demonstrate that a sufficient amount of MPI calls in between is required to allow for progress to happen. The presented approach delays the decision to be made in response to those health checks. But as it is not vital for the correct computation itself such a delay is usually tolerable and could offer a more robust scaling to large process counts.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
3.
go back to reference Roller, S., Bernsdorf, J., Klimach, H. et al.: An adaptable simulation framework based on a linearized octree. In: Resch, M., Wang, X., Bez, W., Focht, E., Kobayashi, H., Roller, S. (eds.) High Performance Computing on Vector Systems 2011, pp. 93–105. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22244-3_7 Roller, S., Bernsdorf, J., Klimach, H. et al.: An adaptable simulation framework based on a linearized octree. In: Resch, M., Wang, X., Bez, W., Focht, E., Kobayashi, H., Roller, S. (eds.) High Performance Computing on Vector Systems 2011, pp. 93–105. Springer, Heidelberg (2011). https://​doi.​org/​10.​1007/​978-3-642-22244-3_​7
5.
go back to reference Kandalla, K., Yang, U., Keasler, J., et al.: Designing non-blocking allreduce with collective offload on infiniband clusters: a case study with conjugate gradient solvers. In: IEEE 26th International Parallel and Distributed Processing Symposium, pp. 1156–1167 (2012) Kandalla, K., Yang, U., Keasler, J., et al.: Designing non-blocking allreduce with collective offload on infiniband clusters: a case study with conjugate gradient solvers. In: IEEE 26th International Parallel and Distributed Processing Symposium, pp. 1156–1167 (2012)
6.
7.
go back to reference Eller, P.R., Gropp, W.: Scalable non-blocking preconditioned conjugate gradient methods. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 204–215. (2016). https://doi.org/10.1109/SC.2016.17 Eller, P.R., Gropp, W.: Scalable non-blocking preconditioned conjugate gradient methods. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 204–215. (2016). https://​doi.​org/​10.​1109/​SC.​2016.​17
8.
go back to reference Eller, P.R., Hoefler, T., Gropp, W.: Using performance models to understand scalable Krylov solver performance at scale for structured grid problems. In: Proceedings of the ACM International Conference on Supercomputing, pp. 138–149. New York, USA (2019). https://doi.org/10.1145/3330345.3330358 Eller, P.R., Hoefler, T., Gropp, W.: Using performance models to understand scalable Krylov solver performance at scale for structured grid problems. In: Proceedings of the ACM International Conference on Supercomputing, pp. 138–149. New York, USA (2019). https://​doi.​org/​10.​1145/​3330345.​3330358
10.
go back to reference Hasert, M., Masilamani, K., Zimny, S., et al.: Complex fluid simulations with the parallel tree-based Lattice Boltzmann solver Musubi. J. Comput. Sci.Comput. Sci. 5(5), 784–794 (2014)MathSciNet Hasert, M., Masilamani, K., Zimny, S., et al.: Complex fluid simulations with the parallel tree-based Lattice Boltzmann solver Musubi. J. Comput. Sci.Comput. Sci. 5(5), 784–794 (2014)MathSciNet
Metadata
Title
Using MPIs Non-Blocking Allreduce for Health Checks in Dynamic Simulations
Authors
Jana Gericke
Harald Klimach
Neda Ebrahimi Pour
Sabine Roller
Copyright Year
2024
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-99-8211-0_3

Premium Partner