Published in: The Journal of Supercomputing 6/2020

03.08.2019

STEEL-RT: combining single task–single executor model and expanded scheduling to ease heterogeneity exploitation

Authors: Antón Rey, Francisco D. Igual, Manuel Prieto-Matías


Abstract

The growing heterogeneity and available parallelism of modern architectures, together with an expanding software ecosystem of application libraries and runtime systems, pose a new programmability challenge on top of the already difficult task of extracting high parallel performance. With the goal of simplifying programmability to the extreme, we first present the single task–single executor (STSE) model, in which the execution entry point consists solely of a single task assigned to an abstract execution context, or executor, that represents the whole system platform. As a second contribution, we introduce expanded scheduling operations, which endow executors with additional runtime task-scheduling capabilities, and explain how executors can be composed to express and exploit different architectural features at run time within a single/multiple–task/executor taxonomy. Building on these concepts, we introduce STEEL, a novel programming model, and STEEL-RT, a coupled high-performance runtime implementation of it. We illustrate the potential of STEEL through runtime experiments on a modern heterogeneous architecture, showing that complex and efficient heterogeneous executions are accessible from an STSE approach.
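The abstract does not show STEEL's interface, so the following is only a toy sketch of the idea it describes: one root task handed to one executor that stands for the whole platform, with that executor internally splitting the work across composed child executors. Every name here (`Task`, `Executor`, `CompositeExecutor`, `run`) is hypothetical and is not taken from STEEL or STEEL-RT. The round-robin split is a simple stand-in for the paper's expanded scheduling operations, whose actual definition is not given in this excerpt.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class Task:
    """Hypothetical unit of work: a kernel applied to a block of data."""
    name: str
    data: Sequence[float]
    kernel: Callable[[Sequence[float]], float]


class Executor:
    """Hypothetical abstract execution context that runs one task."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.log: List[str] = []  # names of tasks this executor ran

    def run(self, task: Task) -> float:
        self.log.append(task.name)
        return task.kernel(task.data)


class CompositeExecutor(Executor):
    """Executor composed of child executors (assumes at least one child).

    The caller still submits a single task; the split below is an
    illustrative stand-in for runtime scheduling decisions.
    """

    def __init__(self, name: str, children: Sequence[Executor]) -> None:
        super().__init__(name)
        self.children = list(children)

    def run(self, task: Task) -> float:
        self.log.append(task.name)
        n = len(self.children)
        chunk = -(-len(task.data) // n)  # ceiling division
        partials = []
        for i, child in enumerate(self.children):
            part = task.data[i * chunk:(i + 1) * chunk]
            if not part:
                continue
            sub = Task(f"{task.name}[{i}]", part, task.kernel)
            partials.append(child.run(sub))
        # Summing partial results is only valid for additive kernels
        # such as the reduction used below.
        return sum(partials)


# Single task, single executor: the whole platform is one entry point.
platform = CompositeExecutor("platform", [Executor("cpu"), Executor("gpu")])
root = Task("reduce", list(range(10)), sum)
result = platform.run(root)
print(result)  # 45
```

The point of the sketch is the calling convention: user code names neither the CPU nor the GPU, and heterogeneity is handled entirely inside the executor hierarchy.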


Footnotes
1
If considered as a set of multi-processors, e.g., a modern CUDA-capable device.
 
Metadata
Title
STEEL-RT: combining single task–single executor model and expanded scheduling to ease heterogeneity exploitation
Authors
Antón Rey
Francisco D. Igual
Manuel Prieto-Matías
Publication date
03.08.2019
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 6/2020
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-019-02955-x
