nach oben

The Journal of Supercomputing

Erschienen in:

03.08.2019

STEEL-RT: combining single task–single executor model and expanded scheduling to ease heterogeneity exploitation

verfasst von: Antón Rey, Francisco D. Igual, Manuel Prieto-Matías

Erschienen in: The Journal of Supercomputing | Ausgabe 6/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

The ever-increasing complexity in terms of heterogeneity and available parallelism of modern architectures, together with the growing software ecosystem in terms of application libraries and runtime systems are posing a new challenge in terms of programmability on top of the already difficult task of extracting high parallel performance. With the goal of simplifying the programmability at the extreme, we first present the single task–single executor (STSE) model, in which the execution entry point consists solely on a single task assigned to an abstract execution context or executor, representing the whole system platform. As a second contribution, we introduce the expanded scheduling operations that endow executors with additional runtime task scheduling capabilities, and explain how executors can be composed to express and exploit different architectural features at run time under the framework of a single/multiple–task/executor taxonomy. As a result, we introduce STEEL, a novel programming model built on these concepts and STEEL-RT, a coupled high-performance runtime implementation for it. We illustrate the potential of STEEL by means of runtime experiments on a modern heterogeneous architecture, showing that complex and efficient heterogeneous executions are accessible from a STSE approach.

Vorheriger Artikel InKS: a programming model to decouple algorithm from optimization in HPC codes

Nächster Artikel Hybrid scheduling to enhance reliability of real-time tasks running on reconfigurable devices

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

If considered as a set of multi-processors, e.g., a modern CUDA-capable device.

http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2018/p0761r2.pdf.

Bueno J, Planas J, Duran A, Badia RM, Martorell X, Ayguade E, Labarta J (2012) Productive programming of GPU clusters with OmpSs. In: IPDPS 2012, pp 557–568

Augonnet C, Thibault S, Namyst R, Wacrenier P (2011) StarPU a unified platform for task scheduling on heterogeneous multicore architectures. CCPE 23(2):187–198

Openmp Architecture, Review Board, Openmp Architecture, Review Board, Openmp Architecture, and Review Board. OpenMP Application Programming Interface. (November):359, 2015

Pheatt C (2008) Intel threading building blocks. J Comput Sci Coll 23(4):298–298

Heller T, Diehl P, Byerly Z, Biddiscombe J, Kaiser H (2017) HPX—an open source C++ standard library for parallelism and concurrency. In: Proceedings of OpenSuCo 2017, Denver, Colorado USA, November 2017 (OpenSuCo17), p 5

Rey A, Igual FD, Prieto-Matías M (2016) Hesp: a simulation framework for solving the task scheduling-partitioning problem on heterogeneous architectures. CoRR, arXiv:abs/1602.05510

Anderson E, Bai Z, Demmel J, Dongarra JE, DuCroz J, Greenbaum A, Hammarling S, McKenney AE, Ostrouchov S, Sorensen D (1992) LAPACK users’ guide. SIAM, PhiladelphiaMATH

Barrachina S, Castillo M, Igual FD, Mayo R, Quintana-Ortí ES (2008) Solving dense linear systems on graphics processors. In: Luque E, Margalef T, Benítez D (eds) Proceedings of the 14th International Euro-Par Conference, Lecture Notes in Computer Science, vol 5168. Springer, pp 739–748

El-Ghazawi T, Smith L (2006) Upc: unified parallel c. In: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, SC ’06, New York, NY, USA, 2006. ACM

10.

Wienke S, Springer P, Terboven C, an Mey D (2012) Openacc: first experiences with real-world applications. In: Proceedings of the 18th International Conference on Parallel Processing, Euro-Par’12. Springer, Berlin, pp 859–870CrossRef

11.

Planas J, Badia RM, Ayguade E, Labarta J (2013) Self-adaptive OmpSs tasks in heterogeneous environments. In: 2013 IEEE 27th International Symposium on IPDPS, pp 138–149

12.

Hornung RD, Keasler JA (2014) The RAJA Portability Layer: Overview and Status. United States. https://doi.org/10.2172/1169830

13.

Zheng Y, Kamil A, Driscoll MB, Shan H, Yelick K (2014) UPC++: a PGAS extension for C++. In: IPDPS 2014, vol 00, pp 1105–1114

14.

Keryell R, Reyes R, Howes L (2015) Khronos SYCL for OPenCL. In: Proceedings of the 3rd International Workshop on OpenCL, IWOCL ’15, pp 24:1–24:1, New York. ACM

15.

Carter Edwards H, Trott CR, Sunderland D (2014) Kokkos. J Parallel Distrib Comput 74(12):3202–3216CrossRef

16.

De Sensi D, De Matteis T, Torquati M, Mencagli G, Danelutto M (2017) Bringing parallel patterns out of the corner: the p3 arsec benchmark suite. ACM Trans Archit Code Optim 14(4):33:1–33:26CrossRef

17.

del Rio Astorga D, Dolz MF, Fernández J, Daniel García J (2017) A generic parallel pattern interface for stream and data processing. Concurr Comput Pract Exp 29(24):e4175 (e4175 cpe.4175)CrossRef

Titel: STEEL-RT: combining single task–single executor model and expanded scheduling to ease heterogeneity exploitation
verfasst von: Antón Rey
Francisco D. Igual
Manuel Prieto-Matías
Publikationsdatum: 03.08.2019
Verlag: Springer US
Erschienen in: The Journal of Supercomputing / Ausgabe 6/2020
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI: https://doi.org/10.1007/s11227-019-02955-x

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Weitere Artikel der Ausgabe 6/2020

Packet classification based on the decision tree with information entropy

A self-adjusting quantum key renewal management scheme in classical network symmetric cryptography

Feature fatigue analysis of product usability using Hybrid ant colony optimization with artificial bee colony approach

Profit and resource availability-constrained optimal handling of high-performance scientific computing tasks

A proposed method for the improvement in biometric facial image recognition using document-based classification

A graph-based model to improve social trust and influence for social recommendation