2009 | OriginalPaper | Buchkapitel
StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures
verfasst von : Cédric Augonnet, Samuel Thibault, Raymond Namyst, Pierre-André Wacrenier
Erschienen in: Euro-Par 2009 Parallel Processing
Verlag: Springer Berlin Heidelberg
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
In the field of HPC, the current hardware trend is to design multiprocessor architectures that feature heterogeneous technologies such as specialized coprocessors (
e.g.
Cell/BE SPUs) or data-parallel accelerators (
e.g.
GPGPUs).
Approaching the theoretical performance of these architectures is a complex issue. Indeed, substantial efforts have already been devoted to efficiently offload parts of the computations. However, designing an execution model that unifies all computing units and associated embedded memory remains a main challenge.
We have thus designed
StarPU
, an original runtime system providing a high-level, unified execution model tightly coupled with an expressive data management library. The main goal of
StarPU
is to provide numerical kernel designers with a convenient way to generate parallel tasks over heterogeneous hardware on the one hand, and easily develop and tune powerful scheduling algorithms on the other hand.
We have developed several strategies that can be selected seamlessly at run time, and we have demonstrated their efficiency by analyzing the impact of those scheduling policies on several classical linear algebra algorithms that take advantage of multiple cores and
GPU
s at the same time. In addition to substantial improvements regarding execution times, we obtained consistent
superlinear
parallelism by actually
exploiting
the heterogeneous nature of the machine.