Skip to main content
Erschienen in: The Journal of Supercomputing 1/2014

01.07.2014

FuPerMod: a software tool for the optimization of data-parallel applications on heterogeneous platforms

verfasst von: David Clarke, Ziming Zhong, Vladimir Rychkov, Alexey Lastovetsky

Erschienen in: The Journal of Supercomputing | Ausgabe 1/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Optimization of data-parallel applications for modern HPC platforms requires partitioning the computations between the heterogeneous computing devices in proportion to their speed. Heterogeneous data partitioning algorithms are based on computation performance models of the executing platforms. Their implementation is not trivial as it requires: accurate and efficient benchmarking of computing devices, which may share resources and/or execute different codes; appropriate interpolation methods to predict performance; and advanced mathematical methods to solve the data partitioning problem. In this paper, we present FuPerMod, a software tool that addresses these implementation issues and automates the development of data partitioning code in data-parallel applications for heterogeneous HPC platforms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Aubanel E, Wu X (2007) Incorporating latency in heterogeneous graph partitioning. In: IPDPS 2007, pp 1–8 Aubanel E, Wu X (2007) Incorporating latency in heterogeneous graph partitioning. In: IPDPS 2007, pp 1–8
2.
Zurück zum Zitat Beaumont O, Boudet V, Rastello F, Robert Y (2001) Matrix multiplication on heterogeneous platforms. IEEE Trans Parallel Distrib Syst 12(10):1033–1051CrossRefMathSciNet Beaumont O, Boudet V, Rastello F, Robert Y (2001) Matrix multiplication on heterogeneous platforms. IEEE Trans Parallel Distrib Syst 12(10):1033–1051CrossRefMathSciNet
3.
Zurück zum Zitat Catalyurek U, Boman E, Devine K et al (2007) Hypergraph-based dynamic load balancing for adaptive scientific computations. In: IPDPS 2007, pp 1–11 Catalyurek U, Boman E, Devine K et al (2007) Hypergraph-based dynamic load balancing for adaptive scientific computations. In: IPDPS 2007, pp 1–11
4.
Zurück zum Zitat Chevalier C, Pellegrini F (2008) PT-Scotch: a tool for efficient parallel graph ordering. Parallel Comput 34(68):318–331CrossRefMathSciNet Chevalier C, Pellegrini F (2008) PT-Scotch: a tool for efficient parallel graph ordering. Parallel Comput 34(68):318–331CrossRefMathSciNet
5.
Zurück zum Zitat Clarke D, Lastovetsky A, Rychkov V (2012) Column-based matrix partitioning for parallel matrix multiplication on heterogeneous processors based on functional performance models. In: HeteroPar’2011, pp 450–459 Clarke D, Lastovetsky A, Rychkov V (2012) Column-based matrix partitioning for parallel matrix multiplication on heterogeneous processors based on functional performance models. In: HeteroPar’2011, pp 450–459
6.
Zurück zum Zitat Clarke D et al (2011) Dynamic load balancing of parallel computational iterative routines on highly heterogeneous HPC platforms. Parallel Process Lett 21:195–217CrossRefMathSciNet Clarke D et al (2011) Dynamic load balancing of parallel computational iterative routines on highly heterogeneous HPC platforms. Parallel Process Lett 21:195–217CrossRefMathSciNet
7.
Zurück zum Zitat Karypis G, Schloegel K (2013) ParMETIS: parallel graph partitioning and sparse matrix ordering library. Version 4 Karypis G, Schloegel K (2013) ParMETIS: parallel graph partitioning and sparse matrix ordering library. Version 4
8.
Zurück zum Zitat Lastovetsky A, Reddy R (2007) Data partitioning with a functional performance model of heterogeneous processors. Int J High Perform C 21:76–90CrossRef Lastovetsky A, Reddy R (2007) Data partitioning with a functional performance model of heterogeneous processors. Int J High Perform C 21:76–90CrossRef
9.
Zurück zum Zitat Lastovetsky A, Reddy R (2010) Distributed data partitioning for heterogeneous processors based on partial estimation of their functional performance models. In: Euro-Par 2009, LNCS, vol 6043. Springer, pp 91–101 Lastovetsky A, Reddy R (2010) Distributed data partitioning for heterogeneous processors based on partial estimation of their functional performance models. In: Euro-Par 2009, LNCS, vol 6043. Springer, pp 91–101
10.
Zurück zum Zitat Malony AD, Biersdorff S, Shende S et al (2011) Parallel performance measurement of heterogeneous parallel systems with GPUs. In: ICPP ’11, pp 176–185 Malony AD, Biersdorff S, Shende S et al (2011) Parallel performance measurement of heterogeneous parallel systems with GPUs. In: ICPP ’11, pp 176–185
11.
Zurück zum Zitat Rychkov V, Clarke D, Lastovetsky A (2011) Using multidimensional solvers for optimal data partitioning on dedicated heterogeneous HPC platforms. In: PaCT-2011, LNCS, vol 6873. Springer, pp 332–346 Rychkov V, Clarke D, Lastovetsky A (2011) Using multidimensional solvers for optimal data partitioning on dedicated heterogeneous HPC platforms. In: PaCT-2011, LNCS, vol 6873. Springer, pp 332–346
12.
Zurück zum Zitat Walshaw C, Cross M (2001) Multilevel mesh partitioning for heterogeneous communication networks. Future Gener Comput Syst 17(5):601–623CrossRef Walshaw C, Cross M (2001) Multilevel mesh partitioning for heterogeneous communication networks. Future Gener Comput Syst 17(5):601–623CrossRef
13.
Zurück zum Zitat Zhong Z, Rychkov V, Lastovetsky A (2012) Data partitioning on heterogeneous multicore and multi-GPU systems using functional performance models of data-parallel applications. In: Cluster, pp 191–199 Zhong Z, Rychkov V, Lastovetsky A (2012) Data partitioning on heterogeneous multicore and multi-GPU systems using functional performance models of data-parallel applications. In: Cluster, pp 191–199
Metadaten
Titel
FuPerMod: a software tool for the optimization of data-parallel applications on heterogeneous platforms
verfasst von
David Clarke
Ziming Zhong
Vladimir Rychkov
Alexey Lastovetsky
Publikationsdatum
01.07.2014
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 1/2014
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-014-1207-9

Weitere Artikel der Ausgabe 1/2014

The Journal of Supercomputing 1/2014 Zur Ausgabe