Skip to main content
Erschienen in: The Journal of Supercomputing 3/2015

01.03.2015

Locality-aware policies to improve job scheduling on 3D tori

verfasst von: Jose A. Pascual, Jose Miguel-Alonso, Jose A. Lozano

Erschienen in: The Journal of Supercomputing | Ausgabe 3/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper studies the influence that contiguous job placement has on the performance of schedulers for large-scale computing systems. In contrast with non-contiguous strategies, contiguous partitioning enables the exploitation of communication locality in applications, and also reduces inter-application interference. However, contiguous partitioning increases scheduling times and system fragmentation, degrading system utilization. We propose and evaluate several strategies to select contiguous partitions to allocate incoming jobs. These strategies are used in combination with different mapping mechanisms to perform the task-to-node assignment in order to further reduce application run times. A simulation-based study has been carried out, using a collection of synthetic applications performing common communication patterns. Results show that the exploitation of communication locality by means of a correct partitioning–mapping results in an effective reduction of application run times, and the gains achieved more than compensate the scheduling inefficiency, therefore resulting in better overall system performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Pascual JA, Miguel-Alonso J, Lozano JA (2011) Optimization-based mapping framework for parallel applications. J Parallel Distrib Comput 71(10):1377–1387CrossRef Pascual JA, Miguel-Alonso J, Lozano JA (2011) Optimization-based mapping framework for parallel applications. J Parallel Distrib Comput 71(10):1377–1387CrossRef
2.
Zurück zum Zitat Navaridas J, Miguel-Alonso J, Pascual JA, Ridruejo FJ (2011) Simulating and evaluating interconnection networks with INSEE. Simul Model Pract Theory 19(1):494–515CrossRef Navaridas J, Miguel-Alonso J, Pascual JA, Ridruejo FJ (2011) Simulating and evaluating interconnection networks with INSEE. Simul Model Pract Theory 19(1):494–515CrossRef
3.
Zurück zum Zitat Feitelson DG, Rudolph L, Schwiegelshohn U (2005) Parallel job scheduling—a status report. In: Feitelson DG, Rudolph L (eds) Job scheduling strategies for parallel processing. Springer, Berlin, pp 1–16 Feitelson DG, Rudolph L, Schwiegelshohn U (2005) Parallel job scheduling—a status report. In: Feitelson DG, Rudolph L (eds) Job scheduling strategies for parallel processing. Springer, Berlin, pp 1–16
4.
Zurück zum Zitat Bender MA, Bunde DP, Demaine ED, Fekete SP, Leung VJ, Meijer H, Phillips CA (2008) Communication-aware processor allocation for supercomputers: finding point sets of small average distance. Algorithmica 50(2):279–298CrossRefMATHMathSciNet Bender MA, Bunde DP, Demaine ED, Fekete SP, Leung VJ, Meijer H, Phillips CA (2008) Communication-aware processor allocation for supercomputers: finding point sets of small average distance. Algorithmica 50(2):279–298CrossRefMATHMathSciNet
5.
Zurück zum Zitat Lo V, Windisch K, Liu W, Nitzberg B (1997) Noncontiguous processor allocation algorithms for mesh-connected multicomputers. IEEE Trans Parallel Distrib Syst 8(7):712–726CrossRef Lo V, Windisch K, Liu W, Nitzberg B (1997) Noncontiguous processor allocation algorithms for mesh-connected multicomputers. IEEE Trans Parallel Distrib Syst 8(7):712–726CrossRef
6.
Zurück zum Zitat Pascual JA, Miguel-Alonso J, Lozano JA (2014) A fast implementation of the first-fit contiguous partitioning strategy for cubic topologies. Concurr Comput: Pract Exper 26(17):2792–2810CrossRef Pascual JA, Miguel-Alonso J, Lozano JA (2014) A fast implementation of the first-fit contiguous partitioning strategy for cubic topologies. Concurr Comput: Pract Exper 26(17):2792–2810CrossRef
8.
Zurück zum Zitat Bhatele A, Kalé LV (2008) Benefits of topology aware mapping for mesh interconnects. Parallel Process Lett 18(4):549–566CrossRefMathSciNet Bhatele A, Kalé LV (2008) Benefits of topology aware mapping for mesh interconnects. Parallel Process Lett 18(4):549–566CrossRefMathSciNet
9.
Zurück zum Zitat Smith BE, Bode B (2005) Performance effects of node mappings on the IBM blue gene/l machine. In Proceedings of the 11th international Euro-Par conference on parallel processing. Springer, Berlin, pp 1005–1013 Smith BE, Bode B (2005) Performance effects of node mappings on the IBM blue gene/l machine. In Proceedings of the 11th international Euro-Par conference on parallel processing. Springer, Berlin, pp 1005–1013
10.
Zurück zum Zitat Yu H, Chung I-H, Moreira J (2006) Topology mapping for Blue Gene/L supercomputer. In: Proceedings of the 2006 ACM/IEEE conference on supercomputing, New York, NY, USA, 2006. ACM Yu H, Chung I-H, Moreira J (2006) Topology mapping for Blue Gene/L supercomputer. In: Proceedings of the 2006 ACM/IEEE conference on supercomputing, New York, NY, USA, 2006. ACM
11.
Zurück zum Zitat Bani-Mohammad S, Ould-Khaoua M, Ababneh I, Mackenzie LM (2009) Comparative evaluation of contiguous allocation strategies on 3d mesh multicomputers. J Syst Softw 82(2):307–318CrossRef Bani-Mohammad S, Ould-Khaoua M, Ababneh I, Mackenzie LM (2009) Comparative evaluation of contiguous allocation strategies on 3d mesh multicomputers. J Syst Softw 82(2):307–318CrossRef
12.
Zurück zum Zitat Kang M, Yu C, Youn HY, Lee B, Kim M (2003) Isomorphic strategy for processor allocation in k-ary n-cube systems. IEEE Trans Comput 52(5):645–657CrossRef Kang M, Yu C, Youn HY, Lee B, Kim M (2003) Isomorphic strategy for processor allocation in k-ary n-cube systems. IEEE Trans Comput 52(5):645–657CrossRef
13.
Zurück zum Zitat Windisch K, Lo V, Bose B (1995) Contiguous and non-contiguous processor allocation algorithms for k-ary n-cubes. IEEE Trans Parallel Distrib Syst 8:712–726 Windisch K, Lo V, Bose B (1995) Contiguous and non-contiguous processor allocation algorithms for k-ary n-cubes. IEEE Trans Parallel Distrib Syst 8:712–726
14.
Zurück zum Zitat Broeg B, Bose B, Kwon Y, Ashir Y (1995) Lee distance and topological properties of k-ary n-cubes. IEEE Trans Comput 44(8):1021–1030CrossRefMATHMathSciNet Broeg B, Bose B, Kwon Y, Ashir Y (1995) Lee distance and topological properties of k-ary n-cubes. IEEE Trans Comput 44(8):1021–1030CrossRefMATHMathSciNet
15.
Zurück zum Zitat Navaridas J, Miguel-Alonso J (2009) Realistic evaluation of interconnection networks using synthetic traffic. In: Proceedings of the 2009 eighth international symposium on parallel and distributed computing, pp 249–252, Lisbon, Portugal, 2009. IEEE Computer Society Navaridas J, Miguel-Alonso J (2009) Realistic evaluation of interconnection networks using synthetic traffic. In: Proceedings of the 2009 eighth international symposium on parallel and distributed computing, pp 249–252, Lisbon, Portugal, 2009. IEEE Computer Society
16.
Zurück zum Zitat Navaridas J, Miguel-Alonso J, Ridruejo F (2008) On synthesizing workloads emulating mpi applications. In: IEEE international symposium on parallel and distributed processing, IPDPS, April 2008, pp 1–8, Miami, Florida Navaridas J, Miguel-Alonso J, Ridruejo F (2008) On synthesizing workloads emulating mpi applications. In: IEEE international symposium on parallel and distributed processing, IPDPS, April 2008, pp 1–8, Miami, Florida
17.
Zurück zum Zitat Puente V, Izu C, Beivide R, Gregorio J, Vallejo F, Prellezo J (2001) The adaptive bubble router. J Parallel Distrib Comput 61(9):1180–1208CrossRefMATH Puente V, Izu C, Beivide R, Gregorio J, Vallejo F, Prellezo J (2001) The adaptive bubble router. J Parallel Distrib Comput 61(9):1180–1208CrossRefMATH
18.
Zurück zum Zitat Pascual JA, Navaridas J, Miguel-Alonso J (2009) Effects of topology-aware allocation policies on scheduling performance. In: Job scheduling strategies for parallel processing (IPDPS), Rome, Italy. Springer, Berlin, pp 138–156 Pascual JA, Navaridas J, Miguel-Alonso J (2009) Effects of topology-aware allocation policies on scheduling performance. In: Job scheduling strategies for parallel processing (IPDPS), Rome, Italy. Springer, Berlin, pp 138–156
19.
Zurück zum Zitat Dally W, Towles B (2003) Principles and practices of interconnection networks. Morgan Kaufmann, San Francisco, CA, USA Dally W, Towles B (2003) Principles and practices of interconnection networks. Morgan Kaufmann, San Francisco, CA, USA
20.
Zurück zum Zitat Sheskin DJ (2007) Handbook of parametric and nonparametric statistical procedures, 4th edn. Chapman & Hall/CRC, London Sheskin DJ (2007) Handbook of parametric and nonparametric statistical procedures, 4th edn. Chapman & Hall/CRC, London
21.
Zurück zum Zitat Tsafrir D, Etsion Y, Feitelson DG (2007) Backfilling using system-generated predictions rather than user runtime estimates. IEEE Trans Parallel Distrib Syst 18(6):789–803CrossRef Tsafrir D, Etsion Y, Feitelson DG (2007) Backfilling using system-generated predictions rather than user runtime estimates. IEEE Trans Parallel Distrib Syst 18(6):789–803CrossRef
22.
Zurück zum Zitat Liu W, Lo V, Windisch K, Nitzberg B (1994) Non-contiguous processor allocation algorithms for distributed memory multicomputers. In: Proceedings of the 1994 ACM/IEEE conference on supercomputing, Supercomputing ’94, pp 227–236, Los Alamitos, CA, USA. IEEE Computer Society Liu W, Lo V, Windisch K, Nitzberg B (1994) Non-contiguous processor allocation algorithms for distributed memory multicomputers. In: Proceedings of the 1994 ACM/IEEE conference on supercomputing, Supercomputing ’94, pp 227–236, Los Alamitos, CA, USA. IEEE Computer Society
23.
Zurück zum Zitat Johnson CR, Bunde DP, Leung V J (2010) A Tie-breaking strategy for processor allocation in meshes. In: 39th International conference on parallel processing, ICPP workshops 2010, San Diego, California, USA, 13–16 September 2010, pp 331–338. IEEE Computer Society Johnson CR, Bunde DP, Leung V J (2010) A Tie-breaking strategy for processor allocation in meshes. In: 39th International conference on parallel processing, ICPP workshops 2010, San Diego, California, USA, 13–16 September 2010, pp 331–338. IEEE Computer Society
24.
Zurück zum Zitat Walker P, Bunde DP, Leung VJ (2010) Faster high-quality processor allocation. In: Proceedings of the 11th LCI international conference on high-performance cluster computing, 2010 Walker P, Bunde DP, Leung VJ (2010) Faster high-quality processor allocation. In: Proceedings of the 11th LCI international conference on high-performance cluster computing, 2010
26.
Zurück zum Zitat Bhatele A, Gupta G, Kale L, Chung I-H (2010) Automated mapping of regular communication graphs on mesh interconnects. In: 2010 International conference on high performance computing (HiPC), Dec 2010, pp 1–10 Bhatele A, Gupta G, Kale L, Chung I-H (2010) Automated mapping of regular communication graphs on mesh interconnects. In: 2010 International conference on high performance computing (HiPC), Dec 2010, pp 1–10
27.
Zurück zum Zitat Balzuweit E, Bunde DP, Leung VJ, Finley A, Lee ACS (2014) Local search to improve task mapping. In: Proceedings of the 7th international workshop on parallel programming models and systems software for high-end computing (P2S2). IEEE Balzuweit E, Bunde DP, Leung VJ, Finley A, Lee ACS (2014) Local search to improve task mapping. In: Proceedings of the 7th international workshop on parallel programming models and systems software for high-end computing (P2S2). IEEE
28.
Zurück zum Zitat Meisner D, Gold BT, Wenisch TF (2009) Powernap: eliminating server idle power. In: Proceedings of the 14th international conference on architectural support for programming languages and operating systems, ASPLOS ’09, pp 205–216, New York, NY, USA, 2009. ACM Meisner D, Gold BT, Wenisch TF (2009) Powernap: eliminating server idle power. In: Proceedings of the 14th international conference on architectural support for programming languages and operating systems, ASPLOS ’09, pp 205–216, New York, NY, USA, 2009. ACM
Metadaten
Titel
Locality-aware policies to improve job scheduling on 3D tori
verfasst von
Jose A. Pascual
Jose Miguel-Alonso
Jose A. Lozano
Publikationsdatum
01.03.2015
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 3/2015
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-014-1347-y

Weitere Artikel der Ausgabe 3/2015

The Journal of Supercomputing 3/2015 Zur Ausgabe