Skip to main content
Erschienen in: The Journal of Supercomputing 1/2019

19.09.2018

Mesh-of-Torus: a new topology for server-centric data center networks

verfasst von: Peibo Xie, Huaxi Gu, Kun Wang, Xiaoshan Yu, Shangqi Ma

Erschienen in: The Journal of Supercomputing | Ausgabe 1/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Various topologies have been proposed for high-performance computing (HPC), i.e., fat-tree, Torus topology. Compared with conventional fat-tree topology, Torus performs much better when applied in HPC. Unfortunately, due to its wraparound links, Torus topology naturally has the tendency to trigger deadlock incidents inside the network. Researchers solve this problem by means of virtual channel, but this approach will also restrict the routing of message. In this paper, we propose a deadlock-free topology for HPC, called Mesh-of-Torus, which incarnates the good characteristics of Mesh and Torus topology. Comparing with mesh, Mesh-of-Torus has shorter network diameter. Furthermore, we have proposed a corresponding port assignment rules in consideration of complicated internal arbitration or scheduling mechanism incurred by the employment of virtual channel. Deadlock avoidance can be achieved when dimension-order routing algorithm and our port assignment rules are applied to Mesh-of-Torus. Finally, simulations and mathematical analysis have shown that Mesh-of-Torus outperforms Mesh in terms of average end-to-end latency and network load distribution.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Arabnia HR, Oliver MA (1987) A transfer network for the arbitrary rotation of digitised images. Comput J 30(5):425–432CrossRef Arabnia HR, Oliver MA (1987) A transfer network for the arbitrary rotation of digitised images. Comput J 30(5):425–432CrossRef
2.
Zurück zum Zitat Wijngaart RFVD, Georganas E, Mattson TG, Wissink A (2017) A new parallel research kernel to expand research on dynamic load-balancing capabilities. In: International Supercomputing Conference Wijngaart RFVD, Georganas E, Mattson TG, Wissink A (2017) A new parallel research kernel to expand research on dynamic load-balancing capabilities. In: International Supercomputing Conference
3.
Zurück zum Zitat Bhandarkar SM, Arabnia HR (1995) The REFINE multiprocessor-theoretical properties and algorithms. Parallel Comput 21(11):1783–1805CrossRef Bhandarkar SM, Arabnia HR (1995) The REFINE multiprocessor-theoretical properties and algorithms. Parallel Comput 21(11):1783–1805CrossRef
4.
Zurück zum Zitat Ding M, Tian H (2016) PCA-based network traffic anomaly detection. Tsinghua Sci Technol 21(2):500–509CrossRef Ding M, Tian H (2016) PCA-based network traffic anomaly detection. Tsinghua Sci Technol 21(2):500–509CrossRef
5.
Zurück zum Zitat Alonso P, Ranilla J, Aguiar JV (2017) High-performance computing. J Supercomput 73(1):1–3CrossRef Alonso P, Ranilla J, Aguiar JV (2017) High-performance computing. J Supercomput 73(1):1–3CrossRef
7.
Zurück zum Zitat Chen D, Eisley NA, Heidelberger P, Senger RM, Sugawara Y, Kumar S, Salapura V, Satterfield DL, Burow BS., Parker JJ (2011) The IBM Blue Gene/Q interconnection network and message unit. In: High Performance Computing, Networking, Storage and Analysis Chen D, Eisley NA, Heidelberger P, Senger RM, Sugawara Y, Kumar S, Salapura V, Satterfield DL, Burow BS., Parker JJ (2011) The IBM Blue Gene/Q interconnection network and message unit. In: High Performance Computing, Networking, Storage and Analysis
8.
Zurück zum Zitat Xenopoulos P, Daniel J, Matheson M, Sukumar S (2016) Big data analytics on HPC architectures: performance and cost. In: IEEE International Conference on Big Data Xenopoulos P, Daniel J, Matheson M, Sukumar S (2016) Big data analytics on HPC architectures: performance and cost. In: IEEE International Conference on Big Data
9.
Zurück zum Zitat González ÁF, RosilloEmai R, Dávila JÁM, Matellán V (2015) Historical review and future challenges in supercomputing and networks of scientific communication. J Supercomput 71(12):4476–4503CrossRef González ÁF, RosilloEmai R, Dávila JÁM, Matellán V (2015) Historical review and future challenges in supercomputing and networks of scientific communication. J Supercomput 71(12):4476–4503CrossRef
10.
Zurück zum Zitat Azad HS, Bagherzadeh N, Jaberipour G (2015) Advances in multicore systems architectures. J Supercomput 71(8):2783–2786CrossRef Azad HS, Bagherzadeh N, Jaberipour G (2015) Advances in multicore systems architectures. J Supercomput 71(8):2783–2786CrossRef
11.
Zurück zum Zitat Bermúdez Garzón DF, Requena CG, Gómez ME, López P, Duato J (2016) A family of fault-tolerant efficient indirect topologies. IEEE Trans Parallel Distrib Syst 27(4):927–940CrossRef Bermúdez Garzón DF, Requena CG, Gómez ME, López P, Duato J (2016) A family of fault-tolerant efficient indirect topologies. IEEE Trans Parallel Distrib Syst 27(4):927–940CrossRef
12.
Zurück zum Zitat Dhanak M, Godbole PD, Patil RA (2016) Torus network labeling in high performance computing. In: International Conference on Computing Communication Control and Automation Dhanak M, Godbole PD, Patil RA (2016) Torus network labeling in high performance computing. In: International Conference on Computing Communication Control and Automation
13.
Zurück zum Zitat Yu Z, Xiang D, Wang X (2015) Balancing virtual channel utilization for deadlock-free routing in torus networks. J Supercomput 71(8):3094–3115CrossRef Yu Z, Xiang D, Wang X (2015) Balancing virtual channel utilization for deadlock-free routing in torus networks. J Supercomput 71(8):3094–3115CrossRef
14.
Zurück zum Zitat Abbas D, Jamshidi K (2015) A fault-tolerant hierarchical hybrid mesh-based wireless network-on-chip architecture for multicore platforms. J Supercomput 71(8):3116–3148CrossRef Abbas D, Jamshidi K (2015) A fault-tolerant hierarchical hybrid mesh-based wireless network-on-chip architecture for multicore platforms. J Supercomput 71(8):3116–3148CrossRef
15.
Zurück zum Zitat Prisacari B, Rodriguez G, Minkenberg C, Palacio RB (2012) Performance implications of deadlock avoidance techniques in torus networks. In: International Conference on High Performance Switching and Routing Prisacari B, Rodriguez G, Minkenberg C, Palacio RB (2012) Performance implications of deadlock avoidance techniques in torus networks. In: International Conference on High Performance Switching and Routing
16.
Zurück zum Zitat Puente V, Beivide R, Gregorio JA, Prellezo JM, Duato J, Izu C (1999) Adaptive bubble router: a design to improve performance in torus networks. In: International Conference on Parallel Processing Puente V, Beivide R, Gregorio JA, Prellezo JM, Duato J, Izu C (1999) Adaptive bubble router: a design to improve performance in torus networks. In: International Conference on Parallel Processing
17.
Zurück zum Zitat Jeong YS, Lee SE (2013) Deadlock-free XY-YX router for on-chip interconnection network. Ieice Electron Express 10(20):20130699CrossRef Jeong YS, Lee SE (2013) Deadlock-free XY-YX router for on-chip interconnection network. Ieice Electron Express 10(20):20130699CrossRef
18.
Zurück zum Zitat Yu Z, Wang X, Shen K (2016) Conditional forwarding: simple flow control to increase adaptivity for fully adaptive routing algorithms. J Supercomput 72(2):639–653CrossRef Yu Z, Wang X, Shen K (2016) Conditional forwarding: simple flow control to increase adaptivity for fully adaptive routing algorithms. J Supercomput 72(2):639–653CrossRef
19.
Zurück zum Zitat Boden NJ, Cohen D, Felderman RE (1995) Myrinet: a gigabit-per-second local area network. Micro IEEE 15(1):29–36CrossRef Boden NJ, Cohen D, Felderman RE (1995) Myrinet: a gigabit-per-second local area network. Micro IEEE 15(1):29–36CrossRef
20.
Zurück zum Zitat Veselovsky G, Batovski DA (2003) A study of the permutation capability of a binary hypercube under deterministic dimension-order routing. In: Parallel, Distributed and Network-Based Processing Veselovsky G, Batovski DA (2003) A study of the permutation capability of a binary hypercube under deterministic dimension-order routing. In: Parallel, Distributed and Network-Based Processing
21.
Zurück zum Zitat Ren P, Kinsy MA, Zheng N (2016) Fault-aware load-balancing routing for 2D-mesh and torus on-chip network topologies. IEEE Trans Comput 65(3):873–887MathSciNetCrossRefMATH Ren P, Kinsy MA, Zheng N (2016) Fault-aware load-balancing routing for 2D-mesh and torus on-chip network topologies. IEEE Trans Comput 65(3):873–887MathSciNetCrossRefMATH
22.
Zurück zum Zitat Šeda M, Šedová J, Horký M (2017) Multichannel queueing systems and their simulation. In: Applied Physics, System Science and Computers. APSAC Šeda M, Šedová J, Horký M (2017) Multichannel queueing systems and their simulation. In: Applied Physics, System Science and Computers. APSAC
23.
Zurück zum Zitat Cheng B, Fan J, Jia X (2013) Parallel construction of independent spanning trees and an application in diagnosis on Möbius cubes. J Supercomput 65(3):1279–1301CrossRef Cheng B, Fan J, Jia X (2013) Parallel construction of independent spanning trees and an application in diagnosis on Möbius cubes. J Supercomput 65(3):1279–1301CrossRef
24.
Zurück zum Zitat Xiang D, Pan Y, Wang Q, Chen Z (2008) Deadlock-free fully adaptive routing in 2-dimensional tori based on a new virtual network partitioning scheme. In: International Conference on Distributed Computing Systems Xiang D, Pan Y, Wang Q, Chen Z (2008) Deadlock-free fully adaptive routing in 2-dimensional tori based on a new virtual network partitioning scheme. In: International Conference on Distributed Computing Systems
25.
Zurück zum Zitat Liu Z, Fan J, Jia X (2015) Embedding complete binary trees into parity cubes. J Supercomput 71(1):1–27CrossRef Liu Z, Fan J, Jia X (2015) Embedding complete binary trees into parity cubes. J Supercomput 71(1):1–27CrossRef
26.
Zurück zum Zitat Farrington PA, Nembhard HB, Sturrock DT, Evans GW, Chang X (2009) Network simulations with Opnet. In: Winter Simulation Conference Farrington PA, Nembhard HB, Sturrock DT, Evans GW, Chang X (2009) Network simulations with Opnet. In: Winter Simulation Conference
27.
Zurück zum Zitat Lang H, Quan Z (2008) OPNET modeling and simulation of MSM Clos switch fabric and algorithm with OPNET. Mod Electron Tech 19:011 Lang H, Quan Z (2008) OPNET modeling and simulation of MSM Clos switch fabric and algorithm with OPNET. Mod Electron Tech 19:011
28.
Zurück zum Zitat Li H, Cheng Y, Zhou C, Zhuang W (2009) Minimizing end-to-end delay: a novel routing metric for multi-radio wireless mesh networks. In: International Conference on Computer Communications Li H, Cheng Y, Zhou C, Zhuang W (2009) Minimizing end-to-end delay: a novel routing metric for multi-radio wireless mesh networks. In: International Conference on Computer Communications
29.
Zurück zum Zitat Yu Y, Huang Y, Zhao B, Hua Y (2008) Throughput analysis of wireless mesh networks. In: International Conference on Acoustics, Speech, and Signal Processing Yu Y, Huang Y, Zhao B, Hua Y (2008) Throughput analysis of wireless mesh networks. In: International Conference on Acoustics, Speech, and Signal Processing
30.
Zurück zum Zitat Zhao D, Zou J, Todd TD (2007) Admission control with load balancing in IEEE 802.11-based ESS mesh networks. Wireless Netw 13(3):351–359CrossRef Zhao D, Zou J, Todd TD (2007) Admission control with load balancing in IEEE 802.11-based ESS mesh networks. Wireless Netw 13(3):351–359CrossRef
31.
Zurück zum Zitat Yu J, Bang HC, Lee H, Yang SL (2016) Adaptive internet of things and web of things convergence platform for Internet of reality services. J Supercomput 72(1):84–102CrossRef Yu J, Bang HC, Lee H, Yang SL (2016) Adaptive internet of things and web of things convergence platform for Internet of reality services. J Supercomput 72(1):84–102CrossRef
32.
Zurück zum Zitat Wani MA, Arabnia HR (2003) Parallel edge-region-based segmentation algorithm targeted at reconfigurable multi-ring network. J Supercomput 25(1):43–63CrossRefMATH Wani MA, Arabnia HR (2003) Parallel edge-region-based segmentation algorithm targeted at reconfigurable multi-ring network. J Supercomput 25(1):43–63CrossRefMATH
33.
Zurück zum Zitat Arabnia HR (1990) A parallel algorithm for the arbitrary rotation of digitized images using process-and-data-decomposition approach. J Parallel Distrib Comput 10(2):188–193CrossRef Arabnia HR (1990) A parallel algorithm for the arbitrary rotation of digitized images using process-and-data-decomposition approach. J Parallel Distrib Comput 10(2):188–193CrossRef
34.
Zurück zum Zitat Arabnia HR (1996) Distributed stereocorrelation algorithm. Int J Comput Commun 19(8):707–712CrossRef Arabnia HR (1996) Distributed stereocorrelation algorithm. Int J Comput Commun 19(8):707–712CrossRef
35.
Zurück zum Zitat Wang X, Fan JX, Lin CK (2018) BCDC: a high-performance, server-centric data center network. J Comput Sci Technol 33(2):400–416MathSciNetCrossRef Wang X, Fan JX, Lin CK (2018) BCDC: a high-performance, server-centric data center network. J Comput Sci Technol 33(2):400–416MathSciNetCrossRef
36.
Zurück zum Zitat Wang T, Su Z, Xia Y (2018) CLOT: a cost-effective low-latency overlaid torus-based network architecture for data centers. In: IEEE International Conference on Communications Wang T, Su Z, Xia Y (2018) CLOT: a cost-effective low-latency overlaid torus-based network architecture for data centers. In: IEEE International Conference on Communications
Metadaten
Titel
Mesh-of-Torus: a new topology for server-centric data center networks
verfasst von
Peibo Xie
Huaxi Gu
Kun Wang
Xiaoshan Yu
Shangqi Ma
Publikationsdatum
19.09.2018
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 1/2019
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-018-2610-4

Weitere Artikel der Ausgabe 1/2019

The Journal of Supercomputing 1/2019 Zur Ausgabe

Premium Partner