Skip to main content
Top
Published in: The Journal of Supercomputing 1/2019

19-09-2018

Mesh-of-Torus: a new topology for server-centric data center networks

Authors: Peibo Xie, Huaxi Gu, Kun Wang, Xiaoshan Yu, Shangqi Ma

Published in: The Journal of Supercomputing | Issue 1/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Various topologies have been proposed for high-performance computing (HPC), i.e., fat-tree, Torus topology. Compared with conventional fat-tree topology, Torus performs much better when applied in HPC. Unfortunately, due to its wraparound links, Torus topology naturally has the tendency to trigger deadlock incidents inside the network. Researchers solve this problem by means of virtual channel, but this approach will also restrict the routing of message. In this paper, we propose a deadlock-free topology for HPC, called Mesh-of-Torus, which incarnates the good characteristics of Mesh and Torus topology. Comparing with mesh, Mesh-of-Torus has shorter network diameter. Furthermore, we have proposed a corresponding port assignment rules in consideration of complicated internal arbitration or scheduling mechanism incurred by the employment of virtual channel. Deadlock avoidance can be achieved when dimension-order routing algorithm and our port assignment rules are applied to Mesh-of-Torus. Finally, simulations and mathematical analysis have shown that Mesh-of-Torus outperforms Mesh in terms of average end-to-end latency and network load distribution.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Arabnia HR, Oliver MA (1987) A transfer network for the arbitrary rotation of digitised images. Comput J 30(5):425–432CrossRef Arabnia HR, Oliver MA (1987) A transfer network for the arbitrary rotation of digitised images. Comput J 30(5):425–432CrossRef
2.
go back to reference Wijngaart RFVD, Georganas E, Mattson TG, Wissink A (2017) A new parallel research kernel to expand research on dynamic load-balancing capabilities. In: International Supercomputing Conference Wijngaart RFVD, Georganas E, Mattson TG, Wissink A (2017) A new parallel research kernel to expand research on dynamic load-balancing capabilities. In: International Supercomputing Conference
3.
go back to reference Bhandarkar SM, Arabnia HR (1995) The REFINE multiprocessor-theoretical properties and algorithms. Parallel Comput 21(11):1783–1805CrossRef Bhandarkar SM, Arabnia HR (1995) The REFINE multiprocessor-theoretical properties and algorithms. Parallel Comput 21(11):1783–1805CrossRef
4.
go back to reference Ding M, Tian H (2016) PCA-based network traffic anomaly detection. Tsinghua Sci Technol 21(2):500–509CrossRef Ding M, Tian H (2016) PCA-based network traffic anomaly detection. Tsinghua Sci Technol 21(2):500–509CrossRef
5.
go back to reference Alonso P, Ranilla J, Aguiar JV (2017) High-performance computing. J Supercomput 73(1):1–3CrossRef Alonso P, Ranilla J, Aguiar JV (2017) High-performance computing. J Supercomput 73(1):1–3CrossRef
7.
go back to reference Chen D, Eisley NA, Heidelberger P, Senger RM, Sugawara Y, Kumar S, Salapura V, Satterfield DL, Burow BS., Parker JJ (2011) The IBM Blue Gene/Q interconnection network and message unit. In: High Performance Computing, Networking, Storage and Analysis Chen D, Eisley NA, Heidelberger P, Senger RM, Sugawara Y, Kumar S, Salapura V, Satterfield DL, Burow BS., Parker JJ (2011) The IBM Blue Gene/Q interconnection network and message unit. In: High Performance Computing, Networking, Storage and Analysis
8.
go back to reference Xenopoulos P, Daniel J, Matheson M, Sukumar S (2016) Big data analytics on HPC architectures: performance and cost. In: IEEE International Conference on Big Data Xenopoulos P, Daniel J, Matheson M, Sukumar S (2016) Big data analytics on HPC architectures: performance and cost. In: IEEE International Conference on Big Data
9.
go back to reference González ÁF, RosilloEmai R, Dávila JÁM, Matellán V (2015) Historical review and future challenges in supercomputing and networks of scientific communication. J Supercomput 71(12):4476–4503CrossRef González ÁF, RosilloEmai R, Dávila JÁM, Matellán V (2015) Historical review and future challenges in supercomputing and networks of scientific communication. J Supercomput 71(12):4476–4503CrossRef
10.
go back to reference Azad HS, Bagherzadeh N, Jaberipour G (2015) Advances in multicore systems architectures. J Supercomput 71(8):2783–2786CrossRef Azad HS, Bagherzadeh N, Jaberipour G (2015) Advances in multicore systems architectures. J Supercomput 71(8):2783–2786CrossRef
11.
go back to reference Bermúdez Garzón DF, Requena CG, Gómez ME, López P, Duato J (2016) A family of fault-tolerant efficient indirect topologies. IEEE Trans Parallel Distrib Syst 27(4):927–940CrossRef Bermúdez Garzón DF, Requena CG, Gómez ME, López P, Duato J (2016) A family of fault-tolerant efficient indirect topologies. IEEE Trans Parallel Distrib Syst 27(4):927–940CrossRef
12.
go back to reference Dhanak M, Godbole PD, Patil RA (2016) Torus network labeling in high performance computing. In: International Conference on Computing Communication Control and Automation Dhanak M, Godbole PD, Patil RA (2016) Torus network labeling in high performance computing. In: International Conference on Computing Communication Control and Automation
13.
go back to reference Yu Z, Xiang D, Wang X (2015) Balancing virtual channel utilization for deadlock-free routing in torus networks. J Supercomput 71(8):3094–3115CrossRef Yu Z, Xiang D, Wang X (2015) Balancing virtual channel utilization for deadlock-free routing in torus networks. J Supercomput 71(8):3094–3115CrossRef
14.
go back to reference Abbas D, Jamshidi K (2015) A fault-tolerant hierarchical hybrid mesh-based wireless network-on-chip architecture for multicore platforms. J Supercomput 71(8):3116–3148CrossRef Abbas D, Jamshidi K (2015) A fault-tolerant hierarchical hybrid mesh-based wireless network-on-chip architecture for multicore platforms. J Supercomput 71(8):3116–3148CrossRef
15.
go back to reference Prisacari B, Rodriguez G, Minkenberg C, Palacio RB (2012) Performance implications of deadlock avoidance techniques in torus networks. In: International Conference on High Performance Switching and Routing Prisacari B, Rodriguez G, Minkenberg C, Palacio RB (2012) Performance implications of deadlock avoidance techniques in torus networks. In: International Conference on High Performance Switching and Routing
16.
go back to reference Puente V, Beivide R, Gregorio JA, Prellezo JM, Duato J, Izu C (1999) Adaptive bubble router: a design to improve performance in torus networks. In: International Conference on Parallel Processing Puente V, Beivide R, Gregorio JA, Prellezo JM, Duato J, Izu C (1999) Adaptive bubble router: a design to improve performance in torus networks. In: International Conference on Parallel Processing
17.
go back to reference Jeong YS, Lee SE (2013) Deadlock-free XY-YX router for on-chip interconnection network. Ieice Electron Express 10(20):20130699CrossRef Jeong YS, Lee SE (2013) Deadlock-free XY-YX router for on-chip interconnection network. Ieice Electron Express 10(20):20130699CrossRef
18.
go back to reference Yu Z, Wang X, Shen K (2016) Conditional forwarding: simple flow control to increase adaptivity for fully adaptive routing algorithms. J Supercomput 72(2):639–653CrossRef Yu Z, Wang X, Shen K (2016) Conditional forwarding: simple flow control to increase adaptivity for fully adaptive routing algorithms. J Supercomput 72(2):639–653CrossRef
19.
go back to reference Boden NJ, Cohen D, Felderman RE (1995) Myrinet: a gigabit-per-second local area network. Micro IEEE 15(1):29–36CrossRef Boden NJ, Cohen D, Felderman RE (1995) Myrinet: a gigabit-per-second local area network. Micro IEEE 15(1):29–36CrossRef
20.
go back to reference Veselovsky G, Batovski DA (2003) A study of the permutation capability of a binary hypercube under deterministic dimension-order routing. In: Parallel, Distributed and Network-Based Processing Veselovsky G, Batovski DA (2003) A study of the permutation capability of a binary hypercube under deterministic dimension-order routing. In: Parallel, Distributed and Network-Based Processing
21.
go back to reference Ren P, Kinsy MA, Zheng N (2016) Fault-aware load-balancing routing for 2D-mesh and torus on-chip network topologies. IEEE Trans Comput 65(3):873–887MathSciNetCrossRefMATH Ren P, Kinsy MA, Zheng N (2016) Fault-aware load-balancing routing for 2D-mesh and torus on-chip network topologies. IEEE Trans Comput 65(3):873–887MathSciNetCrossRefMATH
22.
go back to reference Šeda M, Šedová J, Horký M (2017) Multichannel queueing systems and their simulation. In: Applied Physics, System Science and Computers. APSAC Šeda M, Šedová J, Horký M (2017) Multichannel queueing systems and their simulation. In: Applied Physics, System Science and Computers. APSAC
23.
go back to reference Cheng B, Fan J, Jia X (2013) Parallel construction of independent spanning trees and an application in diagnosis on Möbius cubes. J Supercomput 65(3):1279–1301CrossRef Cheng B, Fan J, Jia X (2013) Parallel construction of independent spanning trees and an application in diagnosis on Möbius cubes. J Supercomput 65(3):1279–1301CrossRef
24.
go back to reference Xiang D, Pan Y, Wang Q, Chen Z (2008) Deadlock-free fully adaptive routing in 2-dimensional tori based on a new virtual network partitioning scheme. In: International Conference on Distributed Computing Systems Xiang D, Pan Y, Wang Q, Chen Z (2008) Deadlock-free fully adaptive routing in 2-dimensional tori based on a new virtual network partitioning scheme. In: International Conference on Distributed Computing Systems
25.
go back to reference Liu Z, Fan J, Jia X (2015) Embedding complete binary trees into parity cubes. J Supercomput 71(1):1–27CrossRef Liu Z, Fan J, Jia X (2015) Embedding complete binary trees into parity cubes. J Supercomput 71(1):1–27CrossRef
26.
go back to reference Farrington PA, Nembhard HB, Sturrock DT, Evans GW, Chang X (2009) Network simulations with Opnet. In: Winter Simulation Conference Farrington PA, Nembhard HB, Sturrock DT, Evans GW, Chang X (2009) Network simulations with Opnet. In: Winter Simulation Conference
27.
go back to reference Lang H, Quan Z (2008) OPNET modeling and simulation of MSM Clos switch fabric and algorithm with OPNET. Mod Electron Tech 19:011 Lang H, Quan Z (2008) OPNET modeling and simulation of MSM Clos switch fabric and algorithm with OPNET. Mod Electron Tech 19:011
28.
go back to reference Li H, Cheng Y, Zhou C, Zhuang W (2009) Minimizing end-to-end delay: a novel routing metric for multi-radio wireless mesh networks. In: International Conference on Computer Communications Li H, Cheng Y, Zhou C, Zhuang W (2009) Minimizing end-to-end delay: a novel routing metric for multi-radio wireless mesh networks. In: International Conference on Computer Communications
29.
go back to reference Yu Y, Huang Y, Zhao B, Hua Y (2008) Throughput analysis of wireless mesh networks. In: International Conference on Acoustics, Speech, and Signal Processing Yu Y, Huang Y, Zhao B, Hua Y (2008) Throughput analysis of wireless mesh networks. In: International Conference on Acoustics, Speech, and Signal Processing
30.
go back to reference Zhao D, Zou J, Todd TD (2007) Admission control with load balancing in IEEE 802.11-based ESS mesh networks. Wireless Netw 13(3):351–359CrossRef Zhao D, Zou J, Todd TD (2007) Admission control with load balancing in IEEE 802.11-based ESS mesh networks. Wireless Netw 13(3):351–359CrossRef
31.
go back to reference Yu J, Bang HC, Lee H, Yang SL (2016) Adaptive internet of things and web of things convergence platform for Internet of reality services. J Supercomput 72(1):84–102CrossRef Yu J, Bang HC, Lee H, Yang SL (2016) Adaptive internet of things and web of things convergence platform for Internet of reality services. J Supercomput 72(1):84–102CrossRef
32.
go back to reference Wani MA, Arabnia HR (2003) Parallel edge-region-based segmentation algorithm targeted at reconfigurable multi-ring network. J Supercomput 25(1):43–63CrossRefMATH Wani MA, Arabnia HR (2003) Parallel edge-region-based segmentation algorithm targeted at reconfigurable multi-ring network. J Supercomput 25(1):43–63CrossRefMATH
33.
go back to reference Arabnia HR (1990) A parallel algorithm for the arbitrary rotation of digitized images using process-and-data-decomposition approach. J Parallel Distrib Comput 10(2):188–193CrossRef Arabnia HR (1990) A parallel algorithm for the arbitrary rotation of digitized images using process-and-data-decomposition approach. J Parallel Distrib Comput 10(2):188–193CrossRef
34.
go back to reference Arabnia HR (1996) Distributed stereocorrelation algorithm. Int J Comput Commun 19(8):707–712CrossRef Arabnia HR (1996) Distributed stereocorrelation algorithm. Int J Comput Commun 19(8):707–712CrossRef
35.
go back to reference Wang X, Fan JX, Lin CK (2018) BCDC: a high-performance, server-centric data center network. J Comput Sci Technol 33(2):400–416MathSciNetCrossRef Wang X, Fan JX, Lin CK (2018) BCDC: a high-performance, server-centric data center network. J Comput Sci Technol 33(2):400–416MathSciNetCrossRef
36.
go back to reference Wang T, Su Z, Xia Y (2018) CLOT: a cost-effective low-latency overlaid torus-based network architecture for data centers. In: IEEE International Conference on Communications Wang T, Su Z, Xia Y (2018) CLOT: a cost-effective low-latency overlaid torus-based network architecture for data centers. In: IEEE International Conference on Communications
Metadata
Title
Mesh-of-Torus: a new topology for server-centric data center networks
Authors
Peibo Xie
Huaxi Gu
Kun Wang
Xiaoshan Yu
Shangqi Ma
Publication date
19-09-2018
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 1/2019
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-018-2610-4

Other articles of this Issue 1/2019

The Journal of Supercomputing 1/2019 Go to the issue

Premium Partner