Skip to main content
Erschienen in: The International Journal of Advanced Manufacturing Technology 1/2022

13.01.2022 | ORIGINAL ARTICLE

DCI-NACC: flow scheduling and congestion control based on programmable data plane in high-performance data center networks

verfasst von: Junjie Geng

Erschienen in: The International Journal of Advanced Manufacturing Technology | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

PFC (Priority-based Flow Control), which is used to give full play to the performance advantages of RDMA, can cause problems such as head-of-line (HoL) blocking. PFC-based method cannot eliminate the congestion fundamentally. At the same time, another approach to deal with congestion in the data center networks, commonly known as “injection throttling” mechanism, may result in slower effective response due to the delay between congestion detection and feedback to the sources. In this article, we have optimized the two types of algorithms (DCI and NACC) and used them in combination (DCI-NACC). We propose a novel scheduling strategy of combining PFC-based algorithm and “injection throttling”-based algorithm using programmable data plane device for lossless data center networks. Specifically, we design a dynamic congestion isolation (DCI) method to optimize PFC. DCI ensures lossless transmission in the data center networks and eliminates the HoL blocking problem caused by PFC. We also propose a network-assisted congestion control (NACC) algorithm in which the feedback information is directly sent from the switch where congestion occurs in the network. NACC increases the efficiency of network congestion perception and reduces the delay that feedback take effect. Experimental results show that, compared with PFC and ECN algorithm, DCI-NACC algorithm has significant improvement in network performance (normalized throughput, network latency, and flow completion time). Especially, simulation experiments show that NACC can greatly reduce the number of times that PFC PAUSE is triggered. The simulation experiments confirm that DCI-NACC algorithm can effectively improve the performance of the data center networks on the basis of lossless transmission.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Vasan D, Alazab M, Wassan S et al (2020) IMCFN: Image-based malware classification using fine-tuned convolutional neural network architecture. Comput Net 171:07138 Vasan D, Alazab M, Wassan S et al (2020) IMCFN: Image-based malware classification using fine-tuned convolutional neural network architecture. Comput Net 171:07138
2.
Zurück zum Zitat Çınar Z M, Abdussalam Nuhu A, Zeeshan Q et al (2020) Machine learning in predictive maintenance towards sustainable smart manufacturing in industry 4.0. Sustainability 12(19):8211 Çınar Z M, Abdussalam Nuhu A, Zeeshan Q et al (2020) Machine learning in predictive maintenance towards sustainable smart manufacturing in industry 4.0. Sustainability 12(19):8211
3.
Zurück zum Zitat Vasan D, Alazab M, Wassan S et al (2020) Image-Based malware classification using ensemble of CNN architectures (IMCEC). Comput Secur 92:101748 Vasan D, Alazab M, Wassan S et al (2020) Image-Based malware classification using ensemble of CNN architectures (IMCEC). Comput Secur 92:101748
5.
Zurück zum Zitat Marinos I, Watson RNM, Handley M (2014) Network stack specialization for performance. ACM SIGCOMM Computer Communication Review 44(4):175–186CrossRef Marinos I, Watson RNM, Handley M (2014) Network stack specialization for performance. ACM SIGCOMM Computer Communication Review 44(4):175–186CrossRef
7.
Zurück zum Zitat Vienne J, Chen J, Wasi-Ur-Rahman M et al (2012) Performance analysis and evaluation of infiniband fdr and 40gige roce on hpc and cloud computing systems. In 2012 IEEE 20th Annual Symposium on High-Performance Interconnects. IEEE, pp 48–55 Vienne J, Chen J, Wasi-Ur-Rahman M et al (2012) Performance analysis and evaluation of infiniband fdr and 40gige roce on hpc and cloud computing systems. In 2012 IEEE 20th Annual Symposium on High-Performance Interconnects. IEEE, pp 48–55
8.
Zurück zum Zitat Beck M, Kagan M (2011) Performance evaluation of the RDMA over ethernet (RoCE) standard in enterprise data centers infrastructure. In proceedings of the 3rd Workshop on Data Center-Converged and Virtual Ethernet Switching, pp 9–15 Beck M, Kagan M (2011) Performance evaluation of the RDMA over ethernet (RoCE) standard in enterprise data centers infrastructure. In proceedings of the 3rd Workshop on Data Center-Converged and Virtual Ethernet Switching, pp 9–15
9.
Zurück zum Zitat Zhu Y, Eran H, Firestone D et al (2015) Congestion control for large-scale RDMA deployments. ACM SIGCOMM Computer Communication Review 45(4):523–536CrossRef Zhu Y, Eran H, Firestone D et al (2015) Congestion control for large-scale RDMA deployments. ACM SIGCOMM Computer Communication Review 45(4):523–536CrossRef
10.
Zurück zum Zitat IEEE (2008) 802.11 Qbb. Priority Based Flow Control IEEE (2008) 802.11 Qbb. Priority Based Flow Control
11.
Zurück zum Zitat Mittal R, Shpiner A, Panda A et al (2018) Revisiting network support for RDMA. In Proceedings of the. Conference of the ACM Special Interest Group on Data Communication 2018:313–326 Mittal R, Shpiner A, Panda A et al (2018) Revisiting network support for RDMA. In Proceedings of the. Conference of the ACM Special Interest Group on Data Communication 2018:313–326
12.
Zurück zum Zitat Guo C, Wu H, Deng Z et al (2016) RDMA over commodity ethernet at scale. In Proceedings of the. ACM SIGCOMM Conference 2016:202–215 Guo C, Wu H, Deng Z et al (2016) RDMA over commodity ethernet at scale. In Proceedings of the. ACM SIGCOMM Conference 2016:202–215
13.
Zurück zum Zitat Hu S, Zhu Y, Cheng P et al (2016) Deadlocks in datacenter networks: Why do they form, and how to avoid them. In Proceedings of the 15th ACM Workshop on Hot Topics in Networks, pp 92–98 Hu S, Zhu Y, Cheng P et al (2016) Deadlocks in datacenter networks: Why do they form, and how to avoid them. In Proceedings of the 15th ACM Workshop on Hot Topics in Networks, pp 92–98
14.
Zurück zum Zitat Shpiner A, Zahavi E, Zdornov V et al (2016) Unlocking credit loop deadlocks. In Proceedings of the 15th ACM Workshop on Hot Topics in Networks, pp 85–91 Shpiner A, Zahavi E, Zdornov V et al (2016) Unlocking credit loop deadlocks. In Proceedings of the 15th ACM Workshop on Hot Topics in Networks, pp 85–91
15.
Zurück zum Zitat Stephens B, Cox A L, Singla A et al (2014) Practical DCB for improved data center networks. In IEEE INFOCOM 2014-IEEE Conf Comp Commun. IEEE, pp 1824–1832 Stephens B, Cox A L, Singla A et al (2014) Practical DCB for improved data center networks. In IEEE INFOCOM 2014-IEEE Conf Comp Commun. IEEE, pp 1824–1832
16.
Zurück zum Zitat Nachiondo T, Flich J, Duato J (2010) Buffer management strategies to reduce HoL blocking. IEEE Trans Parallel Distrib Syst 21(2010):739–753CrossRef Nachiondo T, Flich J, Duato J (2010) Buffer management strategies to reduce HoL blocking. IEEE Trans Parallel Distrib Syst 21(2010):739–753CrossRef
18.
Zurück zum Zitat Cho I, Jang K, Han D (2017) Credit-scheduled delay-bounded congestion control for datacenters. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication, pp 239–252 Cho I, Jang K, Han D (2017) Credit-scheduled delay-bounded congestion control for datacenters. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication, pp 239–252
19.
Zurück zum Zitat Hu S, Zhu Y, Cheng P et al (2017) Tagger: practical PFC deadlock prevention in data center networks. In Proceedings of the 13th International Conference on emerging Networking EXperiments and Technologies, pp 451–463 Hu S, Zhu Y, Cheng P et al (2017) Tagger: practical PFC deadlock prevention in data center networks. In Proceedings of the 13th International Conference on emerging Networking EXperiments and Technologies, pp 451–463
20.
Zurück zum Zitat Pahlevanzadeh B, Mohamad I J, Wan T C (2010) Per-priority flow control (PPFC) for enhancing metro ethernet QoS. In 2010 Sec Int Conf Com Res Dev IEEE, pp 83–87 Pahlevanzadeh B, Mohamad I J, Wan T C (2010) Per-priority flow control (PPFC) for enhancing metro ethernet QoS. In 2010 Sec Int Conf Com Res Dev IEEE, pp 83–87
21.
Zurück zum Zitat Olmedilla C, Escudero-Sahuquillo J, García PJ et al (2020) Optimizing packet dropping by efficient congesting-flow isolation in lossy data-center networks. In 2020 IEEE Symposium on High-Performance Interconnects (HOTI). IEEE, pp 47–54 Olmedilla C, Escudero-Sahuquillo J, García PJ et al (2020) Optimizing packet dropping by efficient congesting-flow isolation in lossy data-center networks. In 2020 IEEE Symposium on High-Performance Interconnects (HOTI). IEEE, pp 47–54
22.
Zurück zum Zitat Olmedilla C, Escudero-Sahuquillo J, Garcia-Garcia PJ et al (2020) DVL-lossy: isolating congesting flows to optimize packet dropping in lossy data-center networks. IEEE Micro 41(1):37–44CrossRef Olmedilla C, Escudero-Sahuquillo J, Garcia-Garcia PJ et al (2020) DVL-lossy: isolating congesting flows to optimize packet dropping in lossy data-center networks. IEEE Micro 41(1):37–44CrossRef
23.
Zurück zum Zitat Zhu Y, Ghobadi M, Misra V et al (2016) ECN or delay: lessons learnt from analysis of DCQCN and TIMELY. In Proceedings of the 12th International on Conf Emerg Net Exp Techno 313–327 Zhu Y, Ghobadi M, Misra V et al (2016) ECN or delay: lessons learnt from analysis of DCQCN and TIMELY. In Proceedings of the 12th International on Conf Emerg Net Exp Techno 313–327
24.
Zurück zum Zitat Alizadeh M, Greenberg A, Maltz DA et al (2010) Data center tcp (dctcp). In Proceedings of the ACM SIGCOMM. Conference 2010:63–74 Alizadeh M, Greenberg A, Maltz DA et al (2010) Data center tcp (dctcp). In Proceedings of the ACM SIGCOMM. Conference 2010:63–74
25.
Zurück zum Zitat Zhang Y, Ansari N (2013) Fair quantized congestion notification in data center networks. IEEE Transac Commun 25;61(11):4690–9 Zhang Y, Ansari N (2013) Fair quantized congestion notification in data center networks. IEEE Transac Commun 25;61(11):4690–9
26.
Zurück zum Zitat Willinger W, Taqqu MS, Sherman R et al (2002) Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level. IEEE/ACM Trans Networking 5(1):71–86CrossRef Willinger W, Taqqu MS, Sherman R et al (2002) Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level. IEEE/ACM Trans Networking 5(1):71–86CrossRef
27.
Zurück zum Zitat Mckeown N, Izzard M, Mekkittikul A et al (2008) The Tiny Tera: a packet switch core. IEEE Micro 17(1):26–33CrossRef Mckeown N, Izzard M, Mekkittikul A et al (2008) The Tiny Tera: a packet switch core. IEEE Micro 17(1):26–33CrossRef
28.
Zurück zum Zitat Bai W, Chen K, Hu S et al (2017) Congestion control for high-speed extremely shallow-buffered datacenter networks. In Proceedings of the First Asia-Pacific Workshop on Networking, pp 29–35 Bai W, Chen K, Hu S et al (2017) Congestion control for high-speed extremely shallow-buffered datacenter networks. In Proceedings of the First Asia-Pacific Workshop on Networking, pp 29–35
29.
Zurück zum Zitat Geng J, Yan J, Zhang Y (2019) P4QCN: Congestion control using p4-capable device in data center networks. Electronics 8(3):280CrossRef Geng J, Yan J, Zhang Y (2019) P4QCN: Congestion control using p4-capable device in data center networks. Electronics 8(3):280CrossRef
30.
Zurück zum Zitat Lantz B, Heller B, McKeown N (2010) A network in a laptop: rapid prototyping for software-defined networks. In Proceedings of the 9th ACM SIGCOMM Workshop on Hot Topics in Networks. 2010:1–6 Lantz B, Heller B, McKeown N (2010) A network in a laptop: rapid prototyping for software-defined networks. In Proceedings of the 9th ACM SIGCOMM Workshop on Hot Topics in Networks. 2010:1–6
Metadaten
Titel
DCI-NACC: flow scheduling and congestion control based on programmable data plane in high-performance data center networks
verfasst von
Junjie Geng
Publikationsdatum
13.01.2022
Verlag
Springer London
Erschienen in
The International Journal of Advanced Manufacturing Technology / Ausgabe 1/2022
Print ISSN: 0268-3768
Elektronische ISSN: 1433-3015
DOI
https://doi.org/10.1007/s00170-021-08459-4

Weitere Artikel der Ausgabe 1/2022

The International Journal of Advanced Manufacturing Technology 1/2022 Zur Ausgabe

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.