Skip to main content
Erschienen in: The Journal of Supercomputing 10/2019

05.06.2019

Performance improvement of Apache Storm using InfiniBand RDMA

verfasst von: Seokwoo Yang, Siwoon Son, Mi-Jung Choi, Yang-Sae Moon

Erschienen in: The Journal of Supercomputing | Ausgabe 10/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we attempt to improve the performance of a real-time stream processing by applying Apache Storm on InfiniBand. Apache Storm is a representative distributed framework for real-time stream processing, and InfiniBand is a high-performance communication standard. The default approach of running Storm on InfiniBand is to use IP over InfiniBand (IPoIB), which causes a serious CPU overload and fails to exploit high performance of InfiniBand. The CPU overload is mainly caused by frequent context switching and buffer copying operations. To solve this, we propose a new communication methodology using InfiniBand’s remote direct memory access (RDMA). In the proposed method, we replace the existing communication framework Netty to RJ-Netty, which is an RDMA/JXIO-based communication method. We can use Netty as well as RJ-Netty in Storm based on the preference. We also maximize the performance of RJ-Netty by applying multithreading on JXIO servers. Experimental results show that RJ-Netty significantly reduces CPU load while improving message throughput and complete latency compared to IPoIB as well as Ethernet. We believe that, as the first attempt to run Storm on InfiniBand, our approach is excellent in improving the processing performance of Storm by using InfiniBand RDMA functions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Fußnoten
1
All source code of RJ-Netty designed and implemented in this paper can be accessed as open source in GitHub. The GitHub address is https://​github.​com/​dke-knu/​i2am/​tree/​master/​rdma-based-storm.
 
Literatur
10.
Zurück zum Zitat Assuncaoa M, Veith A, Buyya R (2018) Distributed data stream processing and edge computing: a survey on resource elasticity and future directions. J Netw Comput Appl 103:1–17CrossRef Assuncaoa M, Veith A, Buyya R (2018) Distributed data stream processing and edge computing: a survey on resource elasticity and future directions. J Netw Comput Appl 103:1–17CrossRef
12.
Zurück zum Zitat Caneill M, Rheddane AE, Leroy V, Palma ND (2016) Locality-aware routing in stateful streaming applications. In: Proceedings of the 17th International Middleware Conference, Trento, Italy, pp 4:1–4:13 Caneill M, Rheddane AE, Leroy V, Palma ND (2016) Locality-aware routing in stateful streaming applications. In: Proceedings of the 17th International Middleware Conference, Trento, Italy, pp 4:1–4:13
14.
Zurück zum Zitat Goetz P, O’Neill B (2014) Storm blueprints: patterns for distributed real-time computation. Packt Publishing, Birmingham Goetz P, O’Neill B (2014) Storm blueprints: patterns for distributed real-time computation. Packt Publishing, Birmingham
15.
Zurück zum Zitat Huang J, Ouyang X, Jose J, Wasi-Ur-Rahman M, Wang H, Luo M, Subramoni H, Murthy C, Panda DK (2012) High-performance design of HBase with RDMA over InfiniBand. In: Proceedings of the IEEE 26th International Symposium on Parallel and Distributed Processing, Shanghai, China, pp 774–785 Huang J, Ouyang X, Jose J, Wasi-Ur-Rahman M, Wang H, Luo M, Subramoni H, Murthy C, Panda DK (2012) High-performance design of HBase with RDMA over InfiniBand. In: Proceedings of the IEEE 26th International Symposium on Parallel and Distributed Processing, Shanghai, China, pp 774–785
16.
Zurück zum Zitat Hunt P, Konar M, Junqueira FP, Reed B (2010) Zookeeper: wait-free coordination for internet-scale systems. In: Proceedings of the USENIX Annual Technical Conference, Boston, MA, pp 1–6 Hunt P, Konar M, Junqueira FP, Reed B (2010) Zookeeper: wait-free coordination for internet-scale systems. In: Proceedings of the USENIX Annual Technical Conference, Boston, MA, pp 1–6
18.
Zurück zum Zitat Inoubli W, Aridhi S, Mezni H, Maddouri M, Nguifo E (2018) An experimental survey on big data frameworks. Future Gener Comput Syst 86:546–564CrossRef Inoubli W, Aridhi S, Mezni H, Maddouri M, Nguifo E (2018) An experimental survey on big data frameworks. Future Gener Comput Syst 86:546–564CrossRef
19.
Zurück zum Zitat Islam NS, Wasi-Ur-Rahman M, Jose J, Rajachandrasekar R, Wang H, Subramoni H, Murthy C, Panda DK (2012) High performance RDMA-based design of HDFS over InfiniBand. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage, and Analysis, Salt Lake City, UT, pp 1–12 Islam NS, Wasi-Ur-Rahman M, Jose J, Rajachandrasekar R, Wang H, Subramoni H, Murthy C, Panda DK (2012) High performance RDMA-based design of HDFS over InfiniBand. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage, and Analysis, Salt Lake City, UT, pp 1–12
22.
Zurück zum Zitat Kim Y, Son S, Moon Y-S (2019) Apache storm configuration platform for dynamic sampling and filtering of data streams. ICIC Express Lett 10(1):1537 Kim Y, Son S, Moon Y-S (2019) Apache storm configuration platform for dynamic sampling and filtering of data streams. ICIC Express Lett 10(1):1537
24.
Zurück zum Zitat Lu X, Islam NS, Wasi-Ur-Rahman M, Jose J, Subramoni H, Wang H, Panda DK (2013) High-performance design of hadoop RPC with RDMA over InfiniBand. In: Proceedings of the IEEE 42nd International Conference on Parallel Processing, Lyon, France, pp 641–650 Lu X, Islam NS, Wasi-Ur-Rahman M, Jose J, Subramoni H, Wang H, Panda DK (2013) High-performance design of hadoop RPC with RDMA over InfiniBand. In: Proceedings of the IEEE 42nd International Conference on Parallel Processing, Lyon, France, pp 641–650
25.
Zurück zum Zitat Lu X, Wasi-Ur-Rahman M, Islam N, Shankar D, Panda DK (2014) Accelerating spark with RDMA for big data processing: early experiences. In: Proceedings of the IEEE 22nd Annual Symposium on High-Performance Interconnects, Mountain View, CA, pp 9–16 Lu X, Wasi-Ur-Rahman M, Islam N, Shankar D, Panda DK (2014) Accelerating spark with RDMA for big data processing: early experiences. In: Proceedings of the IEEE 22nd Annual Symposium on High-Performance Interconnects, Mountain View, CA, pp 9–16
27.
Zurück zum Zitat Silberschatz A, Galvin PB, Gagne G (2012) Operating system concepts, 9th edn. Wiley, Nwe YorkMATH Silberschatz A, Galvin PB, Gagne G (2012) Operating system concepts, 9th edn. Wiley, Nwe YorkMATH
28.
Zurück zum Zitat Son S, Lee S, Gil M-S, Choi M-J, Moon Y-S (2018) Locality aware traffic distribution in apache storm for energy analytics platform. In: Proceedings of IEEE International Conference on Big Data and Smart Computing, Shanghai, China, pp 721–724 Son S, Lee S, Gil M-S, Choi M-J, Moon Y-S (2018) Locality aware traffic distribution in apache storm for energy analytics platform. In: Proceedings of IEEE International Conference on Big Data and Smart Computing, Shanghai, China, pp 721–724
29.
Zurück zum Zitat Stevens WR, Fenner B, Rudoff AM (2013) UNIX network programming, 3rd edn. Addison-Wesley Professional, Boston Stevens WR, Fenner B, Rudoff AM (2013) UNIX network programming, 3rd edn. Addison-Wesley Professional, Boston
31.
Zurück zum Zitat Toshniwal A, Taneja S, Shukla A, Ramasamy K, Patel JM, Kulkarni S, Jackson J, Gade K, Fu M, Donham J, Bhagat N, Mittal S, Ryaboy D (2014) Storm@Twitter. In: Proceedings of the Internationl Conference on Management of Data, ACM SIGMOD, Snowbird, Utah, pp 147–156 Toshniwal A, Taneja S, Shukla A, Ramasamy K, Patel JM, Kulkarni S, Jackson J, Gade K, Fu M, Donham J, Bhagat N, Mittal S, Ryaboy D (2014) Storm@Twitter. In: Proceedings of the Internationl Conference on Management of Data, ACM SIGMOD, Snowbird, Utah, pp 147–156
32.
Zurück zum Zitat Wang C, Meng X, Guo Q, Weng Z, Yang C (2017) Automating characterization deployment in distributed data stream management systems. IEEE Trans Knowl Data Eng 29(12):2669–2681CrossRef Wang C, Meng X, Guo Q, Weng Z, Yang C (2017) Automating characterization deployment in distributed data stream management systems. IEEE Trans Knowl Data Eng 29(12):2669–2681CrossRef
33.
Zurück zum Zitat Wasi-Ur-Rahman M, Islam NS, Lu X, Jose J, Subramoni H, Wang H, Panda DK (2013) High-performance RDMA-based design of Hadoop MapReduce over InfiniBand. In: Proceedings of the IEEE 27th International Parallel and Distributed Processing Symposium Workshops, Cambridge, MA, pp 1908–1917 Wasi-Ur-Rahman M, Islam NS, Lu X, Jose J, Subramoni H, Wang H, Panda DK (2013) High-performance RDMA-based design of Hadoop MapReduce over InfiniBand. In: Proceedings of the IEEE 27th International Parallel and Distributed Processing Symposium Workshops, Cambridge, MA, pp 1908–1917
34.
Zurück zum Zitat Yang S (2017) IoT stream processing and analytics in the fog. IEEE Commun Mag 55(8):21–27CrossRef Yang S (2017) IoT stream processing and analytics in the fog. IEEE Commun Mag 55(8):21–27CrossRef
35.
Zurück zum Zitat Zeuch S, Monte B, Karimov J, Lutz C, Renz M, Traub J, BreB S, Tabl T, Markl V (2019) Analyzing efficient stream processing on modern hardware. Proc VLDB Endow 12(5):516–530CrossRef Zeuch S, Monte B, Karimov J, Lutz C, Renz M, Traub J, BreB S, Tabl T, Markl V (2019) Analyzing efficient stream processing on modern hardware. Proc VLDB Endow 12(5):516–530CrossRef
Metadaten
Titel
Performance improvement of Apache Storm using InfiniBand RDMA
verfasst von
Seokwoo Yang
Siwoon Son
Mi-Jung Choi
Yang-Sae Moon
Publikationsdatum
05.06.2019
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 10/2019
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-019-02905-7

Weitere Artikel der Ausgabe 10/2019

The Journal of Supercomputing 10/2019 Zur Ausgabe