Skip to main content
Erschienen in: The Journal of Supercomputing 9/2021

05.03.2021

Heterogeneity-aware elastic scaling of streaming applications on cloud platforms

verfasst von: Jyoti Sahni, Deo Prakash Vidyarthi

Erschienen in: The Journal of Supercomputing | Ausgabe 9/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Rise of Big Data techniques has led to the requirement for low latency analysis of high-velocity continuous data streams in real time. Several solutions, including Stream Processing Systems (SPSs), have been developed to enable real-time distributed stream processing. However, emerging application scenarios such as smart cities and wearable assistance that involve highly variable data rates keep on posing new challenges to the established stream processing engines for maintaining cost-effective executions. To cater to such scenarios, many modern SPSs have been proposed that leverage Cloud environment. The run-time scalability incorporated in these SPSs is in their early adaptations and are based on fixed scale sizes. Moreover, these scaling approaches do not adequately consider both the structure of the hosted streaming applications and the characteristic features of the underlying Cloud environment. Achieving true cost benefits of orchestrating streaming applications on Cloud-based pay-as-you-go model while maintaining the desired QoS, necessitates that both these issues are accounted in making the scaling decisions. This work presents a heterogeneity-aware, efficient auto-scaling strategy StreamScale-H which addresses both these issues. Simulation experiments, on representative stream applications, indicate that the proposed StreamScale-H auto-scaling algorithm exhibits much better performance in comparison with the state-of-the-art algorithms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Stonebraker M, Çetintemel U, Zdonik S (2005) The 8 requirements of real-time stream processing. ACM Sigmod Record 34(4):42–47CrossRef Stonebraker M, Çetintemel U, Zdonik S (2005) The 8 requirements of real-time stream processing. ACM Sigmod Record 34(4):42–47CrossRef
3.
Zurück zum Zitat Biem A, Bouillet E, Feng H, Ranganathan A, Riabov A, Verscheure O, Koutsopoulos H and Moran C (2010) Ibm infosphere streams for scalable, real-time, intelligent transportation services. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp 1093–1104. June 2010 Biem A, Bouillet E, Feng H, Ranganathan A, Riabov A, Verscheure O, Koutsopoulos H and Moran C (2010) Ibm infosphere streams for scalable, real-time, intelligent transportation services. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp 1093–1104. June 2010
4.
Zurück zum Zitat Toshniwal A, Taneja S, Shukla A, Ramasamy K, Patel JM, Kulkarni S, Jackson J, Gade K, Fu M, Donham J and Bhagat N (2014) Storm@ twitter. In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp 147–156. June 2014 Toshniwal A, Taneja S, Shukla A, Ramasamy K, Patel JM, Kulkarni S, Jackson J, Gade K, Fu M, Donham J and Bhagat N (2014) Storm@ twitter. In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp 147–156. June 2014
5.
Zurück zum Zitat Jain N, Amini L, Andrade H, King R, Park Y, Selo P and Venkatramani C (2006) Design, implementation, and evaluation of the linear road bnchmark on the stream processing core. In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pp 431–442. June 2006 Jain N, Amini L, Andrade H, King R, Park Y, Selo P and Venkatramani C (2006) Design, implementation, and evaluation of the linear road bnchmark on the stream processing core. In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pp 431–442. June 2006
7.
Zurück zum Zitat Arasu A, Babcock B, Babu S, Datar M, Ito K, Nishizawa I, Rosenstein J and Widom J (2003) STREAM: the stanford stream data manager (demonstration description). In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp 665–665. June 2003 Arasu A, Babcock B, Babu S, Datar M, Ito K, Nishizawa I, Rosenstein J and Widom J (2003) STREAM: the stanford stream data manager (demonstration description). In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp 665–665. June 2003
8.
Zurück zum Zitat Abadi DJ, Ahmad Y, Balazinska M, Cetintemel U, Cherniack M, Hwang JH, Lindner W, Maskey A, Rasin A, Ryvkina E, Tatbul N (2005) The design of the borealis stream processing engine. Cidr 5(2005):277–289 Abadi DJ, Ahmad Y, Balazinska M, Cetintemel U, Cherniack M, Hwang JH, Lindner W, Maskey A, Rasin A, Ryvkina E, Tatbul N (2005) The design of the borealis stream processing engine. Cidr 5(2005):277–289
9.
Zurück zum Zitat Lécué F, Tallevi-Diotallevi S, Hayes J, Tucker R, Bicer V, Sbodio M, Tommasi P (2014) Smart traffic analytics in the semantic web with STAR-CITY: Scenarios, system and lessons learned in Dublin City. J Web Semant 27:26–33CrossRef Lécué F, Tallevi-Diotallevi S, Hayes J, Tucker R, Bicer V, Sbodio M, Tommasi P (2014) Smart traffic analytics in the semantic web with STAR-CITY: Scenarios, system and lessons learned in Dublin City. J Web Semant 27:26–33CrossRef
10.
Zurück zum Zitat Gulisano V, Jimenez-Peris R, Patino-Martinez M, Soriente C, Valduriez P (2012) Streamcloud: An elastic and scalable data streaming system. IEEE Trans Parallel Distrib Syst 23(12):2351–2365CrossRef Gulisano V, Jimenez-Peris R, Patino-Martinez M, Soriente C, Valduriez P (2012) Streamcloud: An elastic and scalable data streaming system. IEEE Trans Parallel Distrib Syst 23(12):2351–2365CrossRef
11.
Zurück zum Zitat Satzger B, Hummer W, Leitner P and Dustdar S (2011) Esc: towards an elastic stream computing platform for the cloud. In 2011 IEEE 4th International Conference on Cloud Computing, IEEE, pp 348–355. July 2011 Satzger B, Hummer W, Leitner P and Dustdar S (2011) Esc: towards an elastic stream computing platform for the cloud. In 2011 IEEE 4th International Conference on Cloud Computing, IEEE, pp 348–355. July 2011
12.
Zurück zum Zitat Schad J, Dittrich J, Quiané-Ruiz JA (2010) Runtime measurements in the cloud: observing, analyzing, and reducing variance. Proc VLDB Endow 3(1–2):460–471CrossRef Schad J, Dittrich J, Quiané-Ruiz JA (2010) Runtime measurements in the cloud: observing, analyzing, and reducing variance. Proc VLDB Endow 3(1–2):460–471CrossRef
13.
Zurück zum Zitat Sahni J and Vidyarthi DP (2016) Scalable online analytics on cloud infrastructures. In International Conference on Advances in Computing and Data Sciences, pp 399–408. Springer, Singapore. November 2016 Sahni J and Vidyarthi DP (2016) Scalable online analytics on cloud infrastructures. In International Conference on Advances in Computing and Data Sciences, pp 399–408. Springer, Singapore. November 2016
14.
Zurück zum Zitat Gedik B, Özsema HG, Öztürk Ö (2016) Pipelined fission for stream programs with dynamic selectivity and partitioned state. J Parallel Distrib Comput 96:106–120CrossRef Gedik B, Özsema HG, Öztürk Ö (2016) Pipelined fission for stream programs with dynamic selectivity and partitioned state. J Parallel Distrib Comput 96:106–120CrossRef
15.
Zurück zum Zitat Schneider S, Andrade H, Gedik B, Biem A and Wu KL (2009) Elastic scaling of data parallel operators in stream processing. In 2009 IEEE International Symposium on Parallel and Distributed Processing, pp 1–12. May 2009 Schneider S, Andrade H, Gedik B, Biem A and Wu KL (2009) Elastic scaling of data parallel operators in stream processing. In 2009 IEEE International Symposium on Parallel and Distributed Processing, pp 1–12. May 2009
16.
Zurück zum Zitat Min C and Eom YI (2013) DANBI: Dynamic scheduling of irregular stream programs for many-core systems. In Proceedings of the 22nd IEEE International Conference on Parallel Architectures and Compilation Techniques, pp 189–200. September 2013 Min C and Eom YI (2013) DANBI: Dynamic scheduling of irregular stream programs for many-core systems. In Proceedings of the 22nd IEEE International Conference on Parallel Architectures and Compilation Techniques, pp 189–200. September 2013
17.
Zurück zum Zitat Hidalgo N, Wladdimiro D, Rosas E (2017) Self-adaptive processing graph with operator fission for elastic stream processing. J Syst Softw 127:205–216CrossRef Hidalgo N, Wladdimiro D, Rosas E (2017) Self-adaptive processing graph with operator fission for elastic stream processing. J Syst Softw 127:205–216CrossRef
18.
Zurück zum Zitat Cheng Y, Hao Z, Cai R (2019) Auto-scaling for real-time stream analytics on HPC cloud. SOCA 13(2):169–183CrossRef Cheng Y, Hao Z, Cai R (2019) Auto-scaling for real-time stream analytics on HPC cloud. SOCA 13(2):169–183CrossRef
19.
Zurück zum Zitat Vijayakumar S, Zhu Q and Agrawal G (2010) Dynamic resource provisioning for data streaming applications in a cloud environment. In 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp 441–448. November 2010 Vijayakumar S, Zhu Q and Agrawal G (2010) Dynamic resource provisioning for data streaming applications in a cloud environment. In 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp 441–448. November 2010
20.
Zurück zum Zitat Castro Fernandez R, Migliavacca M, Kalyvianaki E and Pietzuch P (2013) Integrating scale out and fault tolerance in stream processing using operator state management. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp 725–736. June 2013 Castro Fernandez R, Migliavacca M, Kalyvianaki E and Pietzuch P (2013) Integrating scale out and fault tolerance in stream processing using operator state management. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp 725–736. June 2013
21.
Zurück zum Zitat Bellavista P, Corradi A, Kotoulas S and Reale A (2014) Adaptive fault-tolerance for dynamic resource provisioning in distributed stream processing systems. In EDBT pp 85–96. Bellavista P, Corradi A, Kotoulas S and Reale A (2014) Adaptive fault-tolerance for dynamic resource provisioning in distributed stream processing systems. In EDBT pp 85–96.
22.
Zurück zum Zitat Gedik B, Schneider S, Hirzel M, Wu KL (2013) Elastic scaling for data stream processing. IEEE Trans Parallel Distrib Syst 25(6):1447–1463CrossRef Gedik B, Schneider S, Hirzel M, Wu KL (2013) Elastic scaling for data stream processing. IEEE Trans Parallel Distrib Syst 25(6):1447–1463CrossRef
23.
Zurück zum Zitat Hesse G and Lorenz M (2015) Conceptual survey on data stream processing systems. In 2015 IEEE 21st International Conference on PARALLEL and Distributed Systems (ICPADS), pp 797–802. December 2015 Hesse G and Lorenz M (2015) Conceptual survey on data stream processing systems. In 2015 IEEE 21st International Conference on PARALLEL and Distributed Systems (ICPADS), pp 797–802. December 2015
24.
Zurück zum Zitat Wang J, Taal A, Martin P, Hu Y, Zhou H, Pang J, de Laat C, Zhao Z (2017) Planning virtual infrastructures for time critical applications with multiple deadline constraints. Futur Gener Comput Syst 75:365–375CrossRef Wang J, Taal A, Martin P, Hu Y, Zhou H, Pang J, de Laat C, Zhao Z (2017) Planning virtual infrastructures for time critical applications with multiple deadline constraints. Futur Gener Comput Syst 75:365–375CrossRef
26.
Zurück zum Zitat Pisani F, Brunetta JR, Do Rosario VM and Borin E (2017) Beyond the fog: Bringing cross-platform code execution to constrained iot devices. In 2017 29th IEEE International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), pp 17–24. October 2017 Pisani F, Brunetta JR, Do Rosario VM and Borin E (2017) Beyond the fog: Bringing cross-platform code execution to constrained iot devices. In 2017 29th IEEE International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), pp 17–24. October 2017
27.
Zurück zum Zitat Cardellini V, Lo Presti F, Nardelli M, Russo Russo G (2018) Optimal operator deployment and replication for elastic distributed data stream processing. Concurr Comput Pract Exp 30(9):cpe4334CrossRef Cardellini V, Lo Presti F, Nardelli M, Russo Russo G (2018) Optimal operator deployment and replication for elastic distributed data stream processing. Concurr Comput Pract Exp 30(9):cpe4334CrossRef
28.
Zurück zum Zitat Cardellini V, Presti FL, Nardelli M, Russo GR (2018) Decentralized self-adaptation for elastic data stream processing. Futur Gener Comput Syst 87:171–185CrossRef Cardellini V, Presti FL, Nardelli M, Russo GR (2018) Decentralized self-adaptation for elastic data stream processing. Futur Gener Comput Syst 87:171–185CrossRef
29.
Zurück zum Zitat de Assuncao MD, da Silva Veith A, Buyya R (2018) Distributed data stream processing and edge computing: A survey on resource elasticity and future directions. J Netw Comput Appl 103:1–17CrossRef de Assuncao MD, da Silva Veith A, Buyya R (2018) Distributed data stream processing and edge computing: A survey on resource elasticity and future directions. J Netw Comput Appl 103:1–17CrossRef
30.
Zurück zum Zitat Hochreiner C, Vögler M, Schulte S and Dustdar S (2016) Elastic stream processing for the internet of things. In 2016 IEEE 9th International Conference on Cloud Computing (CLOUD), pp 100–107. June 2016 Hochreiner C, Vögler M, Schulte S and Dustdar S (2016) Elastic stream processing for the internet of things. In 2016 IEEE 9th International Conference on Cloud Computing (CLOUD), pp 100–107. June 2016
31.
Zurück zum Zitat Bello O, Zeadally S (2019) Toward efficient smartification of the Internet of Things (IoT) services. Futur Gener Comput Syst 92:663–673CrossRef Bello O, Zeadally S (2019) Toward efficient smartification of the Internet of Things (IoT) services. Futur Gener Comput Syst 92:663–673CrossRef
32.
Zurück zum Zitat MH ur Rehman MH, Liew CS, Wah TY and Khan MK (2017) Towards next-generation heterogeneous mobile data stream mining applications: opportunities, challenges, and future research directions. J Netw Comput Appl 79:1–24CrossRef MH ur Rehman MH, Liew CS, Wah TY and Khan MK (2017) Towards next-generation heterogeneous mobile data stream mining applications: opportunities, challenges, and future research directions. J Netw Comput Appl 79:1–24CrossRef
33.
Zurück zum Zitat Kaur M, Aron R (2021) A systematic study of load balancing approaches in the fog computing environment. J Supercomput 8:1–46 Kaur M, Aron R (2021) A systematic study of load balancing approaches in the fog computing environment. J Supercomput 8:1–46
34.
Zurück zum Zitat Al-Sinayyid A, Zhu M (2020) Job scheduler for streaming applications in heterogeneous distributed processing systems. J Supercomput 76(12):9609–9628CrossRef Al-Sinayyid A, Zhu M (2020) Job scheduler for streaming applications in heterogeneous distributed processing systems. J Supercomput 76(12):9609–9628CrossRef
35.
Zurück zum Zitat Belkhiria M and Tedeschi C (2019) Decentralized scaling for stream processing engines Belkhiria M and Tedeschi C (2019) Decentralized scaling for stream processing engines
36.
Zurück zum Zitat Kumbhare AG, Simmhan Y, Frincu M, Prasanna VK (2015) Reactive resource provisioning heuristics for dynamic dataflows on cloud infrastructure. IEEE Trans Cloud Comput 3(2):105–118CrossRef Kumbhare AG, Simmhan Y, Frincu M, Prasanna VK (2015) Reactive resource provisioning heuristics for dynamic dataflows on cloud infrastructure. IEEE Trans Cloud Comput 3(2):105–118CrossRef
37.
Zurück zum Zitat Higashino WA, Capretz MA, Bittencourt LF (2016) CEPSim: Modelling and simulation of complex event processing systems in cloud environments. Futur Gener Comput Syst 65:122–139CrossRef Higashino WA, Capretz MA, Bittencourt LF (2016) CEPSim: Modelling and simulation of complex event processing systems in cloud environments. Futur Gener Comput Syst 65:122–139CrossRef
38.
Zurück zum Zitat Parker RG, Rardin RL (2014) Discrete optimization. Elsevier, LondonMATH Parker RG, Rardin RL (2014) Discrete optimization. Elsevier, LondonMATH
39.
Zurück zum Zitat Calheiros RN, Ranjan R, Beloglazov A, De Rose CA, Buyya R (2011) CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Softw Pract exp 41(1):23–50CrossRef Calheiros RN, Ranjan R, Beloglazov A, De Rose CA, Buyya R (2011) CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Softw Pract exp 41(1):23–50CrossRef
40.
Zurück zum Zitat Iosup A, Ostermann S, Yigitbasi MN, Prodan R, Fahringer T, Epema D (2011) Performance analysis of cloud computing services for many-tasks scientific computing. IEEE Trans Parallel Distrib Syst 22(6):931–945CrossRef Iosup A, Ostermann S, Yigitbasi MN, Prodan R, Fahringer T, Epema D (2011) Performance analysis of cloud computing services for many-tasks scientific computing. IEEE Trans Parallel Distrib Syst 22(6):931–945CrossRef
41.
Zurück zum Zitat Sahni J, Vidyarthi DP (2017) Heterogeneity-aware adaptive auto-scaling heuristic for improved QoS and resource usage in cloud environments. Computing 99(4):351–381MathSciNetCrossRef Sahni J, Vidyarthi DP (2017) Heterogeneity-aware adaptive auto-scaling heuristic for improved QoS and resource usage in cloud environments. Computing 99(4):351–381MathSciNetCrossRef
42.
Zurück zum Zitat Rodriguez MA, Buyya R (2014) Deadline based resource provisioning and scheduling algorithm for scientific workflows on clouds. IEEE Trans Cloud Comput 2(2):222–235CrossRef Rodriguez MA, Buyya R (2014) Deadline based resource provisioning and scheduling algorithm for scientific workflows on clouds. IEEE Trans Cloud Comput 2(2):222–235CrossRef
43.
Zurück zum Zitat Palankar MR, Iamnitchi A, Ripeanu M and Garfinkel S (2008) Amazon S3 for science grids: a viable solution, In Proceedings of the 2008 International Workshop on Data-Aware Distributed Computing, pp 55–64. June 2008 Palankar MR, Iamnitchi A, Ripeanu M and Garfinkel S (2008) Amazon S3 for science grids: a viable solution, In Proceedings of the 2008 International Workshop on Data-Aware Distributed Computing, pp 55–64. June 2008
45.
Zurück zum Zitat Kumbhare A, Simmhan Y and Prasanna VK (2013) Exploiting application dynamism and cloud elasticity for continuous dataflows. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, pp 1–12. November 2013 Kumbhare A, Simmhan Y and Prasanna VK (2013) Exploiting application dynamism and cloud elasticity for continuous dataflows. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, pp 1–12. November 2013
Metadaten
Titel
Heterogeneity-aware elastic scaling of streaming applications on cloud platforms
verfasst von
Jyoti Sahni
Deo Prakash Vidyarthi
Publikationsdatum
05.03.2021
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 9/2021
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-021-03692-w

Weitere Artikel der Ausgabe 9/2021

The Journal of Supercomputing 9/2021 Zur Ausgabe

Premium Partner