Skip to main content
Erschienen in: The Journal of Supercomputing 2/2021

15.05.2020

Predicting the performance of big data applications on the cloud

verfasst von: D. Ardagna, E. Barbierato, E. Gianniti, M. Gribaudo, T. B. M. Pinto, A. P. C. da Silva, J. M. Almeida

Erschienen in: The Journal of Supercomputing | Ausgabe 2/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Data science applications have become widespread as a means to extract knowledge from large datasets. Such applications are often characterized by highly heterogeneous and irregular data access patterns, thus often being referred to as big data applications. Such characteristics make the application execution quite challenging for existing software and hardware infrastructures to meet their resource demands. The cloud computing paradigm, in turn, offers a natural hosting solution to such applications since its on-demand pricing model allows allocating effectively computing resources according to application’s needs. However, these properties impose extra challenge to the accurate performance prediction of cloud-based applications, which is a key step to adequate capacity planning and managing of the hosting infrastructure. In this article, we tackle this challenge by exploring three modeling approaches for predicting the performance of big data applications running on the cloud. We evaluate two queuing-based analytical models and dagSim, a fast ad-hoc simulator, in various scenarios based on different applications and infrastructure setups. The considered approaches are compared in terms of prediction accuracy and execution time. Our results indicate that our two best approaches, one analytical model and dagSim, can predict average application execution times with only up to a \(7\%\) relative error, on average. Moreover, a comparison with the widely used event-based simulator available with the Java Modeling Tool (JMT) suite demonstrates that both the analytical model and dagSim run very fast, requiring at least two orders of magnitude lower execution time than JMT while providing slightly better accuracy, being thus practical for online prediction.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Fußnoten
3
As mentioned, the model assumes that the execution times of the nodes in the input DAG, i.e., the stages of the Spark application, are exponentially distributed.
 
6
A related but different problem would be to find the optimal configuration given a performance target, i.e., capacity planning. This is certainly a relevant problem as well. Yet, it is not part of the scope of this article and is thus left for future work.
 
7
Ratio of standard deviation to mean value.
 
Literatur
1.
Zurück zum Zitat Jagadish HV, Gehrke J, Labrinidis A, Papakonstantinou Y, Patel JM, Ramakrishnan R, Shahabi C (2014) Big data and its technical challenges. Commun ACM 57:86–94CrossRef Jagadish HV, Gehrke J, Labrinidis A, Papakonstantinou Y, Patel JM, Ramakrishnan R, Shahabi C (2014) Big data and its technical challenges. Commun ACM 57:86–94CrossRef
2.
Zurück zum Zitat Wang T, Wang J, Nguyen SN, Yang Z, Mi N, Sheng B (2017) Ea2s2: an efficient application-aware storage system for big data processing in heterogeneous clusters. In: 2017 26th International Conference on Computer Communication and Networks (ICCCN) Wang T, Wang J, Nguyen SN, Yang Z, Mi N, Sheng B (2017) Ea2s2: an efficient application-aware storage system for big data processing in heterogeneous clusters. In: 2017 26th International Conference on Computer Communication and Networks (ICCCN)
3.
Zurück zum Zitat Bertoli M, Casale G, Serazzi G (2009) JMT: performance engineering tools for system modeling. SIGMETRICS Perform Eval Rev 36(4):10–15CrossRef Bertoli M, Casale G, Serazzi G (2009) JMT: performance engineering tools for system modeling. SIGMETRICS Perform Eval Rev 36(4):10–15CrossRef
4.
Zurück zum Zitat Chiola G (1985) A software package for the analysis of generalized stochastic petri net models. In: International Workshop on Timed Petri Nets, Torino, Italy, July 1–3, 1985, pp 136–143 Chiola G (1985) A software package for the analysis of generalized stochastic petri net models. In: International Workshop on Timed Petri Nets, Torino, Italy, July 1–3, 1985, pp 136–143
5.
Zurück zum Zitat Nelson RD, Tantawi AN (1988) Approximate analysis of fork/join synchronization in parallel queues. IEEE Trans Comput 37(6):739–743CrossRef Nelson RD, Tantawi AN (1988) Approximate analysis of fork/join synchronization in parallel queues. IEEE Trans Comput 37(6):739–743CrossRef
6.
Zurück zum Zitat Mak V, Lundstrom S (1990) Predicting performance of parallel computations. IEEE Trans Parallel Distrib Syst 1(3):257–270CrossRef Mak V, Lundstrom S (1990) Predicting performance of parallel computations. IEEE Trans Parallel Distrib Syst 1(3):257–270CrossRef
7.
Zurück zum Zitat Tripathi SK, Liang D-R (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11(5):491–508CrossRef Tripathi SK, Liang D-R (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11(5):491–508CrossRef
8.
Zurück zum Zitat Towsley D, Lui JC, Muntz RR (1998) Computing performance bounds of fork–join parallel programs under a multiprocessing environment. IEEE Trans Parallel Distrib Syst 9(3):295–311CrossRef Towsley D, Lui JC, Muntz RR (1998) Computing performance bounds of fork–join parallel programs under a multiprocessing environment. IEEE Trans Parallel Distrib Syst 9(3):295–311CrossRef
9.
Zurück zum Zitat Varki E, Dowdy LW (1996) Analysis of balanced fork–join queueing networks. SIGMETRICS Perform Eval Rev 24:232–241CrossRef Varki E, Dowdy LW (1996) Analysis of balanced fork–join queueing networks. SIGMETRICS Perform Eval Rev 24:232–241CrossRef
10.
Zurück zum Zitat Ardagna D, Bernardi S, Gianniti E, Aliabadi SK, Perez-Palacin D, Requeno JI (2016) Modeling performance of hadoop applications: a journey from queueing networks to stochastic well formed nets, In: Algorithms and Architectures for Parallel Processing—16th International Conference, ICA3PP 2016, Granada, Spain, December 14–16, 2016, Proceedings, pp 599–613 Ardagna D, Bernardi S, Gianniti E, Aliabadi SK, Perez-Palacin D, Requeno JI (2016) Modeling performance of hadoop applications: a journey from queueing networks to stochastic well formed nets, In: Algorithms and Architectures for Parallel Processing—16th International Conference, ICA3PP 2016, Granada, Spain, December 14–16, 2016, Proceedings, pp 599–613
12.
Zurück zum Zitat Lazowska ED, Zahorjan J, Graham GS, Sevcik KC (1984) Quantitative system performance. Prentice-Hall, Englewood Cliffs Lazowska ED, Zahorjan J, Graham GS, Sevcik KC (1984) Quantitative system performance. Prentice-Hall, Englewood Cliffs
13.
Zurück zum Zitat Ardagna D, Barbierato E, Evangelinou A, Gianniti E, Gribaudo M, Pinto TBM, Guimarães A, Couto da Silva AP, Almeida JM (2018) Performance prediction of cloud-based big data applications, In: Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering, ICPE’18 (New York, NY, USA). ACM, pp 192–199 Ardagna D, Barbierato E, Evangelinou A, Gianniti E, Gribaudo M, Pinto TBM, Guimarães A, Couto da Silva AP, Almeida JM (2018) Performance prediction of cloud-based big data applications, In: Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering, ICPE’18 (New York, NY, USA). ACM, pp 192–199
14.
Zurück zum Zitat Trivedi K, Kulkarni V (1993) Fspns: fluid stochastic petri nets. In: Applications and Theory of Petri nets 1993 Proceedings of 14th International Conference (Berlin, Germany), Springer Verlag Trivedi K, Kulkarni V (1993) Fspns: fluid stochastic petri nets. In: Applications and Theory of Petri nets 1993 Proceedings of 14th International Conference (Berlin, Germany), Springer Verlag
15.
Zurück zum Zitat Badue CS, Almeida JM, Almeida VAF, Baeza-Yates RA, Ribeiro-Neto BA, Ziviani A, Ziviani N (2010) Capacity planning for vertical search engines. CoRR. Arxiv: abs/1006.5059 Badue CS, Almeida JM, Almeida VAF, Baeza-Yates RA, Ribeiro-Neto BA, Ziviani A, Ziviani N (2010) Capacity planning for vertical search engines. CoRR. Arxiv:​ abs/​1006.​5059
17.
Zurück zum Zitat Li M, Tan J, Wang Y, Zhang L, Salapura V (2017) Sparkbench: a spark benchmarking suite characterizing large-scale in-memory data analytics. Cluster Comput 20(3):2575–2589CrossRef Li M, Tan J, Wang Y, Zhang L, Salapura V (2017) Sparkbench: a spark benchmarking suite characterizing large-scale in-memory data analytics. Cluster Comput 20(3):2575–2589CrossRef
18.
Zurück zum Zitat Popescu AD, Balmin A, Ercegovac V, Ailamaki A (2013) Predict: towards predicting the runtime of large scale iterative analytics. PVLDB 6(14):1678–1689 Popescu AD, Balmin A, Ercegovac V, Ailamaki A (2013) Predict: towards predicting the runtime of large scale iterative analytics. PVLDB 6(14):1678–1689
19.
Zurück zum Zitat Bhimani J, Mi N, Leeser M, Yang Z (2019) New performance modeling methods for parallel data processing applications. ACM Trans Model Comput Simul 29(3):15:1–15:24MathSciNetCrossRef Bhimani J, Mi N, Leeser M, Yang Z (2019) New performance modeling methods for parallel data processing applications. ACM Trans Model Comput Simul 29(3):15:1–15:24MathSciNetCrossRef
20.
Zurück zum Zitat Wang K, Khan MMH (2015) Performance prediction for apache spark platform. In: HPCC/CSS/ICESS. IEEE, pp 166–173 Wang K, Khan MMH (2015) Performance prediction for apache spark platform. In: HPCC/CSS/ICESS. IEEE, pp 166–173
21.
Zurück zum Zitat Malakar P, Balaprakash P, Vishwanath V, Morozov V, Kumaran K (2018) Benchmarking machine learning methods for performance modeling of scientific applications, pp 33–44, 11 Malakar P, Balaprakash P, Vishwanath V, Morozov V, Kumaran K (2018) Benchmarking machine learning methods for performance modeling of scientific applications, pp 33–44, 11
22.
Zurück zum Zitat Riihijarvi J, Mahonen P (2018) Machine learning for performance prediction in mobile cellular networks. IEEE Comput Intell Mag 13:51–60CrossRef Riihijarvi J, Mahonen P (2018) Machine learning for performance prediction in mobile cellular networks. IEEE Comput Intell Mag 13:51–60CrossRef
23.
Zurück zum Zitat Nemirovsky D, Arkose T, Markovic N, Nemirovsky M, Unsal O, Cristal A (2017) A machine learning approach for performance prediction and scheduling on heterogeneous CPUs. In: Proceedings of 29th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) Nemirovsky D, Arkose T, Markovic N, Nemirovsky M, Unsal O, Cristal A (2017) A machine learning approach for performance prediction and scheduling on heterogeneous CPUs. In: Proceedings of 29th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)
24.
Zurück zum Zitat Jamshidi P, Siegmund N, Velez M, Kästner C, Patel A, Agarwal Y (2017) Transfer learning for performance modeling of configurable systems: An exploratory analysis. In: Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, ASE 2017. IEEE Press, pp 497–508 Jamshidi P, Siegmund N, Velez M, Kästner C, Patel A, Agarwal Y (2017) Transfer learning for performance modeling of configurable systems: An exploratory analysis. In: Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, ASE 2017. IEEE Press, pp 497–508
25.
Zurück zum Zitat Marathe A, Anirudh R, Jain N, Bhatele A, Thiagarajan JJ, Kailkhura B, Yeom J-S, Rountree B, Gamblin T (2017) Performance modeling under resource constraints using deep transfer learning. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis Marathe A, Anirudh R, Jain N, Bhatele A, Thiagarajan JJ, Kailkhura B, Yeom J-S, Rountree B, Gamblin T (2017) Performance modeling under resource constraints using deep transfer learning. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
26.
Zurück zum Zitat Liang D, Tripathi SK (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11:491–508CrossRef Liang D, Tripathi SK (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11:491–508CrossRef
27.
Zurück zum Zitat Reisig W, Rozenberg G, Thiagarajan PS (2013) In memoriam: Carl adam petri, In: Transactions on Petri Nets and Other Models of Concurrency VII (K. Jensen, W. M. P. van der Aalst, G. Balbo, M. Koutny, and K. Wolf, eds.), pp. 1–5, Berlin, Heidelberg: Springer Berlin Heidelberg Reisig W, Rozenberg G, Thiagarajan PS (2013) In memoriam: Carl adam petri, In: Transactions on Petri Nets and Other Models of Concurrency VII (K. Jensen, W. M. P. van der Aalst, G. Balbo, M. Koutny, and K. Wolf, eds.), pp. 1–5, Berlin, Heidelberg: Springer Berlin Heidelberg
28.
Zurück zum Zitat Nicol DM, Miner AS (1995) The fluid stochastic petri net simulator. In: Proceedings of the Sixth International Workshop on Petri Nets and Performance Models, PNPM ’95, (Washington, DC, USA). IEEE Computer Society, p 214 Nicol DM, Miner AS (1995) The fluid stochastic petri net simulator. In: Proceedings of the Sixth International Workshop on Petri Nets and Performance Models, PNPM ’95, (Washington, DC, USA). IEEE Computer Society, p 214
29.
Zurück zum Zitat Ciardo G, Jones RL III, Miner AS, Siminiceanu RI (2006) Logic and stochastic modeling with SMART. Perform Eval 63:578–608CrossRef Ciardo G, Jones RL III, Miner AS, Siminiceanu RI (2006) Logic and stochastic modeling with SMART. Perform Eval 63:578–608CrossRef
30.
Zurück zum Zitat Trivedi KS (2002) SHARPE 2002: symbolic hierarchical automated reliability and performance evaluator. In: DSN ’02: Proceedings of the 2002 International Conference on Dependable Systems and Networks, (Washington, DC, USA). IEEE Computer Society, p 544 Trivedi KS (2002) SHARPE 2002: symbolic hierarchical automated reliability and performance evaluator. In: DSN ’02: Proceedings of the 2002 International Conference on Dependable Systems and Networks, (Washington, DC, USA). IEEE Computer Society, p 544
31.
Zurück zum Zitat Song G, Meng Z, Huet F, Magoules F, Yu L et al (2013) A hadoop mapreduce performance prediction method. HPCC 2013:820–825 Song G, Meng Z, Huet F, Magoules F, Yu L et al (2013) A hadoop mapreduce performance prediction method. HPCC 2013:820–825
32.
Zurück zum Zitat Vianna E, Comarela G, Pontes T, Almeida J, Almeida V, Wilkinson K, Kuno H, Dayal U (2013) Analytical performance models for mapreduce workloads. Int J Parallel Program 41(4):495–525CrossRef Vianna E, Comarela G, Pontes T, Almeida J, Almeida V, Wilkinson K, Kuno H, Dayal U (2013) Analytical performance models for mapreduce workloads. Int J Parallel Program 41(4):495–525CrossRef
33.
Zurück zum Zitat Chen K, Powers J, Guo S, Tian F (2014) Cresp: towards optimal resource provisioning for mapreduce computing in public clouds. IEEE Trans Parallel Distrib Syst 25(6):1403–1412CrossRef Chen K, Powers J, Guo S, Tian F (2014) Cresp: towards optimal resource provisioning for mapreduce computing in public clouds. IEEE Trans Parallel Distrib Syst 25(6):1403–1412CrossRef
34.
Zurück zum Zitat Wang G, Butt AR, Pandey P, Gupta K (2009) A simulation approach to evaluating design decisions in mapreduce setups. In: MASCOTS. IEEE Computer Society, pp 1–11 Wang G, Butt AR, Pandey P, Gupta K (2009) A simulation approach to evaluating design decisions in mapreduce setups. In: MASCOTS. IEEE Computer Society, pp 1–11
35.
Zurück zum Zitat Bergstra JA, Ponse A, Smolka SA (eds) (2001) Handbook of process algebra. Elsevier, New YorkMATH Bergstra JA, Ponse A, Smolka SA (eds) (2001) Handbook of process algebra. Elsevier, New YorkMATH
36.
Zurück zum Zitat Hillston J (1996) A compositional approach to performance modelling. Cambridge University Press, New YorkCrossRef Hillston J (1996) A compositional approach to performance modelling. Cambridge University Press, New YorkCrossRef
37.
Zurück zum Zitat Zaharia M, Xin RS, Wendell P, Das T, Armbrust M, Dave A, Meng X, Rosen J, Venkataraman S, Franklin MJ, Ghodsi A, Gonzalez J, Shenker S, Stoica I (2016) Apache spark: a unified engine for big data processing. Commun ACM 59(11):56–65CrossRef Zaharia M, Xin RS, Wendell P, Das T, Armbrust M, Dave A, Meng X, Rosen J, Venkataraman S, Franklin MJ, Ghodsi A, Gonzalez J, Shenker S, Stoica I (2016) Apache spark: a unified engine for big data processing. Commun ACM 59(11):56–65CrossRef
40.
Zurück zum Zitat Zaki MJ, Wagner Meira J (2014) Data mining and analysis: fundamental concepts and algorithms. Cambridge University Press, CambridgeCrossRef Zaki MJ, Wagner Meira J (2014) Data mining and analysis: fundamental concepts and algorithms. Cambridge University Press, CambridgeCrossRef
Metadaten
Titel
Predicting the performance of big data applications on the cloud
verfasst von
D. Ardagna
E. Barbierato
E. Gianniti
M. Gribaudo
T. B. M. Pinto
A. P. C. da Silva
J. M. Almeida
Publikationsdatum
15.05.2020
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 2/2021
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03307-w

Weitere Artikel der Ausgabe 2/2021

The Journal of Supercomputing 2/2021 Zur Ausgabe