Skip to main content
Top
Published in: The Journal of Supercomputing 2/2021

15-05-2020

Predicting the performance of big data applications on the cloud

Authors: D. Ardagna, E. Barbierato, E. Gianniti, M. Gribaudo, T. B. M. Pinto, A. P. C. da Silva, J. M. Almeida

Published in: The Journal of Supercomputing | Issue 2/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Data science applications have become widespread as a means to extract knowledge from large datasets. Such applications are often characterized by highly heterogeneous and irregular data access patterns, thus often being referred to as big data applications. Such characteristics make the application execution quite challenging for existing software and hardware infrastructures to meet their resource demands. The cloud computing paradigm, in turn, offers a natural hosting solution to such applications since its on-demand pricing model allows allocating effectively computing resources according to application’s needs. However, these properties impose extra challenge to the accurate performance prediction of cloud-based applications, which is a key step to adequate capacity planning and managing of the hosting infrastructure. In this article, we tackle this challenge by exploring three modeling approaches for predicting the performance of big data applications running on the cloud. We evaluate two queuing-based analytical models and dagSim, a fast ad-hoc simulator, in various scenarios based on different applications and infrastructure setups. The considered approaches are compared in terms of prediction accuracy and execution time. Our results indicate that our two best approaches, one analytical model and dagSim, can predict average application execution times with only up to a \(7\%\) relative error, on average. Moreover, a comparison with the widely used event-based simulator available with the Java Modeling Tool (JMT) suite demonstrates that both the analytical model and dagSim run very fast, requiring at least two orders of magnitude lower execution time than JMT while providing slightly better accuracy, being thus practical for online prediction.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Footnotes
3
As mentioned, the model assumes that the execution times of the nodes in the input DAG, i.e., the stages of the Spark application, are exponentially distributed.
 
6
A related but different problem would be to find the optimal configuration given a performance target, i.e., capacity planning. This is certainly a relevant problem as well. Yet, it is not part of the scope of this article and is thus left for future work.
 
7
Ratio of standard deviation to mean value.
 
Literature
1.
go back to reference Jagadish HV, Gehrke J, Labrinidis A, Papakonstantinou Y, Patel JM, Ramakrishnan R, Shahabi C (2014) Big data and its technical challenges. Commun ACM 57:86–94CrossRef Jagadish HV, Gehrke J, Labrinidis A, Papakonstantinou Y, Patel JM, Ramakrishnan R, Shahabi C (2014) Big data and its technical challenges. Commun ACM 57:86–94CrossRef
2.
go back to reference Wang T, Wang J, Nguyen SN, Yang Z, Mi N, Sheng B (2017) Ea2s2: an efficient application-aware storage system for big data processing in heterogeneous clusters. In: 2017 26th International Conference on Computer Communication and Networks (ICCCN) Wang T, Wang J, Nguyen SN, Yang Z, Mi N, Sheng B (2017) Ea2s2: an efficient application-aware storage system for big data processing in heterogeneous clusters. In: 2017 26th International Conference on Computer Communication and Networks (ICCCN)
3.
go back to reference Bertoli M, Casale G, Serazzi G (2009) JMT: performance engineering tools for system modeling. SIGMETRICS Perform Eval Rev 36(4):10–15CrossRef Bertoli M, Casale G, Serazzi G (2009) JMT: performance engineering tools for system modeling. SIGMETRICS Perform Eval Rev 36(4):10–15CrossRef
4.
go back to reference Chiola G (1985) A software package for the analysis of generalized stochastic petri net models. In: International Workshop on Timed Petri Nets, Torino, Italy, July 1–3, 1985, pp 136–143 Chiola G (1985) A software package for the analysis of generalized stochastic petri net models. In: International Workshop on Timed Petri Nets, Torino, Italy, July 1–3, 1985, pp 136–143
5.
go back to reference Nelson RD, Tantawi AN (1988) Approximate analysis of fork/join synchronization in parallel queues. IEEE Trans Comput 37(6):739–743CrossRef Nelson RD, Tantawi AN (1988) Approximate analysis of fork/join synchronization in parallel queues. IEEE Trans Comput 37(6):739–743CrossRef
6.
go back to reference Mak V, Lundstrom S (1990) Predicting performance of parallel computations. IEEE Trans Parallel Distrib Syst 1(3):257–270CrossRef Mak V, Lundstrom S (1990) Predicting performance of parallel computations. IEEE Trans Parallel Distrib Syst 1(3):257–270CrossRef
7.
go back to reference Tripathi SK, Liang D-R (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11(5):491–508CrossRef Tripathi SK, Liang D-R (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11(5):491–508CrossRef
8.
go back to reference Towsley D, Lui JC, Muntz RR (1998) Computing performance bounds of fork–join parallel programs under a multiprocessing environment. IEEE Trans Parallel Distrib Syst 9(3):295–311CrossRef Towsley D, Lui JC, Muntz RR (1998) Computing performance bounds of fork–join parallel programs under a multiprocessing environment. IEEE Trans Parallel Distrib Syst 9(3):295–311CrossRef
9.
go back to reference Varki E, Dowdy LW (1996) Analysis of balanced fork–join queueing networks. SIGMETRICS Perform Eval Rev 24:232–241CrossRef Varki E, Dowdy LW (1996) Analysis of balanced fork–join queueing networks. SIGMETRICS Perform Eval Rev 24:232–241CrossRef
10.
go back to reference Ardagna D, Bernardi S, Gianniti E, Aliabadi SK, Perez-Palacin D, Requeno JI (2016) Modeling performance of hadoop applications: a journey from queueing networks to stochastic well formed nets, In: Algorithms and Architectures for Parallel Processing—16th International Conference, ICA3PP 2016, Granada, Spain, December 14–16, 2016, Proceedings, pp 599–613 Ardagna D, Bernardi S, Gianniti E, Aliabadi SK, Perez-Palacin D, Requeno JI (2016) Modeling performance of hadoop applications: a journey from queueing networks to stochastic well formed nets, In: Algorithms and Architectures for Parallel Processing—16th International Conference, ICA3PP 2016, Granada, Spain, December 14–16, 2016, Proceedings, pp 599–613
12.
go back to reference Lazowska ED, Zahorjan J, Graham GS, Sevcik KC (1984) Quantitative system performance. Prentice-Hall, Englewood Cliffs Lazowska ED, Zahorjan J, Graham GS, Sevcik KC (1984) Quantitative system performance. Prentice-Hall, Englewood Cliffs
13.
go back to reference Ardagna D, Barbierato E, Evangelinou A, Gianniti E, Gribaudo M, Pinto TBM, Guimarães A, Couto da Silva AP, Almeida JM (2018) Performance prediction of cloud-based big data applications, In: Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering, ICPE’18 (New York, NY, USA). ACM, pp 192–199 Ardagna D, Barbierato E, Evangelinou A, Gianniti E, Gribaudo M, Pinto TBM, Guimarães A, Couto da Silva AP, Almeida JM (2018) Performance prediction of cloud-based big data applications, In: Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering, ICPE’18 (New York, NY, USA). ACM, pp 192–199
14.
go back to reference Trivedi K, Kulkarni V (1993) Fspns: fluid stochastic petri nets. In: Applications and Theory of Petri nets 1993 Proceedings of 14th International Conference (Berlin, Germany), Springer Verlag Trivedi K, Kulkarni V (1993) Fspns: fluid stochastic petri nets. In: Applications and Theory of Petri nets 1993 Proceedings of 14th International Conference (Berlin, Germany), Springer Verlag
15.
go back to reference Badue CS, Almeida JM, Almeida VAF, Baeza-Yates RA, Ribeiro-Neto BA, Ziviani A, Ziviani N (2010) Capacity planning for vertical search engines. CoRR. Arxiv: abs/1006.5059 Badue CS, Almeida JM, Almeida VAF, Baeza-Yates RA, Ribeiro-Neto BA, Ziviani A, Ziviani N (2010) Capacity planning for vertical search engines. CoRR. Arxiv:​ abs/​1006.​5059
17.
go back to reference Li M, Tan J, Wang Y, Zhang L, Salapura V (2017) Sparkbench: a spark benchmarking suite characterizing large-scale in-memory data analytics. Cluster Comput 20(3):2575–2589CrossRef Li M, Tan J, Wang Y, Zhang L, Salapura V (2017) Sparkbench: a spark benchmarking suite characterizing large-scale in-memory data analytics. Cluster Comput 20(3):2575–2589CrossRef
18.
go back to reference Popescu AD, Balmin A, Ercegovac V, Ailamaki A (2013) Predict: towards predicting the runtime of large scale iterative analytics. PVLDB 6(14):1678–1689 Popescu AD, Balmin A, Ercegovac V, Ailamaki A (2013) Predict: towards predicting the runtime of large scale iterative analytics. PVLDB 6(14):1678–1689
19.
go back to reference Bhimani J, Mi N, Leeser M, Yang Z (2019) New performance modeling methods for parallel data processing applications. ACM Trans Model Comput Simul 29(3):15:1–15:24MathSciNetCrossRef Bhimani J, Mi N, Leeser M, Yang Z (2019) New performance modeling methods for parallel data processing applications. ACM Trans Model Comput Simul 29(3):15:1–15:24MathSciNetCrossRef
20.
go back to reference Wang K, Khan MMH (2015) Performance prediction for apache spark platform. In: HPCC/CSS/ICESS. IEEE, pp 166–173 Wang K, Khan MMH (2015) Performance prediction for apache spark platform. In: HPCC/CSS/ICESS. IEEE, pp 166–173
21.
go back to reference Malakar P, Balaprakash P, Vishwanath V, Morozov V, Kumaran K (2018) Benchmarking machine learning methods for performance modeling of scientific applications, pp 33–44, 11 Malakar P, Balaprakash P, Vishwanath V, Morozov V, Kumaran K (2018) Benchmarking machine learning methods for performance modeling of scientific applications, pp 33–44, 11
22.
go back to reference Riihijarvi J, Mahonen P (2018) Machine learning for performance prediction in mobile cellular networks. IEEE Comput Intell Mag 13:51–60CrossRef Riihijarvi J, Mahonen P (2018) Machine learning for performance prediction in mobile cellular networks. IEEE Comput Intell Mag 13:51–60CrossRef
23.
go back to reference Nemirovsky D, Arkose T, Markovic N, Nemirovsky M, Unsal O, Cristal A (2017) A machine learning approach for performance prediction and scheduling on heterogeneous CPUs. In: Proceedings of 29th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) Nemirovsky D, Arkose T, Markovic N, Nemirovsky M, Unsal O, Cristal A (2017) A machine learning approach for performance prediction and scheduling on heterogeneous CPUs. In: Proceedings of 29th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)
24.
go back to reference Jamshidi P, Siegmund N, Velez M, Kästner C, Patel A, Agarwal Y (2017) Transfer learning for performance modeling of configurable systems: An exploratory analysis. In: Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, ASE 2017. IEEE Press, pp 497–508 Jamshidi P, Siegmund N, Velez M, Kästner C, Patel A, Agarwal Y (2017) Transfer learning for performance modeling of configurable systems: An exploratory analysis. In: Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, ASE 2017. IEEE Press, pp 497–508
25.
go back to reference Marathe A, Anirudh R, Jain N, Bhatele A, Thiagarajan JJ, Kailkhura B, Yeom J-S, Rountree B, Gamblin T (2017) Performance modeling under resource constraints using deep transfer learning. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis Marathe A, Anirudh R, Jain N, Bhatele A, Thiagarajan JJ, Kailkhura B, Yeom J-S, Rountree B, Gamblin T (2017) Performance modeling under resource constraints using deep transfer learning. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
26.
go back to reference Liang D, Tripathi SK (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11:491–508CrossRef Liang D, Tripathi SK (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11:491–508CrossRef
27.
go back to reference Reisig W, Rozenberg G, Thiagarajan PS (2013) In memoriam: Carl adam petri, In: Transactions on Petri Nets and Other Models of Concurrency VII (K. Jensen, W. M. P. van der Aalst, G. Balbo, M. Koutny, and K. Wolf, eds.), pp. 1–5, Berlin, Heidelberg: Springer Berlin Heidelberg Reisig W, Rozenberg G, Thiagarajan PS (2013) In memoriam: Carl adam petri, In: Transactions on Petri Nets and Other Models of Concurrency VII (K. Jensen, W. M. P. van der Aalst, G. Balbo, M. Koutny, and K. Wolf, eds.), pp. 1–5, Berlin, Heidelberg: Springer Berlin Heidelberg
28.
go back to reference Nicol DM, Miner AS (1995) The fluid stochastic petri net simulator. In: Proceedings of the Sixth International Workshop on Petri Nets and Performance Models, PNPM ’95, (Washington, DC, USA). IEEE Computer Society, p 214 Nicol DM, Miner AS (1995) The fluid stochastic petri net simulator. In: Proceedings of the Sixth International Workshop on Petri Nets and Performance Models, PNPM ’95, (Washington, DC, USA). IEEE Computer Society, p 214
29.
go back to reference Ciardo G, Jones RL III, Miner AS, Siminiceanu RI (2006) Logic and stochastic modeling with SMART. Perform Eval 63:578–608CrossRef Ciardo G, Jones RL III, Miner AS, Siminiceanu RI (2006) Logic and stochastic modeling with SMART. Perform Eval 63:578–608CrossRef
30.
go back to reference Trivedi KS (2002) SHARPE 2002: symbolic hierarchical automated reliability and performance evaluator. In: DSN ’02: Proceedings of the 2002 International Conference on Dependable Systems and Networks, (Washington, DC, USA). IEEE Computer Society, p 544 Trivedi KS (2002) SHARPE 2002: symbolic hierarchical automated reliability and performance evaluator. In: DSN ’02: Proceedings of the 2002 International Conference on Dependable Systems and Networks, (Washington, DC, USA). IEEE Computer Society, p 544
31.
go back to reference Song G, Meng Z, Huet F, Magoules F, Yu L et al (2013) A hadoop mapreduce performance prediction method. HPCC 2013:820–825 Song G, Meng Z, Huet F, Magoules F, Yu L et al (2013) A hadoop mapreduce performance prediction method. HPCC 2013:820–825
32.
go back to reference Vianna E, Comarela G, Pontes T, Almeida J, Almeida V, Wilkinson K, Kuno H, Dayal U (2013) Analytical performance models for mapreduce workloads. Int J Parallel Program 41(4):495–525CrossRef Vianna E, Comarela G, Pontes T, Almeida J, Almeida V, Wilkinson K, Kuno H, Dayal U (2013) Analytical performance models for mapreduce workloads. Int J Parallel Program 41(4):495–525CrossRef
33.
go back to reference Chen K, Powers J, Guo S, Tian F (2014) Cresp: towards optimal resource provisioning for mapreduce computing in public clouds. IEEE Trans Parallel Distrib Syst 25(6):1403–1412CrossRef Chen K, Powers J, Guo S, Tian F (2014) Cresp: towards optimal resource provisioning for mapreduce computing in public clouds. IEEE Trans Parallel Distrib Syst 25(6):1403–1412CrossRef
34.
go back to reference Wang G, Butt AR, Pandey P, Gupta K (2009) A simulation approach to evaluating design decisions in mapreduce setups. In: MASCOTS. IEEE Computer Society, pp 1–11 Wang G, Butt AR, Pandey P, Gupta K (2009) A simulation approach to evaluating design decisions in mapreduce setups. In: MASCOTS. IEEE Computer Society, pp 1–11
35.
go back to reference Bergstra JA, Ponse A, Smolka SA (eds) (2001) Handbook of process algebra. Elsevier, New YorkMATH Bergstra JA, Ponse A, Smolka SA (eds) (2001) Handbook of process algebra. Elsevier, New YorkMATH
36.
go back to reference Hillston J (1996) A compositional approach to performance modelling. Cambridge University Press, New YorkCrossRef Hillston J (1996) A compositional approach to performance modelling. Cambridge University Press, New YorkCrossRef
37.
go back to reference Zaharia M, Xin RS, Wendell P, Das T, Armbrust M, Dave A, Meng X, Rosen J, Venkataraman S, Franklin MJ, Ghodsi A, Gonzalez J, Shenker S, Stoica I (2016) Apache spark: a unified engine for big data processing. Commun ACM 59(11):56–65CrossRef Zaharia M, Xin RS, Wendell P, Das T, Armbrust M, Dave A, Meng X, Rosen J, Venkataraman S, Franklin MJ, Ghodsi A, Gonzalez J, Shenker S, Stoica I (2016) Apache spark: a unified engine for big data processing. Commun ACM 59(11):56–65CrossRef
40.
go back to reference Zaki MJ, Wagner Meira J (2014) Data mining and analysis: fundamental concepts and algorithms. Cambridge University Press, CambridgeCrossRef Zaki MJ, Wagner Meira J (2014) Data mining and analysis: fundamental concepts and algorithms. Cambridge University Press, CambridgeCrossRef
Metadata
Title
Predicting the performance of big data applications on the cloud
Authors
D. Ardagna
E. Barbierato
E. Gianniti
M. Gribaudo
T. B. M. Pinto
A. P. C. da Silva
J. M. Almeida
Publication date
15-05-2020
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 2/2021
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03307-w

Other articles of this Issue 2/2021

The Journal of Supercomputing 2/2021 Go to the issue

Premium Partner