Top

The Journal of Supercomputing

Published in:

15-05-2020

Predicting the performance of big data applications on the cloud

Authors: D. Ardagna, E. Barbierato, E. Gianniti, M. Gribaudo, T. B. M. Pinto, A. P. C. da Silva, J. M. Almeida

Published in: The Journal of Supercomputing | Issue 2/2021

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Data science applications have become widespread as a means to extract knowledge from large datasets. Such applications are often characterized by highly heterogeneous and irregular data access patterns, thus often being referred to as big data applications. Such characteristics make the application execution quite challenging for existing software and hardware infrastructures to meet their resource demands. The cloud computing paradigm, in turn, offers a natural hosting solution to such applications since its on-demand pricing model allows allocating effectively computing resources according to application’s needs. However, these properties impose extra challenge to the accurate performance prediction of cloud-based applications, which is a key step to adequate capacity planning and managing of the hosting infrastructure. In this article, we tackle this challenge by exploring three modeling approaches for predicting the performance of big data applications running on the cloud. We evaluate two queuing-based analytical models and dagSim, a fast ad-hoc simulator, in various scenarios based on different applications and infrastructure setups. The considered approaches are compared in terms of prediction accuracy and execution time. Our results indicate that our two best approaches, one analytical model and dagSim, can predict average application execution times with only up to a \(7\%\) relative error, on average. Moreover, a comparison with the widely used event-based simulator available with the Java Modeling Tool (JMT) suite demonstrates that both the analytical model and dagSim run very fast, requiring at least two orders of magnitude lower execution time than JMT while providing slightly better accuracy, being thus practical for online prediction.

previous article Web behavior analysis in social life logging

next article Prediction of highway asphalt pavement performance based on Markov chain and artificial neural network approach

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

https://www.marketsandmarkets.com/Market-Reports/data-science-platform-market-21532997.html.

https://www.marketsandmarkets.com/Market-Reports/big-data-market-1068.html.

As mentioned, the model assumes that the execution times of the nodes in the input DAG, i.e., the stages of the Spark application, are exponentially distributed.

The tool is available at https://github.com/eubr-bigsea/dagSim.

https://www.lua.org/home.html.

A related but different problem would be to find the optimal configuration given a performance target, i.e., capacity planning. This is certainly a relevant problem as well. Yet, it is not part of the scope of this article and is thus left for future work.

Ratio of standard deviation to mean value.

Jagadish HV, Gehrke J, Labrinidis A, Papakonstantinou Y, Patel JM, Ramakrishnan R, Shahabi C (2014) Big data and its technical challenges. Commun ACM 57:86–94CrossRef

Wang T, Wang J, Nguyen SN, Yang Z, Mi N, Sheng B (2017) Ea2s2: an efficient application-aware storage system for big data processing in heterogeneous clusters. In: 2017 26th International Conference on Computer Communication and Networks (ICCCN)

Bertoli M, Casale G, Serazzi G (2009) JMT: performance engineering tools for system modeling. SIGMETRICS Perform Eval Rev 36(4):10–15CrossRef

Chiola G (1985) A software package for the analysis of generalized stochastic petri net models. In: International Workshop on Timed Petri Nets, Torino, Italy, July 1–3, 1985, pp 136–143

Nelson RD, Tantawi AN (1988) Approximate analysis of fork/join synchronization in parallel queues. IEEE Trans Comput 37(6):739–743CrossRef

Mak V, Lundstrom S (1990) Predicting performance of parallel computations. IEEE Trans Parallel Distrib Syst 1(3):257–270CrossRef

Tripathi SK, Liang D-R (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11(5):491–508CrossRef

Towsley D, Lui JC, Muntz RR (1998) Computing performance bounds of fork–join parallel programs under a multiprocessing environment. IEEE Trans Parallel Distrib Syst 9(3):295–311CrossRef

Varki E, Dowdy LW (1996) Analysis of balanced fork–join queueing networks. SIGMETRICS Perform Eval Rev 24:232–241CrossRef

10.

Ardagna D, Bernardi S, Gianniti E, Aliabadi SK, Perez-Palacin D, Requeno JI (2016) Modeling performance of hadoop applications: a journey from queueing networks to stochastic well formed nets, In: Algorithms and Architectures for Parallel Processing—16th International Conference, ICA3PP 2016, Granada, Spain, December 14–16, 2016, Proceedings, pp 599–613

11.

Woodie A. A decade later, apache spark still going strong. https://www.datanami.com/2019/03/08/a-decade-later-apache-spark-still-going-strong/. Accessed 7 Apr 2020

12.

Lazowska ED, Zahorjan J, Graham GS, Sevcik KC (1984) Quantitative system performance. Prentice-Hall, Englewood Cliffs

13.

Ardagna D, Barbierato E, Evangelinou A, Gianniti E, Gribaudo M, Pinto TBM, Guimarães A, Couto da Silva AP, Almeida JM (2018) Performance prediction of cloud-based big data applications, In: Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering, ICPE’18 (New York, NY, USA). ACM, pp 192–199

14.

Trivedi K, Kulkarni V (1993) Fspns: fluid stochastic petri nets. In: Applications and Theory of Petri nets 1993 Proceedings of 14th International Conference (Berlin, Germany), Springer Verlag

15.

Badue CS, Almeida JM, Almeida VAF, Baeza-Yates RA, Ribeiro-Neto BA, Ziviani A, Ziviani N (2010) Capacity planning for vertical search engines. CoRR. Arxiv: abs/1006.5059

16.

TPC. The TPC-DS benchmark. http://www.tpc.org/tpcds/. Accessed 7 Apr 2020

17.

Li M, Tan J, Wang Y, Zhang L, Salapura V (2017) Sparkbench: a spark benchmarking suite characterizing large-scale in-memory data analytics. Cluster Comput 20(3):2575–2589CrossRef

18.

Popescu AD, Balmin A, Ercegovac V, Ailamaki A (2013) Predict: towards predicting the runtime of large scale iterative analytics. PVLDB 6(14):1678–1689

19.

Bhimani J, Mi N, Leeser M, Yang Z (2019) New performance modeling methods for parallel data processing applications. ACM Trans Model Comput Simul 29(3):15:1–15:24MathSciNetCrossRef

20.

Wang K, Khan MMH (2015) Performance prediction for apache spark platform. In: HPCC/CSS/ICESS. IEEE, pp 166–173

21.

Malakar P, Balaprakash P, Vishwanath V, Morozov V, Kumaran K (2018) Benchmarking machine learning methods for performance modeling of scientific applications, pp 33–44, 11

22.

Riihijarvi J, Mahonen P (2018) Machine learning for performance prediction in mobile cellular networks. IEEE Comput Intell Mag 13:51–60CrossRef

23.

Nemirovsky D, Arkose T, Markovic N, Nemirovsky M, Unsal O, Cristal A (2017) A machine learning approach for performance prediction and scheduling on heterogeneous CPUs. In: Proceedings of 29th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

24.

Jamshidi P, Siegmund N, Velez M, Kästner C, Patel A, Agarwal Y (2017) Transfer learning for performance modeling of configurable systems: An exploratory analysis. In: Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, ASE 2017. IEEE Press, pp 497–508

25.

Marathe A, Anirudh R, Jain N, Bhatele A, Thiagarajan JJ, Kailkhura B, Yeom J-S, Rountree B, Gamblin T (2017) Performance modeling under resource constraints using deep transfer learning. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

26.

Liang D, Tripathi SK (2000) On performance prediction of parallel computations with precedent constraints. IEEE Trans Parallel Distrib Syst 11:491–508CrossRef

27.

Reisig W, Rozenberg G, Thiagarajan PS (2013) In memoriam: Carl adam petri, In: Transactions on Petri Nets and Other Models of Concurrency VII (K. Jensen, W. M. P. van der Aalst, G. Balbo, M. Koutny, and K. Wolf, eds.), pp. 1–5, Berlin, Heidelberg: Springer Berlin Heidelberg

28.

Nicol DM, Miner AS (1995) The fluid stochastic petri net simulator. In: Proceedings of the Sixth International Workshop on Petri Nets and Performance Models, PNPM ’95, (Washington, DC, USA). IEEE Computer Society, p 214

29.

Ciardo G, Jones RL III, Miner AS, Siminiceanu RI (2006) Logic and stochastic modeling with SMART. Perform Eval 63:578–608CrossRef

30.

Trivedi KS (2002) SHARPE 2002: symbolic hierarchical automated reliability and performance evaluator. In: DSN ’02: Proceedings of the 2002 International Conference on Dependable Systems and Networks, (Washington, DC, USA). IEEE Computer Society, p 544

31.

Song G, Meng Z, Huet F, Magoules F, Yu L et al (2013) A hadoop mapreduce performance prediction method. HPCC 2013:820–825

32.

Vianna E, Comarela G, Pontes T, Almeida J, Almeida V, Wilkinson K, Kuno H, Dayal U (2013) Analytical performance models for mapreduce workloads. Int J Parallel Program 41(4):495–525CrossRef

33.

Chen K, Powers J, Guo S, Tian F (2014) Cresp: towards optimal resource provisioning for mapreduce computing in public clouds. IEEE Trans Parallel Distrib Syst 25(6):1403–1412CrossRef

34.

Wang G, Butt AR, Pandey P, Gupta K (2009) A simulation approach to evaluating design decisions in mapreduce setups. In: MASCOTS. IEEE Computer Society, pp 1–11

35.

Bergstra JA, Ponse A, Smolka SA (eds) (2001) Handbook of process algebra. Elsevier, New YorkMATH

36.

Hillston J (1996) A compositional approach to performance modelling. Cambridge University Press, New YorkCrossRef

37.

Zaharia M, Xin RS, Wendell P, Das T, Armbrust M, Dave A, Meng X, Rosen J, Venkataraman S, Franklin MJ, Ghodsi A, Gonzalez J, Shenker S, Stoica I (2016) Apache spark: a unified engine for big data processing. Commun ACM 59(11):56–65CrossRef

38.

Microsoft (2020) What is PaaS?. https://azure.microsoft.com/en-us/overview/what-is-paas/. Accessed 05 Apr 2020

39.

Top 4 Apache Spark Use Cases. https://www.knowledgenile.com/blogs/apache-spark-use-cases/. Accessed 9 Apr 2020

40.

Zaki MJ, Wagner Meira J (2014) Data mining and analysis: fundamental concepts and algorithms. Cambridge University Press, CambridgeCrossRef

41.

Microsoft. Sizes for Windows virtual machines in Azure. https://docs.microsoft.com/en-us/azure/virtual-machines/windows/sizes. Accessed 7 Apr 2020

Title: Predicting the performance of big data applications on the cloud
Authors: D. Ardagna
E. Barbierato
E. Gianniti
M. Gribaudo
T. B. M. Pinto
A. P. C. da Silva
J. M. Almeida
Publication date: 15-05-2020
Publisher: Springer US
Published in: The Journal of Supercomputing / Issue 2/2021
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI: https://doi.org/10.1007/s11227-020-03307-w

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Other articles of this Issue 2/2021

Efficient parallelization of multilevel fast multipole algorithm for electromagnetic simulation on many-core SW26010 processor

Neighborhood search-based job scheduling for IoT big data real-time processing in distributed edge-cloud computing environment

Investigating the performance of Hadoop and Spark platforms on machine learning algorithms

Adoption of human metabolic processes as Data Quality Based Models

High-performance and deep pedestrian detection based on estimation of different parts

MDSbSP: a search protocol based on MDS codes for RFID-based Internet of vehicle

Premium Partner