Published in: Information Systems Frontiers 1/2019

14.04.2018

Quantitative Analysis of Apache Storm Applications: The NewsAsset Case Study

Authors: José I. Requeno, José Merseguer, Simona Bernardi, Diego Perez-Palacin, Giorgos Giotis, Vasilis Papanikolaou

Abstract

The development of Information Systems today must contend with the era of Big Data. Large volumes of information need to be processed in real time, for example for Facebook or Twitter analysis. This paper addresses the redesign of NewsAsset, a commercial product that helps journalists by providing services that analyze millions of media items from social networks in real time. Technologies like Apache Storm can help enormously in this context. We have quantitatively analyzed the new design of NewsAsset to assess whether the introduction of Apache Storm can meet the demanding performance requirements of this media product. Our assessment approach, guided by the Unified Modeling Language (UML), reuses for performance analysis the software designs already used for development. In addition, we converted UML into a domain-specific modeling language (DSML) for Apache Storm by creating a profile for Storm. We then transformed the DSML models into a language suitable for performance evaluation, namely stochastic Petri nets. The assessment ended with a successful software design that met the scalability requirements of NewsAsset.

Footnotes
1. Domain Specific Modeling Language.
2. Small and medium-sized enterprise.
3. Modelling and Analysis of Real Time and Embedded Systems (OMG 2011a).
4. Dependability Modelling and Analysis (Bernardi et al. 2011).
5. The Student t-distribution with \(N-1 = 21\) degrees of freedom has been used.
6. Figure 7 does not show the utilizations of the bolts that are below \(5\%\).

References
Ardagna, D., et al. (2016). Modeling performance of Hadoop applications: a journey from queueing networks to stochastic well formed nets. In Carretero, J., et al. (Eds.) Proceedings of the 16th Int. Conf. on algorithms and architectures for parallel processing. ISBN 978-3-319-49583-5 (pp. 599–613). Cham: Springer.
Bernardi, S., Merseguer, J., Petriu, D.C. (2011). A dependability profile within MARTE. Software & Systems Modeling, 10(3), 313–336.
Chiola, G., Marsan, M.A., Balbo, G., Conte, G. (1993). Generalized stochastic Petri nets: a definition at the net level and its implications. IEEE Transactions on Software Engineering, 19(2), 89–107.
Diplaris, S., et al. (2012). SocialSensor: sensing user generated input for improved media discovery and experience. In Proceedings of the 21st international conference on World Wide Web (pp. 243–246). ACM.
Gianniti, E., et al. (2017). Fluid Petri nets for the performance evaluation of MapReduce and Spark applications. ACM SIGMETRICS Performance Evaluation Review, 44(4), 23–36.
ISO. (2008). Systems and software engineering – High-level Petri nets – Part 2: Transfer format. ISO/IEC 15909-2:2011. Geneva: International Organization for Standardization.
Kroß, J., Brunnert, A., Krcmar, H. (2015). Modeling big data systems by extending the Palladio component model. Softwaretechnik-Trends, 35(3), 1–3.
Kroß, J., & Krcmar, H. (2016). Modeling and simulating Apache Spark streaming applications. Softwaretechnik-Trends, 36(4), 1–3.
Lagarde, F., Espinoza, H., Terrier, F., Gérard, S. (2007). Improving UML profile design practices by leveraging conceptual domain models. In Stirewalt, R.E.K., Egyed, A., Fischer, B. (Eds.) Proceedings of the 22nd IEEE/ACM international conference on automated software engineering (ASE 2007) (pp. 445–448). Atlanta: ACM.
Law, A.M. (2015). Simulation modeling and analysis. McGraw-Hill.
Marsan, M.A., Balbo, G., Conte, G., Donatelli, S., Franceschinis, G. (1994). Modelling with generalized stochastic Petri nets, 1st edn. New York: John Wiley & Sons, Inc.
Nalepa, F., Batko, M., Zezula, P. (2015a). Model for performance analysis of distributed stream processing applications. In Proceedings of the 20th international conference on database and expert systems applications (pp. 520–533). Springer.
Nalepa, F., Batko, M., Zezula, P. (2015b). Performance analysis of distributed stream processing applications through colored Petri nets. In Proceedings of the 10th international doctoral workshop on mathematical and engineering methods in computer science (pp. 93–106). Springer.
Ranjan, R. (2014). Modeling and simulation in performance optimization of big data processing frameworks. IEEE Cloud Computing, 1(4), 14–19.
Requeno, J.I., Merseguer, J., Bernardi, S. (2017). Performance analysis of Apache Storm applications using stochastic Petri nets. In Proceedings of the 5th international workshop on formal methods integration.
Samolej, S., & Rak, T. (2009). Simulation and performance analysis of distributed internet systems using TCPNs. Informatica, 33(4), 405–415.
Selic, B. (2007). A systematic approach to domain-specific language design using UML. In Proceedings of the 10th IEEE international symposium on object-oriented real-time distributed computing (pp. 2–9). IEEE Computer Society.
Singhal, R., & Verma, A. (2016). Predicting job completion time in heterogeneous MapReduce environments. In Proceedings of the 30th IEEE international parallel and distributed processing symposium workshops (pp. 17–27). IEEE.
Wang, K., & Khan, M.M.H. (2015). Performance prediction for Apache Spark platform. In 2015 IEEE 17th international conference on high performance computing and communications (HPCC), 2015 IEEE 7th international symposium on cyberspace safety and security (CSS), and 2015 IEEE 12th international conference on embedded software and systems (ICESS) (pp. 166–173). IEEE.
Zimmermann, A. (2017). Modelling and performance evaluation with TimeNET 4.4 (pp. 300–303). Cham: Springer International Publishing.

Metadata
Title
Quantitative Analysis of Apache Storm Applications: The NewsAsset Case Study
Authors
José I. Requeno
José Merseguer
Simona Bernardi
Diego Perez-Palacin
Giorgos Giotis
Vasilis Papanikolaou
Publication date
14.04.2018
Publisher
Springer US
Published in
Information Systems Frontiers / Issue 1/2019
Print ISSN: 1387-3326
Electronic ISSN: 1572-9419
DOI
https://doi.org/10.1007/s10796-018-9851-x
