Skip to main content
Erschienen in: Software and Systems Modeling 6/2019

15.04.2019 | Regular Paper

A UML Profile for the Design, Quality Assessment and Deployment of Data-intensive Applications

verfasst von: Diego Perez-Palacin, José Merseguer, José I. Requeno, M. Guerriero, Elisabetta Di Nitto, D. A. Tamburri

Erschienen in: Software and Systems Modeling | Ausgabe 6/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Big Data or Data-Intensive applications (DIAs) seek to mine, manipulate, extract or otherwise exploit the potential intelligence hidden behind Big Data. However, several practitioner surveys remark that DIAs potential is still untapped because of very difficult and costly design, quality assessment and continuous refinement. To address the above shortcoming, we propose the use of a UML domain-specific modeling language or profile specifically tailored to support the design, assessment and continuous deployment of DIAs. This article illustrates our DIA-specific profile and outlines its usage in the context of DIA performance engineering and deployment. For DIA performance engineering, we rely on the Apache Hadoop technology, while for DIA deployment, we leverage the TOSCA language. We conclude that the proposed profile offers a powerful language for data-intensive software and systems modeling, quality evaluation and automated deployment of DIAs on private or public clouds.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
TOSCA is a language to specify deployable blueprints in line with the emerging Infrastructure-as-Code (IasC) paradigm [18].
 
2
Modeling and Analysis of Real-Time Embedded Systems.
 
3
Dependability Analysis and Modeling.
 
4
The DIA library is described in the technical “Appendix B”.
 
5
See “Appendix B” for details on data types.
 
6
In Fig. 5, stereotypes with dark gray background have been taken from MARTE and the light gray ones from DAM.
 
7
Yet Another Resource Negotiator.
 
12
We say “at least” because we use Erlang-k distributions for the firing times, which are possible to be represented in CTMC, although increasing even further the number of states in function of the number of Erlang-k transitions and the value of k.
 
13
TOSCA is a language to specify deployable blueprints in line with the IaC paradigm [35]. See Appendix 1 for TOSCA details.
 
Literatur
1.
Zurück zum Zitat Ajmone-Marsan, M., Balbo, G., Conte, G., Donatelli, S., Franceschinis, G.: Modeling with Generalized Stochastic Petri Nets. Wiley, New York (1994)MATH Ajmone-Marsan, M., Balbo, G., Conte, G., Donatelli, S., Franceschinis, G.: Modeling with Generalized Stochastic Petri Nets. Wiley, New York (1994)MATH
2.
Zurück zum Zitat Ardagna, D., Bernardi, S., Gianniti, E., Karimian Aliabadi, S., Perez-Palacin, D., Requeno, J.I.: Modeling performance of hadoop applications: a journey from queueing networks to stochastic well formed nets. In: International Conference on Algorithms and Architectures for Parallel Processing, pp. 599–613. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-49583-5_47 Ardagna, D., Bernardi, S., Gianniti, E., Karimian Aliabadi, S., Perez-Palacin, D., Requeno, J.I.: Modeling performance of hadoop applications: a journey from queueing networks to stochastic well formed nets. In: International Conference on Algorithms and Architectures for Parallel Processing, pp. 599–613. Springer, Cham (2016). https://​doi.​org/​10.​1007/​978-3-319-49583-5_​47
3.
Zurück zum Zitat Ardagna, D., Di Nitto, E., Casale, G., Petcu, D., Mohagheghi, P., Mosser, S., Matthews, P., Gericke, A., Ballagny, C., D’Andria, F., Nechifor, C.-S., Sheridan, C.: Modaclouds: a model-driven approach for the design and execution of applications on multiple clouds. In: Proceedings of the 4th International Workshop on Modeling in Software Engineering, MiSE’12, pp. 50–56. IEEE Press, Piscataway, NJ (2012). http://dl.acm.org/citation.cfm?id=2664431.2664439 Ardagna, D., Di Nitto, E., Casale, G., Petcu, D., Mohagheghi, P., Mosser, S., Matthews, P., Gericke, A., Ballagny, C., D’Andria, F., Nechifor, C.-S., Sheridan, C.: Modaclouds: a model-driven approach for the design and execution of applications on multiple clouds. In: Proceedings of the 4th International Workshop on Modeling in Software Engineering, MiSE’12, pp. 50–56. IEEE Press, Piscataway, NJ (2012). http://​dl.​acm.​org/​citation.​cfm?​id=​2664431.​2664439
4.
Zurück zum Zitat Artac, M., Borovsak, T., Di Nitto, E., Guerriero, M., Perez-Palacin, D., Tamburri, D.A.: Infrastructure-as-code for data-intensive architectures: a model-driven development approach. In: IEEE International Conference on Software Architecture, ICSA 2018, Seattle, WA, April 30–May 4, 2018, pp. 156–165. IEEE Computer Society (2018). https://doi.org/10.1109/ICSA.2018.00025 Artac, M., Borovsak, T., Di Nitto, E., Guerriero, M., Perez-Palacin, D., Tamburri, D.A.: Infrastructure-as-code for data-intensive architectures: a model-driven development approach. In: IEEE International Conference on Software Architecture, ICSA 2018, Seattle, WA, April 30–May 4, 2018, pp. 156–165. IEEE Computer Society (2018). https://​doi.​org/​10.​1109/​ICSA.​2018.​00025
7.
Zurück zum Zitat Bell, G., Hey, T., Szalay, A.: Beyond the data deluge. Science 323(5919), 1297–1298 (2009)CrossRef Bell, G., Hey, T., Szalay, A.: Beyond the data deluge. Science 323(5919), 1297–1298 (2009)CrossRef
8.
Zurück zum Zitat Bernardi, S., Dominguez, J.L., Gómez, A., Joubert, C., Merseguer, José, Perez-Palacin, D., Requeno, J.I., Romeu, A.: A systematic approach for performance assessment using process mining. Empir. Softw. Eng. (2018) (accepted for publication). https://doi.org/10.1007/s10664-018-9606-9 Bernardi, S., Dominguez, J.L., Gómez, A., Joubert, C., Merseguer, José, Perez-Palacin, D., Requeno, J.I., Romeu, A.: A systematic approach for performance assessment using process mining. Empir. Softw. Eng. (2018) (accepted for publication). https://​doi.​org/​10.​1007/​s10664-018-9606-9
9.
Zurück zum Zitat Bernardi, S., Requeno, J.I., Joubert, C., Romeu, A.: A systematic approach for performance evaluation using process mining: the Posidonia Operations case study. In: Proceedings of the 2nd International Workshop on Quality-Aware DevOps, QUDOS 2016, pp. 24–29. ACM, New York, NY (2016). https://doi.org/10.1145/2945408.2945413 Bernardi, S., Requeno, J.I., Joubert, C., Romeu, A.: A systematic approach for performance evaluation using process mining: the Posidonia Operations case study. In: Proceedings of the 2nd International Workshop on Quality-Aware DevOps, QUDOS 2016, pp. 24–29. ACM, New York, NY (2016). https://​doi.​org/​10.​1145/​2945408.​2945413
10.
Zurück zum Zitat Bernardi, S., Merseguer, J., Petriu, D.C.: A dependability profile within MARTE. Softw. Syst. Model. 10(3), 313–336 (2011)CrossRef Bernardi, S., Merseguer, J., Petriu, D.C.: A dependability profile within MARTE. Softw. Syst. Model. 10(3), 313–336 (2011)CrossRef
11.
Zurück zum Zitat Bernardi, S., Merseguer, J., Petriu, D.C.: Model-Driven Dependability Assessment of Software Systems. Springer, New York (2013)CrossRefMATH Bernardi, S., Merseguer, J., Petriu, D.C.: Model-Driven Dependability Assessment of Software Systems. Springer, New York (2013)CrossRefMATH
15.
Zurück zum Zitat Chen, C.L.P., Zhang, C.-Y.: Data-intensive applications, challenges, techniques and technologies: a survey on big data. Inf. Sci. 275, 314–347 (2014)CrossRef Chen, C.L.P., Zhang, C.-Y.: Data-intensive applications, challenges, techniques and technologies: a survey on big data. Inf. Sci. 275, 314–347 (2014)CrossRef
17.
Zurück zum Zitat Clements, P., Kazman, R., Klein, M.: Evaluating Software Architectures: Methods and Case Studies. Addison-Wesley, Boston (2001) Clements, P., Kazman, R., Klein, M.: Evaluating Software Architectures: Methods and Case Studies. Addison-Wesley, Boston (2001)
20.
Zurück zum Zitat Cortellessa, V., Di Marco, A., Inverardi, P.: Model-Based Software Performance Analysis. Springer, New York (2011)CrossRef Cortellessa, V., Di Marco, A., Inverardi, P.: Model-Based Software Performance Analysis. Springer, New York (2011)CrossRef
21.
Zurück zum Zitat Dean, J., Ghemawat, S.: Mapreduce: a flexible data processing tool. Commun. ACM 53(1), 72–77 (2010)CrossRef Dean, J., Ghemawat, S.: Mapreduce: a flexible data processing tool. Commun. ACM 53(1), 72–77 (2010)CrossRef
22.
Zurück zum Zitat Di Nitto, E., Mattew, P., Petcu, D., Solberg, A. (eds.): Model-Driven Development and Operation of Multi-Cloud Applications. PoliMI SpringerBriefs. Springer, New York (2017) Di Nitto, E., Mattew, P., Petcu, D., Solberg, A. (eds.): Model-Driven Development and Operation of Multi-Cloud Applications. PoliMI SpringerBriefs. Springer, New York (2017)
25.
Zurück zum Zitat Gómez, A., Merseguer, J., Di Nitto, E., Tamburri, D.A.: Towards a uml profile for data intensive applications. In: Proceedings of the 2Nd International Workshop on Quality-Aware DevOps, QUDOS 2016, pp. 18–23, ACM, New York, NY (2016). https://doi.org/10.1145/2945408.2945412 Gómez, A., Merseguer, J., Di Nitto, E., Tamburri, D.A.: Towards a uml profile for data intensive applications. In: Proceedings of the 2Nd International Workshop on Quality-Aware DevOps, QUDOS 2016, pp. 18–23, ACM, New York, NY (2016). https://​doi.​org/​10.​1145/​2945408.​2945412
27.
Zurück zum Zitat Kroß, J., Brunnert, A., Krcmar, H.: Modeling big data systems by extending the palladio component model. Softwaretechnik-Trends 35(3) (2015) Kroß, J., Brunnert, A., Krcmar, H.: Modeling big data systems by extending the palladio component model. Softwaretechnik-Trends 35(3) (2015)
28.
Zurück zum Zitat Kroß, J., Krcmar, H.: Modeling and simulating Apache Spark streaming applications. Softwaretechnik-Trends 36(4), 1–3 (2016) Kroß, J., Krcmar, H.: Modeling and simulating Apache Spark streaming applications. Softwaretechnik-Trends 36(4), 1–3 (2016)
29.
Zurück zum Zitat Lagarde, F., Espinoza, H., Terrier, F., Gérard, S.: Improving UML profile design practices by leveraging conceptual domain models. In: 22nd IEEE/ACM International Conference on Automated Software Engineering (ASE 2007), Atlanta (USA), ACM, Nov 2007, pp. 445–448 Lagarde, F., Espinoza, H., Terrier, F., Gérard, S.: Improving UML profile design practices by leveraging conceptual domain models. In: 22nd IEEE/ACM International Conference on Automated Software Engineering (ASE 2007), Atlanta (USA), ACM, Nov 2007, pp. 445–448
30.
Zurück zum Zitat Langheinrich, M.: Privacy by design. In: Abowd, G.D., Brumitt, B., Shafer, A. (eds.) UBICOMP 2001, pp. 273–291. Springer, New York (2001)CrossRef Langheinrich, M.: Privacy by design. In: Abowd, G.D., Brumitt, B., Shafer, A. (eds.) UBICOMP 2001, pp. 273–291. Springer, New York (2001)CrossRef
31.
Zurück zum Zitat Lazowska, E.D., Zahorjan, J., Scott Graham, G., Sevcik, C.: Quantitative System Performance: Computer System Analysis Using Queueing Network models. Prentice-Hall, Upper Saddle River (1984) Lazowska, E.D., Zahorjan, J., Scott Graham, G., Sevcik, C.: Quantitative System Performance: Computer System Analysis Using Queueing Network models. Prentice-Hall, Upper Saddle River (1984)
32.
Zurück zum Zitat Lipton, P., Palma, D., Rutkowski, M., Tamburri, D.A.: TOSCA solves big problems in the cloud and beyond. IEEE Cloud 21(11), 31–39 (2016) Lipton, P., Palma, D., Rutkowski, M., Tamburri, D.A.: TOSCA solves big problems in the cloud and beyond. IEEE Cloud 21(11), 31–39 (2016)
33.
Zurück zum Zitat López-Grao, J.P., Merseguer, J., Campos, J.: From UML activity diagrams to stochastic petri nets: application to software performance engineering. In: Proceedings of the 4th International Workshop on Software and Performance, WOSP’04, pp. 25–36, ACM, New York, NY (2004). https://doi.org/10.1145/974044.974048 López-Grao, J.P., Merseguer, J., Campos, J.: From UML activity diagrams to stochastic petri nets: application to software performance engineering. In: Proceedings of the 4th International Workshop on Software and Performance, WOSP’04, pp. 25–36, ACM, New York, NY (2004). https://​doi.​org/​10.​1145/​974044.​974048
34.
Zurück zum Zitat Morris, K.: Infrastructure As Code: Managing Servers in the Cloud. Oreilly & Associates Incorporated, Sebastopol (2016) Morris, K.: Infrastructure As Code: Managing Servers in the Cloud. Oreilly & Associates Incorporated, Sebastopol (2016)
36.
Zurück zum Zitat Perez-Palacin, D, Ridene, Y., Merseguer, J.: Quality assessment in DevOps: automated analysis of a tax fraud detection system. In: Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering Companion, ICPE’17 Companion, pp. 133–138, ACM, New York, NY (2017) Perez-Palacin, D, Ridene, Y., Merseguer, J.: Quality assessment in DevOps: automated analysis of a tax fraud detection system. In: Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering Companion, ICPE’17 Companion, pp. 133–138, ACM, New York, NY (2017)
37.
Zurück zum Zitat Petriu, D.C., Alhaj, M., Tawhid, R.: Software Performance Modeling. Lecture Notes in Computer Science, vol. 7320. Springer, Berlin (2012) Petriu, D.C., Alhaj, M., Tawhid, R.: Software Performance Modeling. Lecture Notes in Computer Science, vol. 7320. Springer, Berlin (2012)
39.
Zurück zum Zitat Rajbhoj, A., Kulkarni, V., Bellarykar, N.: Early experience with model-driven development of MapReduce based big data application. In: 2014 21st Asia-Pacific Software Engineering Conference (APSEC), vol. 1, pp. 94–97 (Dec 2014). https://doi.org/10.1109/APSEC.2014.23 Rajbhoj, A., Kulkarni, V., Bellarykar, N.: Early experience with model-driven development of MapReduce based big data application. In: 2014 21st Asia-Pacific Software Engineering Conference (APSEC), vol. 1, pp. 94–97 (Dec 2014). https://​doi.​org/​10.​1109/​APSEC.​2014.​23
40.
Zurück zum Zitat Ranjan, R.: Modeling and simulation in performance optimization of big data processing frameworks. IEEE Cloud Comput. 1(4), 14–19 (2014)CrossRef Ranjan, R.: Modeling and simulation in performance optimization of big data processing frameworks. IEEE Cloud Comput. 1(4), 14–19 (2014)CrossRef
41.
43.
Zurück zum Zitat Sanders, W.H., Meyer, J.F.: Stochastic Activity Networks: Formal Definitions and Concepts. Lecture Notes in Computer Science, vol. 2090. Springer, Berlin (2001) Sanders, W.H., Meyer, J.F.: Stochastic Activity Networks: Formal Definitions and Concepts. Lecture Notes in Computer Science, vol. 2090. Springer, Berlin (2001)
45.
Zurück zum Zitat Santurkar, S., Arora, A., Chandrasekaran, K.: Stormgen—a domain specific language to create ad-hoc storm topologies. In: 2014 Federated Conference on Computer Science and Information Systems (FedCSIS), pp. 1621–1628 (Sept 2014). https://doi.org/10.15439/2014F278 Santurkar, S., Arora, A., Chandrasekaran, K.: Stormgen—a domain specific language to create ad-hoc storm topologies. In: 2014 Federated Conference on Computer Science and Information Systems (FedCSIS), pp. 1621–1628 (Sept 2014). https://​doi.​org/​10.​15439/​2014F278
47.
Zurück zum Zitat Selic, B.: A systematic approach to domain-specific language design using UML. In: Tenth IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC 2007), 7–9 May 2007, Santorini Island, Greece, pp. 2–9 Computer Society (2007) Selic, B.: A systematic approach to domain-specific language design using UML. In: Tenth IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC 2007), 7–9 May 2007, Santorini Island, Greece, pp. 2–9 Computer Society (2007)
48.
Zurück zum Zitat Selic, B., Gerard, S. (eds.): Modeling and Analysis of Real-Time and Embedded Systems with UML and MARTE. Morgan Kaufmann, Boston (2014) Selic, B., Gerard, S. (eds.): Modeling and Analysis of Real-Time and Embedded Systems with UML and MARTE. Morgan Kaufmann, Boston (2014)
49.
Zurück zum Zitat Smith, C.U., Williams, L.G.: Performance Solutions: A Practical Guide to Creating Responsive. Scalable Software. Addison Wesley Longman Publishing Co., Inc., Redwood City, CA (2002) Smith, C.U., Williams, L.G.: Performance Solutions: A Practical Guide to Creating Responsive. Scalable Software. Addison Wesley Longman Publishing Co., Inc., Redwood City, CA (2002)
64.
Zurück zum Zitat UML Profile for MARTE: Modeling and Analysis of Real-Time and Embedded Systems (June 2011). Version 1.1, OMG document: formal/2011-06-02 UML Profile for MARTE: Modeling and Analysis of Real-Time and Embedded Systems (June 2011). Version 1.1, OMG document: formal/2011-06-02
65.
Zurück zum Zitat Unified Modeling Language: Infrastructure, 2017. Version 2.5.1, OMG document: formal/2017-12-05 Unified Modeling Language: Infrastructure, 2017. Version 2.5.1, OMG document: formal/2017-12-05
66.
Zurück zum Zitat Wang, K., Khan, M.M.H.: Performance prediction for Apache Apark platform. In: 2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS), and 2015 IEEE 12th International Conference on Embedded Software and Systems (ICESS), pp. 166–173 (2015) Wang, K., Khan, M.M.H.: Performance prediction for Apache Apark platform. In: 2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS), and 2015 IEEE 12th International Conference on Embedded Software and Systems (ICESS), pp. 166–173 (2015)
67.
69.
Zurück zum Zitat Wille, R.: Formal concept analysis as mathematical theory of concepts and concept hierarchies. In: Formal Concept Analysis, pp. 1–33 (2005) Wille, R.: Formal concept analysis as mathematical theory of concepts and concept hierarchies. In: Formal Concept Analysis, pp. 1–33 (2005)
Metadaten
Titel
A UML Profile for the Design, Quality Assessment and Deployment of Data-intensive Applications
verfasst von
Diego Perez-Palacin
José Merseguer
José I. Requeno
M. Guerriero
Elisabetta Di Nitto
D. A. Tamburri
Publikationsdatum
15.04.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Software and Systems Modeling / Ausgabe 6/2019
Print ISSN: 1619-1366
Elektronische ISSN: 1619-1374
DOI
https://doi.org/10.1007/s10270-019-00730-3

Weitere Artikel der Ausgabe 6/2019

Software and Systems Modeling 6/2019 Zur Ausgabe