Skip to main content
Erschienen in: The Journal of Supercomputing 1/2018

28.08.2017

A comprehensive evaluation of availability and operational cost for a virtualized server system using stochastic reward nets

verfasst von: Tuan Anh Nguyen, Dugki Min, Eunmi Choi

Erschienen in: The Journal of Supercomputing | Ausgabe 1/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Virtualized server systems, as a major underlying element in high-performance computing systems, require further studies on many aspects of dependability. Among the significant factors, the availability measures are crucial to deliver high-quality services. Previous studies presented various modeling and analysis results on system availability of a virtualized system with two servers using a continuous-time Markov chain. In this study, we propose a cluster model of m virtualized servers using stochastic reward nets (SRNs). We focused on the overall configuration of the entire system, and in the modeling, we considered the detailed interactions between the servers. The model incorporates specific techniques for high availability of the system: standby techniques, virtual machine (VM) live migration and VM failover techniques. Simplified failures and recovery behaviors of physical servers and VMs are taken into consideration. Various SRN models are developed based on different case studies in which the techniques to improve the system’s overall availability are incorporated one after another. We conducted comprehensive analyses on the models with significant metrics of interest including: steady-state availability (SSA), sensitivity analysis of the SSA, downtime cost and operational cost analyses. We propose to use reward functions featured in SRN as a solution to help ease the computation of operational costs. The study provides an analytical basis for system adjustment and configuration of virtualized systems in data centers, cloud computing in practice.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
2.
Zurück zum Zitat Stansberry M (2013) 2013 data center industry survey. Uptime Institute, LLC, Washington Stansberry M (2013) 2013 data center industry survey. Uptime Institute, LLC, Washington
3.
Zurück zum Zitat Thein T, Chi SD, Park JS (2008) Availability modeling and analysis on virtualized clustering with rejuvenation. IJCSNS Int J Comput Sci Netw Secur 8(9):72–80 Thein T, Chi SD, Park JS (2008) Availability modeling and analysis on virtualized clustering with rejuvenation. IJCSNS Int J Comput Sci Netw Secur 8(9):72–80
9.
Zurück zum Zitat Nguyen TA, Kim DS, Park JS (2014) A comprehensive availability modeling and analysis of a virtualized servers system using stochastic reward nets. Sci World J 2014:1–18. doi:10.1155/2014/165316 Nguyen TA, Kim DS, Park JS (2014) A comprehensive availability modeling and analysis of a virtualized servers system using stochastic reward nets. Sci World J 2014:1–18. doi:10.​1155/​2014/​165316
12.
Zurück zum Zitat Sahoo J, Mohapatra S, Lath R (2010) Virtualization: a survey on concepts, taxonomy and associated security issues. In: 2010 Second International Conference on Computer and Network Technology (ICCNT). doi:10.1109/ICCNT.2010.49 Sahoo J, Mohapatra S, Lath R (2010) Virtualization: a survey on concepts, taxonomy and associated security issues. In: 2010 Second International Conference on Computer and Network Technology (ICCNT). doi:10.​1109/​ICCNT.​2010.​49
17.
Zurück zum Zitat Adeshiyan T, Attanasio CR, Farr EM, Harper RE, Pelleg D, Schulz C, Spainhower LF, Ta-Shma P, Tomek LA (2009) Using virtualization for high availability and disaster recovery. IBM J Res Dev 53(4):8:1–8:11. doi:10.1147/JRD.2009.5429062 CrossRef Adeshiyan T, Attanasio CR, Farr EM, Harper RE, Pelleg D, Schulz C, Spainhower LF, Ta-Shma P, Tomek LA (2009) Using virtualization for high availability and disaster recovery. IBM J Res Dev 53(4):8:1–8:11. doi:10.​1147/​JRD.​2009.​5429062 CrossRef
21.
Zurück zum Zitat Han L, Xu J (2013) Availability models for virtualized systems with rejuvenation. J Comput Inf Syst 20:8389–8396. doi:10.12733/jcis8586 Han L, Xu J (2013) Availability models for virtualized systems with rejuvenation. J Comput Inf Syst 20:8389–8396. doi:10.​12733/​jcis8586
22.
23.
Zurück zum Zitat Nguyen TA, Park JS (2014) Availability Modeling and Analysis in a Virtualized Servers Network, Seoul, Korea Nguyen TA, Park JS (2014) Availability Modeling and Analysis in a Virtualized Servers Network, Seoul, Korea
26.
Zurück zum Zitat Lumpp T, Schneider J, Holtz J, Mueller M, Lenz N, Biazetti a, Petersen D (2008) From high availability and disaster recovery to business continuity solutions. IBM Syst J 47(4):605–619. doi:10.1147/SJ.2008.5386516 CrossRef Lumpp T, Schneider J, Holtz J, Mueller M, Lenz N, Biazetti a, Petersen D (2008) From high availability and disaster recovery to business continuity solutions. IBM Syst J 47(4):605–619. doi:10.​1147/​SJ.​2008.​5386516 CrossRef
30.
Zurück zum Zitat Cully B, Lefebvre G, Meyer D, Feeley M, Hutchinson N, Warfield A (2008) Remus: high availability via asynchronous virtual machine replication. In: NSDI’08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation. USENIX Association, pp 161–174. ISBN: 111- 999-5555-22-1. http://dl.acm.org/citation.cfm?id=1387589.1387601 Cully B, Lefebvre G, Meyer D, Feeley M, Hutchinson N, Warfield A (2008) Remus: high availability via asynchronous virtual machine replication. In: NSDI’08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation. USENIX Association, pp 161–174. ISBN: 111- 999-5555-22-1. http://​dl.​acm.​org/​citation.​cfm?​id=​1387589.​1387601
35.
Zurück zum Zitat Bailey D, Frank-Schultz E, Lindeque P, Temple JL III (2008) Three reliability engineering techniques and their application to evaluating the availability of IT systems: an introduction. IBM Syst J 47(4):577–589. doi:10.1147/SJ.2008.5386507 CrossRef Bailey D, Frank-Schultz E, Lindeque P, Temple JL III (2008) Three reliability engineering techniques and their application to evaluating the availability of IT systems: an introduction. IBM Syst J 47(4):577–589. doi:10.​1147/​SJ.​2008.​5386507 CrossRef
39.
Zurück zum Zitat Seshadri S, Muench PH, Chiu L, Koltsidas I, Ioannou N, Haas R, Liu Y, Mei M, Blinick S (2014) Software defined just-in-time caching in an enterprise storage system. IBM J Res Dev 58(2/3):7:1–7:13. doi:10.1147/JRD.2014.2303595 CrossRef Seshadri S, Muench PH, Chiu L, Koltsidas I, Ioannou N, Haas R, Liu Y, Mei M, Blinick S (2014) Software defined just-in-time caching in an enterprise storage system. IBM J Res Dev 58(2/3):7:1–7:13. doi:10.​1147/​JRD.​2014.​2303595 CrossRef
41.
67.
Zurück zum Zitat Svärd P, Hudzia B, Tordsson J, Elmroth E, Svärd P, Hudzia B, Tordsson J, Elmroth E (2011) Evaluation of delta compression techniques for efficient live migration of large virtual machines. ACM SIGPLAN Not 46(7):111. doi:10.1145/2007477.1952698 CrossRef Svärd P, Hudzia B, Tordsson J, Elmroth E, Svärd P, Hudzia B, Tordsson J, Elmroth E (2011) Evaluation of delta compression techniques for efficient live migration of large virtual machines. ACM SIGPLAN Not 46(7):111. doi:10.​1145/​2007477.​1952698 CrossRef
72.
Zurück zum Zitat Maleszewski J, Sosnowski J (2018) Managing and enhancing performance benchmarks. In: Zamojski W, Mazurkiewicz J, Sugier J, Walkowiak T, Kacprzyk J (eds) Advances in Dependability Engineering of Complex Systems: Proceedings of the Twelfth International Conference on Dependability and Complex Systems DepCoS-RELCOMEX, July 2–6, 2017, Brunów, Poland. Springer, Cham, pp 287–297. doi:10.1007/978-3-319-59415-6_28. ISBN: 978-3-319-59415-6. http://dx.doi.org/10.1007/978-3-319-59415-6_28 Maleszewski J, Sosnowski J (2018) Managing and enhancing performance benchmarks. In: Zamojski W, Mazurkiewicz J, Sugier J, Walkowiak T, Kacprzyk J (eds) Advances in Dependability Engineering of Complex Systems: Proceedings of the Twelfth International Conference on Dependability and Complex Systems DepCoS-RELCOMEX, July 2–6, 2017, Brunów, Poland. Springer, Cham, pp 287–297. doi:10.​1007/​978-3-319-59415-6_​28. ISBN: 978-3-319-59415-6. http://​dx.​doi.​org/​10.​1007/​978-3-319-59415-6_​28
76.
Zurück zum Zitat Guida M, Longo M, Postiglione F, Trivedi KS, Yin X (2013) Semi-Markov models for performance evaluation of failure-prone IP multimedia subsystem core networks. Proc Inst Mech Eng Part O J Risk Reliab 227(3):290–301. doi:10.1177/1748006X13485191 Guida M, Longo M, Postiglione F, Trivedi KS, Yin X (2013) Semi-Markov models for performance evaluation of failure-prone IP multimedia subsystem core networks. Proc Inst Mech Eng Part O J Risk Reliab 227(3):290–301. doi:10.​1177/​1748006X13485191​
77.
Zurück zum Zitat Shojafar M, Javanmardi S, Abolfazli S, Cordeschi N (2015) FUGE: a joint meta-heuristic approach to cloud job scheduling algorithm using fuzzy theory and a genetic method. Cluster Comput 18(2):829–844. doi:10.1007/s10586-014-0420-x CrossRef Shojafar M, Javanmardi S, Abolfazli S, Cordeschi N (2015) FUGE: a joint meta-heuristic approach to cloud job scheduling algorithm using fuzzy theory and a genetic method. Cluster Comput 18(2):829–844. doi:10.​1007/​s10586-014-0420-x CrossRef
Metadaten
Titel
A comprehensive evaluation of availability and operational cost for a virtualized server system using stochastic reward nets
verfasst von
Tuan Anh Nguyen
Dugki Min
Eunmi Choi
Publikationsdatum
28.08.2017
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 1/2018
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-017-2127-2

Weitere Artikel der Ausgabe 1/2018

The Journal of Supercomputing 1/2018 Zur Ausgabe