
2022 | OriginalPaper | Book chapter

Optimal Design of Checkpoint Systems with General Structures, Tasks and Schemes

Authors: Kenichiro Naruse, Toshio Nakagawa

Published in: Reliability and Maintainability Assessment of Industrial Systems

Publisher: Springer International Publishing


Abstract

This chapter proposes several checkpoint systems with general structures, tasks and schemes. In earlier work, we considered duplex and majority systems as redundancy techniques and applied them to two checkpoint models, one with constant and one with random interval times. Given the overheads for checkpoints, we obtained the mean execution times until the process succeeds and derived the optimal checkpoint times that minimize them. In this chapter, we first introduce the standard checkpoint model and then propose general checkpoint models that include parallel, series and bridge systems. Furthermore, we consider tandem and bulk tasks, apply them to two schemes, and compare the optimal policies theoretically and numerically. Finally, as examples of the above models, we give four models, obtain their mean execution times analytically, and discuss numerically which scheme is better.
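The abstract does not state the chapter's formulas, so the following is only a minimal sketch of how a mean execution time and an optimal checkpoint count can be computed under a textbook-style model. Everything in it is an assumption made for illustration and is not taken from the chapter: a task of length S is split into N equal intervals, errors occur as a Poisson process with rate lam, an error is detected only at the checkpoint that closes an interval, each attempt costs the interval length plus an overhead C, and a failed interval is re-executed from the previous checkpoint. The function names are hypothetical.

```python
import math


def mean_execution_time(S, N, lam, C):
    """Mean time to complete a task of length S using N equal intervals.

    Illustrative assumptions (not from the chapter):
    - errors follow a Poisson process with rate lam,
    - each attempt of one interval costs T + C, where T = S / N,
    - an interval succeeds with probability exp(-lam * T), otherwise
      it is retried from the previous checkpoint.
    """
    T = S / N
    # Attempts per interval are geometric with success probability
    # exp(-lam * T), so the mean number of attempts is exp(lam * T).
    return N * (T + C) * math.exp(lam * T)


def optimal_number_of_checkpoints(S, lam, C, n_max=1000):
    """Brute-force search for the N in 1..n_max with the smallest mean time."""
    return min(range(1, n_max + 1),
               key=lambda n: mean_execution_time(S, n, lam, C))


if __name__ == "__main__":
    S, lam, C = 10.0, 0.5, 0.1  # hypothetical parameter values
    n_best = optimal_number_of_checkpoints(S, lam, C)
    print(n_best, mean_execution_time(S, n_best, lam, C))
```

Under these assumptions the trade-off is visible directly: more checkpoints shorten each interval and reduce rework after an error, but every checkpoint adds the overhead C, so the mean execution time has a finite minimizer.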


Metadata
Title
Optimal Design of Checkpoint Systems with General Structures, Tasks and Schemes
Authors
Kenichiro Naruse
Toshio Nakagawa
Copyright year
2022
DOI
https://doi.org/10.1007/978-3-030-93623-5_4
