Published in: Cluster Computing 3/2013

1 September 2013

A dynamic approach to tolerate soft errors

Authors: Lei Xiong, Qingping Tan


Abstract

A dynamic implementation of a software-based soft error tolerance method can protect more types of code and can therefore cover more soft errors. This paper explores soft error tolerance with a dynamic software-based method. We propose a new dynamic software-based approach to tolerating soft errors, in which the object being protected is the dynamic program. For the protected dynamic binary code, our approach preserves correct control flow and correct data flow to a significant extent. It duplicates every data value and performs every operation twice to ensure that the data stored to memory are correct. In addition, it ensures that every branch instruction jumps to the correct address by checking both the branch condition and the destination address. Our approach is implemented with dynamic binary instrumentation. Specifically, our tool is built on Valgrind, a framework for heavyweight dynamic binary instrumentation. Our experimental results show that the approach achieves higher reliability for dynamic software than approaches based on static program protection. However, it is suitable only for systems with strict reliability requirements, because it also sacrifices more software performance than static program protection methods do.
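The duplication and checking scheme described in the abstract can be illustrated with a small sketch. This is not the authors' Valgrind-based tool: it is a plain Python model, written under the assumption that each value has a primary and a shadow copy, that the two copies are compared before any store to memory, and that both the condition and the target of each branch are compared before the branch is taken. All names (`SoftErrorDetected`, `checked_store`, `checked_branch`, `sum_to`, and the `flip_at` fault-injection parameter) are hypothetical. The sketch only detects a mismatch; how the real tool reacts to one is not stated in the abstract.

```python
class SoftErrorDetected(Exception):
    """Raised when the primary and shadow copies disagree."""


BODY, EXIT = "body", "exit"  # symbolic branch targets (hypothetical)


def checked_store(memory, address, primary, shadow):
    """Write a value to memory only if both redundant copies agree."""
    if primary != shadow:
        raise SoftErrorDetected(f"store to {address!r}: {primary} != {shadow}")
    memory[address] = primary


def checked_branch(cond, cond_shadow, target, target_shadow):
    """Return the branch target after comparing condition and destination."""
    if cond != cond_shadow or target != target_shadow:
        raise SoftErrorDetected("branch condition or target mismatch")
    return target


def sum_to(n, memory, flip_at=None):
    """Sum 0..n-1 with every value and every operation duplicated.

    flip_at is a hypothetical fault-injection hook: at that iteration one
    bit of the primary accumulator is flipped to mimic a soft error.
    """
    acc, acc_s = 0, 0  # primary and shadow accumulator
    i, i_s = 0, 0      # primary and shadow loop counter
    while True:
        # Evaluate the loop condition and the branch target twice.
        cond, cond_s = i < n, i_s < n
        target = checked_branch(
            cond, cond_s,
            BODY if cond else EXIT,
            BODY if cond_s else EXIT,
        )
        if target == EXIT:
            break
        # Perform the addition once on each copy.
        acc, acc_s = acc + i, acc_s + i_s
        if flip_at == i:
            acc ^= 1 << 3  # injected single-bit flip in the primary copy
        i, i_s = i + 1, i_s + 1
    # Compare both copies before the result reaches memory.
    checked_store(memory, "result", acc, acc_s)
    return memory["result"]
```

Without an injected fault, `sum_to(5, {})` returns 10. With `flip_at=2`, the two accumulators diverge and `checked_store` raises `SoftErrorDetected` before anything is written to memory. Doing every operation twice is also where the performance cost mentioned in the abstract comes from.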


Metadata
Title
A dynamic approach to tolerate soft errors
Authors
Lei Xiong
Qingping Tan
Publication date
1 September 2013
Publisher
Springer US
Published in
Cluster Computing, Issue 3/2013
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-011-0196-1
