1993 | Original Paper | Book Chapter

Fault Tolerance in Distributed Shared Memory Multiprocessors

Written by: M. Dal Cin, A. Grygier, H. Hessenauer, U. Hildebrand, J. Hönig, W. Hohl, E. Michel, A. Pataricza

Published in: Parallel Computer Architectures

Publisher: Springer Berlin Heidelberg

Massively parallel systems pose a new challenge for fault tolerance. Designers of such systems cannot assume that every part will remain fault-free. As the complexity and number of components grow, the chance of a single failure or of multiple failures is no longer negligible. Redundancy, reconfigurability, and diagnosis techniques must therefore be incorporated at the design stage rather than added on afterwards. In this paper we discuss the fault tolerance techniques developed for MEMSY, a massively parallel architecture. These techniques can, in principle, be easily transferred to other distributed shared memory multiprocessors.

Metadata
Title
Fault Tolerance in Distributed Shared Memory Multiprocessors
Written by
M. Dal Cin
A. Grygier
H. Hessenauer
U. Hildebrand
J. Hönig
W. Hohl
E. Michel
A. Pataricza
Copyright Year
1993
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-21577-7_3
