Reliability Issues in Computing System Design

Published: 01 June 1978

    • Published in

      ACM Computing Surveys, Volume 10, Issue 2
      June 1978
      128 pages
      ISSN: 0360-0300
      EISSN: 1557-7341
      DOI: 10.1145/356725

      Copyright © 1978 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.

      Publisher

      Association for Computing Machinery

      New York, NY, United States
