skip to main content
article
Free Access

A bridging model for parallel computation

Published:01 August 1990Publication History
Skip Abstract Section

Abstract

The success of the von Neumann model of sequential computation is attributable to the fact that it is an efficient bridge between software and hardware: high-level languages can be efficiently compiled on to this model; yet it can be effeciently implemented in hardware. The author argues that an analogous bridge between software and hardware in required for parallel computation if that is to become as widely used. This article introduces the bulk-synchronous parallel (BSP) model as a candidate for this role, and gives results quantifying its efficiency both in implementing high-level language features and algorithms, as well as in being implemented in hardware.

References

  1. 1 Aggarwal, A., Chandra, A., and Snir, M. Communication complexity of PRAMs. Theor. Comput. $ci. To be published. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2 Aiello, B., Leighton, FT., Maggs, B., and Neumann, M. Fast algorithms for bit-serial routing on a hypercube. Manuscript, 1990.Google ScholarGoogle Scholar
  3. 3 Anderson, R.J. and Miller, G.L. Optical communication for pointer based algorithms. Tech. Pep. CRI 88-14, Computer Science Dept~, Univ. of Southern California, 1988.Google ScholarGoogle Scholar
  4. 4 Borodin, A. and Hopcroft, J.E. Routing merging and sorting on parallel models of computation. J. Comput. Syst. Sci 30 (1985) 130-145. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5 Carter, J.L. and Wegman, M.N. Universal classes of hash functions. J. Comput. Syst. Sci. 18 (1979) 143-154.Google ScholarGoogle ScholarCross RefCross Ref
  6. 6 Eppstein, D. and Galil, Z. Parallel algorithmic techniques for combinatorial computation. Annu. Rev. Comput. Sci. 3 (1988) 233-83. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7 Gibbons, P.B. A more practical PRAM model. In Proceedings of the 1989 ACM Symposium on Parallel Algorithms and Architectures. (1989) pp. 158-168. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8 Gottlieb, A. et al. The NYU ultracomputer-- Designing an MIMD shared memory parallel computer. IEEE 7~ans. Comput. 32, 2 (1983) 175-189.Google ScholarGoogle Scholar
  9. 9 Hoeffding, W. Probability inequalities for sums of bounded random variables. Am. Stat. Assoc. J. 58 (1963) 13-30.Google ScholarGoogle ScholarCross RefCross Ref
  10. 10 Karlin, A. and Upfal, E. Parallel hashing-An efficient' implementation of shared memory. J. ACM 35, 4 (1988) 876-892. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11 Karp, R.M. and Ramachandran, V. A survey of parallel algorithms for shared-memory machines. In Handbook of Theoretical Computer Science, J. van Leeuwen, Ed. North Holland, Amsterdam, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12 Kruskal, C.P., Rudolph, L., and Snir, M. A complexity theory of efficient parallel algorithms. Theor. Comput. Sci. To be published. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. 13 Ladner, R.E. and Fischer, M.J. Parallel prefix computation. J. ACM 27 (1980) 831-838. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14 Leighton, FT. Tight bounds on the complexity of sorting. IEEE 7?ans. Comput. C-34, 4 (1985) 344-354. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15 Littlestone, N. From on-line to batch learning. COLT 89, Morgan Kaufmann, San Mateo, CA., (1989) 269-284. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16 Maniloff, E.S.,Johnson, K.M., and Reif, J.H. Holographic routing network for parallel processing machines. Society of Photo Optical Instrumentation Engineers (SPIE), Paris, France 1989, V 1136, Holographic Optics II, Principles and Applications, 283-289.Google ScholarGoogle Scholar
  17. 17 Mehlhorn, K. and Vishkin, U. Randomized and deterministic simulations of PRAMs by parallel machines with restricted granularity of parallel memories. Acta Inf. 21 (1984) 339-374. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18 Papadimitriou, C.H. and Yannakakis, M. Towards an architecture-independent analysis of parallel algorithms. In Proceedings of the Twentieth ACM Symposium on Theory of Computing (1988) pp. 510-513. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19 Rajasekaran, S. and Reif, J.H. Optimal and sublogarithmic time randomized parallel sorting algorithms. SIAMJ. Comput. 18, 3 (1989) 594-607. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. 20 Ranade, A.G. How to emulate shared memory. In Proceedings of the Twenty-eighth IEEE Symposium on Foundations of Computer Science (1987) pp. 185-194.Google ScholarGoogle Scholar
  21. 21 Schwartz, J.T. Ultracomputers ACM TOPLAS 2 (1980) 484-521. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. 22 Siegel, A. On universal classes of fast high performance hash functions. In Proceedings of the Thirtieth IEEE Symposium on Foundations of Computer Science (1989).Google ScholarGoogle Scholar
  23. 23 Snyder, L. Type architectures, shared memory, and the corollary of modest potential. Annu. Rev. Comput. Sci. 1, (1986)289-317. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. 24 Turing, A.M. On computable numbers with an application to the Entscheidungs problem. In Proceedings of the London Mathematical Society 42 2 (1936) 230-265; correction, ibidem 43 (1937) 544-546.Google ScholarGoogle Scholar
  25. 25 Upfal, E. Efficient schemes for parallel cornmunication. J. ACM 31, 3 (1984) 507-517. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. 26 Valiant, L.G. A scheme for fast parallel communication. SIAM jr. Comput. 11 (1982) 350-361.Google ScholarGoogle Scholar
  27. 27 Valiant, L.G. Optimally universal parallel computers. Phil. Trans. R. Soc. Lond. A326 (1988) 373-376.Google ScholarGoogle ScholarCross RefCross Ref
  28. 28 Valiant, L.G. Bulk-synchronous parallel computers. In Parallel Processing and Artificial Intelligence, M. Reeve and S.E. Zenith, Eds., Wiley, 1989 15-22.Google ScholarGoogle Scholar
  29. 29 Valiant, L.G. General purpose parallel architectures. In Handbook of Theoretical Computer Science, J. van Leeuwen, Ed., North Holland, Amsterdam 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. 30 Walker, D.W. Portable programming within a message passing model: the FFT as an example. In Proc. 3rd Conference on Hypercube Concurrent Computers and Applications (1988), ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A bridging model for parallel computation

          Recommendations

          Reviews

          Ronald J. Watro

          Valiant observes that parallel processing has not made the expected progress in displacing sequential processing in computationally intensive domains. He attributes this failure to the lack of an appropriate bridging model for parallel computation, analogous to the von Neumann model for sequential computation. He then proposes the bulk-synchronous parallel (BSP) model as a candidate bridging model. The key claim concerning BSP is that it provides a fixed intermediate-level model through which high-level programs can be efficiently mapped to diverse machine architectures. The paper supports the claim with evidence of varying sorts: certain high-level constructs (such as memory hashing) and algorithms are efficiently implemented in BSP, and BSP is implemented in two low-level models. Most of the arguments are only sketched, with details left to the references. The BSP model consists of (1) components (each performing processing or memory functions or both), (2) a point-to-point routing mechanism for delivering messages between components, and (3) facilities for periodic synchronization of components. The efficiency of BSP as a bridging model is improved by parallel slackness in programs; that is, the number of virtual processes must be sufficiently larger than the number of processors. Another parameter important to BSP efficiency is the ratio <__?__Pub Fmt italic>g<__?__Pub Fmt /italic> of total local computation rate to total router delivery rate. Valiant states that existing machines have <__?__Pub Fmt italic>g<__?__Pub Fmt /italic>-values higher than ideal for BSP and investment in communication hardware to reduce <__?__Pub Fmt italic>g<__?__Pub Fmt /italic> (and keep it small as machine size increases) will result in more programmable machines. The paper is a brief, well-written introduction to a research topic. Whether a bridging model is needed for practical progress in parallel processing, or whether BSP is a suitable bridging model, remains unclear. The author does not discuss evidence against the existence of bridging models; for instance, parallel and distributed processing form a continuum with vastly different properties from one end to the other. Finally,<__?__Pub Caret> one might consider the analogous failure of nuclear fusion reactors to replace fission reactors—is something lacking in our understanding of fusion, or is fusion simply a more difficult engineering problem than was first expected__?__

          Access critical reviews of Computing literature here

          Become a reviewer for Computing Reviews.

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image Communications of the ACM
            Communications of the ACM  Volume 33, Issue 8
            Aug. 1990
            129 pages
            ISSN:0001-0782
            EISSN:1557-7317
            DOI:10.1145/79173
            Issue’s Table of Contents

            Copyright © 1990 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 1 August 1990

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • article

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader