Research Note
Data and Workload Distribution in a Multithreaded Architecture

https://doi.org/10.1006/jpdc.1996.1262Get rights and content

Abstract

Matching data distribution to workload distribution is important in improving the performance of distributed-memory multiprocessors. While data and workload distribution can be tailored to fit a particular problem to a particular distributed-memory architecture, it is often difficult to do so for various reasons including complexity of address computation, runtime data movement, and irregular resource usage. This report presents our study on multithreading for distributed-memory multiprocessors. Specifically, we investigate the effects of multithreading ondatadistribution andworkloaddistribution withvariable, thread granularity. Various types of workload distribution strategies are defined along with thread granularity. Several types of data distribution strategies are investigated. These include row-wise cyclic,k-way partial-row cyclic, and blocked distribution. To investigate the performance of multithreading, two problems are selected: highly sequential Gaussian elimination with partial pivoting and highly parallel matrix multiplication. Execution results on the 80-processor EM-4 distributed-memory multiprocessor indicate that multithreading can off set the loss due to the mismatch between data distribution and workload distribution even for sequential and irregular problems while giving high absolute performance.

References (20)

  • D Culler et al.

    TAM—A compiler controlled threaded abstract machine

    J. Parallel Distrib. Comput.

    (1993)
  • A. Agarwal, R. Bianchini, D. Chaiken, K. L. Johnson, D. Kranz, J. Kubiatowicz, B.-H. Lim, K. Mackenzie, D. Yeung, 1995,...
  • T Agerwala et al.

    SP2 system architecture

    IBM Systems J.

    (1995)
  • R. Alverson, D. Callahan, D. Cummings, B. Koblenz, A. Porterfield, B. Smith, June 1990, The Tera computer system,...
  • R. D. Blumofe, M. Frigo, C. F. Joerg, C. E. Leiserson, K. H. Randall, Apr. 1996, Dag-consistent distributed shared...
  • D. Chiou, B. S. Ang, R. Greiner, Arvind, J. C. Hoe, M. J. Beckerle, J. E. Hicks, A. Boughton, Aug. 29–31, 1995,...
  • G Gao et al.

    Advanced Topics in Dataflow Computing and Multithreading

    (1995)
  • K. Hayashi, T. Doi, T. Horie, Y. Koyanagi, O. Shiraki, N. Imamura, T. Shimizu, H. Ishihata, T. Shindo, Oct. 1994,...
  • W. C. Hsieh, P. Wang, W. E. Wiehl, May 1993, Computation migration: Enhancing locality for distributed-memory parallel...
There are more references available in the full text version of this article.

Cited by (0)

A preliminary version of this paper appears in theProceedings of 10th IEEE International Parallel Processing Symposium, Honolulu, Hawaii, April 1996.

2

E-mail: [email protected].

3

E-mail: [email protected].

4

E-mail: [email protected].

5

E-mail: [email protected].

View full text