Scalable Parallel Programming with CUDA: Is CUDA the parallel programming model that application developers have been waiting for?

Authors:
John Nickolls (NVIDIA), Ian Buck (NVIDIA), Michael Garland (NVIDIA), Kevin Skadron (University of Virginia)

Queue, Volume 6, Issue 2, March/April 2008, pp. 40–53. https://doi.org/10.1145/1365490.1365500

Published: 01 March 2008

Abstract

The advent of multicore CPUs and manycore GPUs means that mainstream processor chips are now parallel systems. Furthermore, their parallelism continues to scale with Moore’s law. The challenge is to develop mainstream application software that transparently scales its parallelism to leverage the increasing number of processor cores, much as 3D graphics applications transparently scale their parallelism to manycore GPUs with widely varying numbers of cores.

Published in

Queue Volume 6, Issue 2
GPU Computing
March/April 2008
63 pages
ISSN:1542-7730
EISSN:1542-7749
DOI:10.1145/1365490
Copyright © 2008 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States