Skip to main content
Top

2018 | OriginalPaper | Chapter

Simulation and Application Performance Evaluation Using GPU Through CUDA C & Deep Learning in TensorFlow

Authors : Ajeet Kumar, Abhishek Khanna

Published in: Data Science and Analytics

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

GPUs have as of late pulled in the consideration of numerous application designers as product information parallel coprocessors. The most current eras of GPU design give less demanding programmability and expanded all-inclusive statement while keeping up the gigantic memory data transfer capacity and computational force of conventional GPUs. This open door ought to divert endeavors in GPU examination to setting up standards and systems that permit proficient mapping of calculation to design equipment. The project, shows the GeForce GTX 560 Ti processors association, highlights, and summed up improvement systems. Method to execution on the platform is by utilizing gigantic multithreading and use vast quantity of centers, cover up global storage inactivity. In order to achieve it, designers confront the test of striking the right harmony between every string’s asset utilization and the quantity of all the while dynamic strings. The assets to oversee incorporate the quantity of resistors also the degree of on-chip storage utilized per string, given strings per multiprocessor, also worldwide memory transmission capacity. The researcher likewise get expanded execution on rearranging, gets to off-chip storage and join solicitations for similar else adjoining storage areas therefore, implement established enhancements by diminishing quantity of implemented function. Such methodologies are used over an assortment of utilizations and areas and accomplish between a 10.5X to 14X application speedup. The similar result was achieved with the single core GPU using deep learning technique in TensorFlow framework.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Buck, I.: Brook Specification v0.2, October 2003 Buck, I.: Brook Specification v0.2, October 2003
4.
go back to reference Kennedy, K., Allen, R.: Automatic translation of Fortran programs to vector form. ACM Trans. Prog. Lang. Syst. 9(4), 491–542 (1987)CrossRef Kennedy, K., Allen, R.: Automatic translation of Fortran programs to vector form. ACM Trans. Prog. Lang. Syst. 9(4), 491–542 (1987)CrossRef
5.
go back to reference Atallah, M.J. (ed.): Algorithms and Theory of Computation Handbook. CRC Press LLC, Boco Raton (1998)MATH Atallah, M.J. (ed.): Algorithms and Theory of Computation Handbook. CRC Press LLC, Boco Raton (1998)MATH
6.
go back to reference Kennedy, K., Callahan, D., Carr, S.: Improving register allocation for subscripted variables. ACM SIGPLAN Not. 9(4), 328–342 (2004) Kennedy, K., Callahan, D., Carr, S.: Improving register allocation for subscripted variables. ACM SIGPLAN Not. 9(4), 328–342 (2004)
7.
go back to reference Akeley, K., Glanville, R.S., Kilgard, M.J., Mark, W.R.: Cg: a system for programming graphics hardware in a C-like language. In: ACM SIGGRAPH 2003 Papers, pp. 896–907 (2003) Akeley, K., Glanville, R.S., Kilgard, M.J., Mark, W.R.: Cg: a system for programming graphics hardware in a C-like language. In: ACM SIGGRAPH 2003 Papers, pp. 896–907 (2003)
8.
go back to reference Loveman, D.B.: High performance Fortran. IEEE Parallel Distrib. Technol.: Syst. Technol. 1(1), 25–42 (1993)CrossRef Loveman, D.B.: High performance Fortran. IEEE Parallel Distrib. Technol.: Syst. Technol. 1(1), 25–42 (1993)CrossRef
9.
go back to reference Rothberg, E.E., Lam, M.S., Wolf, M.E.: The cache performance and optimizations of blocked algorithms. In: Proceedings of 4th International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 63–74, April 1991 Rothberg, E.E., Lam, M.S., Wolf, M.E.: The cache performance and optimizations of blocked algorithms. In: Proceedings of 4th International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 63–74, April 1991
10.
go back to reference Allen, J.R., Kennedy, K.: Optimizing Compilers for Modern Architectures: A Dependence-Based Approach. Morgan Kaufmann Publishers Inc., Burlington (2002) Allen, J.R., Kennedy, K.: Optimizing Compilers for Modern Architectures: A Dependence-Based Approach. Morgan Kaufmann Publishers Inc., Burlington (2002)
11.
go back to reference Gray, J., Govindaraju, N.K., Manocha, D., Larsen, S.: A memory model for scientific algorithms on graphics processors. In: Proceedings of 2006 ACM/IEEE Conference on Supercomputing, no. 89 (2006) Gray, J., Govindaraju, N.K., Manocha, D., Larsen, S.: A memory model for scientific algorithms on graphics processors. In: Proceedings of 2006 ACM/IEEE Conference on Supercomputing, no. 89 (2006)
12.
go back to reference Sugerman, J., Fatahalian, K., Hanrahan, P.: Understanding the efficiency of GPU algorithms for matrix-matrix multiplication. In: Proceedings of ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware, pp. 133–137 (2004) Sugerman, J., Fatahalian, K., Hanrahan, P.: Understanding the efficiency of GPU algorithms for matrix-matrix multiplication. In: Proceedings of ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware, pp. 133–137 (2004)
13.
go back to reference Brainerd, W.S., Adams, J.C., Smith, B.T., Martin, J.T., Wagener, J.L.: Fortran 90 Handbook: Complete ANSI/ISO Reference. Intertext Publications Inc./McGraw-Hill Inc., New York (1992) Brainerd, W.S., Adams, J.C., Smith, B.T., Martin, J.T., Wagener, J.L.: Fortran 90 Handbook: Complete ANSI/ISO Reference. Intertext Publications Inc./McGraw-Hill Inc., New York (1992)
15.
go back to reference The PeakStream Platform: High productivity software development for multi-core processors. Technical report (2006) The PeakStream Platform: High productivity software development for multi-core processors. Technical report (2006)
Metadata
Title
Simulation and Application Performance Evaluation Using GPU Through CUDA C & Deep Learning in TensorFlow
Authors
Ajeet Kumar
Abhishek Khanna
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-8527-7_34

Premium Partner