Skip to main content
Top

2019 | OriginalPaper | Chapter

Order Analysis for Translating NESL Programs into Efficient GPU Code

Authors : Ming-Yi Yan, Ming-Hsiang Huang, Wuu Yang

Published in: New Trends in Computer Technologies and Applications

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The language NESL aims to facilitate GPU programming. In order to utilize the computation power of GPUs, NESL programs must be translated into efficient low-level code for execution. We propose a new translation technique. In NESL, apply-to-each is the main construct to extract parallel computation capability of GPUs. The result of apply-to-each is a sequence of elements. In traditional translation, the order of the elements in a sequence is always preserved. However, sometimes, the order need not be preserved and hence a faster method (which may not preserve the order of elements) for calculating the sequence may be employed. We propose the order analysis to determine if the order of elements in a sequence needs to be preserved. Order analysis is based on the taint analysis. In our experiments, we obtained 8.76x speedup on average.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Blelloch, G.E., Hardwick, J.C., Chatterjee, S., Sipelstein, J., Zagha, M.: Implementation of a portable nested data-parallel language. J. Parallel Distrib. Comput. 102–111 (1994) Blelloch, G.E., Hardwick, J.C., Chatterjee, S., Sipelstein, J., Zagha, M.: Implementation of a portable nested data-parallel language. J. Parallel Distrib. Comput. 102–111 (1994)
2.
go back to reference Bergstrom, L., Reppy, J.: Nested data-parallelism on the GPU. In: Proceedings of 17th ACM SIGPLAN International Conference on Functional Programming, pp. 247–258 (2012) Bergstrom, L., Reppy, J.: Nested data-parallelism on the GPU. In: Proceedings of 17th ACM SIGPLAN International Conference on Functional Programming, pp. 247–258 (2012)
3.
go back to reference Zhang, Y., Mueller, F.: CuNesl: compiling nested data-parallel languages for SIMT architectures. In: Proceedings of 41st International Conference on Parallel Processing (ICPP), pp. 340–349 (2012) Zhang, Y., Mueller, F.: CuNesl: compiling nested data-parallel languages for SIMT architectures. In: Proceedings of 41st International Conference on Parallel Processing (ICPP), pp. 340–349 (2012)
4.
go back to reference Huang, M.-H., Yang, W.: Partial flattening: a compilation technique for irregular nested parallelism on GPGPUs. In: Proceedings of 45th International Conference on Parallel Processing (ICPP) (2016) Huang, M.-H., Yang, W.: Partial flattening: a compilation technique for irregular nested parallelism on GPGPUs. In: Proceedings of 45th International Conference on Parallel Processing (ICPP) (2016)
5.
go back to reference Huang, M.-H.: PFACC: an OpenACC-like programming model for irregular nested parallelism. Ph.D. thesis, National Chiao Tung University (2018) Huang, M.-H.: PFACC: an OpenACC-like programming model for irregular nested parallelism. Ph.D. thesis, National Chiao Tung University (2018)
6.
go back to reference Schwartz, E.J., Avgerinos, T., Brumley, D.: All you ever wanted to know about dynamic taint analysis and forward symbolic execution (but might have been afraid to ask). In: Proceedings of IEEE Symposium on Security and Privacy, pp. 317–331 (2010) Schwartz, E.J., Avgerinos, T., Brumley, D.: All you ever wanted to know about dynamic taint analysis and forward symbolic execution (but might have been afraid to ask). In: Proceedings of IEEE Symposium on Security and Privacy, pp. 317–331 (2010)
7.
go back to reference Harris, M., Sengupta, S., Owens, J.D.: Parallel prefix sum (scan) with CUDA. GPU Gems 3(39), 851–876 (2007) Harris, M., Sengupta, S., Owens, J.D.: Parallel prefix sum (scan) with CUDA. GPU Gems 3(39), 851–876 (2007)
8.
go back to reference Adinets, A.: CUDA Pro Tip: optimized filtering with warp-aggregated atomics. CUDA Pro Tip (2014) Adinets, A.: CUDA Pro Tip: optimized filtering with warp-aggregated atomics. CUDA Pro Tip (2014)
9.
go back to reference Yan, M.-Y.: Order analysis for translating NESL programs into efficient GPU Code. Master’s thesis, National Chiao Tung University (2018) Yan, M.-Y.: Order analysis for translating NESL programs into efficient GPU Code. Master’s thesis, National Chiao Tung University (2018)
Metadata
Title
Order Analysis for Translating NESL Programs into Efficient GPU Code
Authors
Ming-Yi Yan
Ming-Hsiang Huang
Wuu Yang
Copyright Year
2019
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-9190-3_34

Premium Partner