Skip to main content
Top

2015 | OriginalPaper | Chapter

Historic Learning Approach for Auto-tuning OpenACC Accelerated Scientific Applications

Authors : Shahzeb Siddiqui, Fatemah AlZayer, Saber Feki

Published in: High Performance Computing for Computational Science -- VECPAR 2014

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The performance optimization of scientific applications usually requires an in-depth knowledge of the hardware and software. A performance tuning mechanism is suggested to automatically tune OpenACC parameters to adapt to the execution environment on a given system. A historic learning based methodology is suggested to prune the parameter search space for a more efficient auto-tuning process. This approach is applied to tune the OpenACC gang and vector clauses for a better mapping of the compute kernels onto the underlying architecture. Our experiments show a significant performance improvement against the default compiler parameters and drastic reduction in tuning time compared to a brute force search-based approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
3.
go back to reference Gabriel, E., Feki, S., Benkert, K., Chaarawi, M.: The abstract data and communication library. J. Algorithms Comput. Technol. 2(4), 581600 (2008)CrossRef Gabriel, E., Feki, S., Benkert, K., Chaarawi, M.: The abstract data and communication library. J. Algorithms Comput. Technol. 2(4), 581600 (2008)CrossRef
4.
go back to reference Gabriel, E., Feki, S., Benkert, K., Resch, M.: Towards performance and portability through runtime adaption for high performance computing applications. In: International Supercomputing Conference, Dresden, Germany, June 2008 Gabriel, E., Feki, S., Benkert, K., Resch, M.: Towards performance and portability through runtime adaption for high performance computing applications. In: International Supercomputing Conference, Dresden, Germany, June 2008
5.
go back to reference Choi, J.W., Singh, A., Vuduc, R.W.: Model-driven autotuning of sparse matrix-vector multiply on GPUs. In: Proceedings of the 15th Symposium on Principles and Practice of Parallel Programming Choi, J.W., Singh, A., Vuduc, R.W.: Model-driven autotuning of sparse matrix-vector multiply on GPUs. In: Proceedings of the 15th Symposium on Principles and Practice of Parallel Programming
6.
go back to reference Dolbeau, R., Bihan, S., Bodin, F.: HMPP: a hybrid multi-core parallel programming environment. In: The 1st Workshop on General Purpose Processing on Graphics Processing Units, GPGPU (2007) Dolbeau, R., Bihan, S., Bodin, F.: HMPP: a hybrid multi-core parallel programming environment. In: The 1st Workshop on General Purpose Processing on Graphics Processing Units, GPGPU (2007)
7.
go back to reference Siddiqui, S., Feki, S.: Predictive performance tuning of OpenACC accelerated applications, 29th International Conference, 22–26 June 2014, Leipzig, Germany. LNCS, vol. 8488, pp. 511–512 (2014) Siddiqui, S., Feki, S.: Predictive performance tuning of OpenACC accelerated applications, 29th International Conference, 22–26 June 2014, Leipzig, Germany. LNCS, vol. 8488, pp. 511–512 (2014)
8.
go back to reference Feki, S., Gabriel, E.: A historic knowledge based approach for dynamic optimization. In: Proceedings of the International Conference on Parallel Computing, pp. 389–396 (2009) Feki, S., Gabriel, E.: A historic knowledge based approach for dynamic optimization. In: Proceedings of the International Conference on Parallel Computing, pp. 389–396 (2009)
9.
go back to reference Feki, S., Gabriel, E.: Incorporating historic knowledge into a communication library for self-optimizing high performance computing applications. In: Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems, Venice, Italy (2008) Feki, S., Gabriel, E.: Incorporating historic knowledge into a communication library for self-optimizing high performance computing applications. In: Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems, Venice, Italy (2008)
10.
go back to reference Frigo, M., Johnson, S.: The design and implementation of FFTW3. Proceedings of IEEE 93(2), 216–231 (2005)CrossRef Frigo, M., Johnson, S.: The design and implementation of FFTW3. Proceedings of IEEE 93(2), 216–231 (2005)CrossRef
11.
go back to reference Mametjanov, A., Lowell, M.C., Norris, B.: Autotuning stencil-based computations on GPUs, In: Cluster Conference, Beijing, China (2012) Mametjanov, A., Lowell, M.C., Norris, B.: Autotuning stencil-based computations on GPUs, In: Cluster Conference, Beijing, China (2012)
12.
go back to reference Vuduc, R., Demmel, J.W., Bilmes, J.A.: Statistical models for empirical search-based performance tuning. Int. J. High Perform. Comput. Appl. 18(1), 6594 (2004)CrossRef Vuduc, R., Demmel, J.W., Bilmes, J.A.: Statistical models for empirical search-based performance tuning. Int. J. High Perform. Comput. Appl. 18(1), 6594 (2004)CrossRef
13.
go back to reference Tillmann, M., Karcher, T., Dachsbacher, C., Tichy, W.F.: Application-independent autotuning for GPUs. In: International Conference on Parallel Computing, Munich, Germany (2013) Tillmann, M., Karcher, T., Dachsbacher, C., Tichy, W.F.: Application-independent autotuning for GPUs. In: International Conference on Parallel Computing, Munich, Germany (2013)
14.
go back to reference Feki, S., Al-Jarro, A., Bagci, H.: Multi-GPU-based acceleration of the explicit time domain volume integral equation solver using MPI-OpenACC. In: IEEE International Symposium on Antennas and Propagation and USNC/URSI National Radio Science, Lake Buena Vista, Florida, USA (2013) Feki, S., Al-Jarro, A., Bagci, H.: Multi-GPU-based acceleration of the explicit time domain volume integral equation solver using MPI-OpenACC. In: IEEE International Symposium on Antennas and Propagation and USNC/URSI National Radio Science, Lake Buena Vista, Florida, USA (2013)
15.
go back to reference Bodin, F.: Using CAPS compiler on NVIDIA kepler and CARMA systems. In: Supercomputing, Salt Lake City, Utah, USA (2012) Bodin, F.: Using CAPS compiler on NVIDIA kepler and CARMA systems. In: Supercomputing, Salt Lake City, Utah, USA (2012)
Metadata
Title
Historic Learning Approach for Auto-tuning OpenACC Accelerated Scientific Applications
Authors
Shahzeb Siddiqui
Fatemah AlZayer
Saber Feki
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-17353-5_19

Premium Partner