Skip to main content

2023 | OriginalPaper | Buchkapitel

Graph Mining and Machine Learning for Shader Codes Analysis to Accelerate GPU Tuning

verfasst von : Lin Zhao, Arijit Khan, Robby Luo, Chai Kiat Yeo

Erschienen in: Complex Networks and Their Applications XI

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The graphics processing unit (GPU) has become one of the most important computing technologies. Disassembly shader codes, which are machine-level codes, are important for GPU designers (e.g., AMD, Intel, NVIDIA) to tune the hardware, including customization of clock speeds and voltages. Due to many use-cases of modern GPUs, engineers generally find it difficult to manually inspect a large number of shader codes emerging from these applications. To this end, we develop a framework that converts shader codes into graphs, and employs sophisticated graph mining and machine learning techniques over a number of applications to simplify shader graphs analysis in an effective and explainable manner, aiming at accelerating the whole debugging process and improving the overall hardware performance. We study shader codes’ evolution via temporal graph analysis and structure mining with frequent subgraphs. Using them as the underlying tools, we conduct a frame’s scene detection and representative frames selection. We group the scenes (applications) to identify the representative scenes, and predict a new application’s inefficient shaders. We empirically demonstrate the effectiveness of our solution and discuss future directions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Huang, S., Xiao, S., Feng, W.-c.: On the energy efficiency of graphics processing units for scientific computing. In: IPDPS (2009) Huang, S., Xiao, S., Feng, W.-c.: On the energy efficiency of graphics processing units for scientific computing. In: IPDPS (2009)
2.
Zurück zum Zitat Owens, J.D., Houston, M., Luebke, D., Green, S., Stone, J.E., Phillips, J.C.: GPU computing. Proc. IEEE 96(5), 879–899 (2008)CrossRef Owens, J.D., Houston, M., Luebke, D., Green, S., Stone, J.E., Phillips, J.C.: GPU computing. Proc. IEEE 96(5), 879–899 (2008)CrossRef
4.
Zurück zum Zitat Foster, I., Kesselman, C., Nick, J., Tuecke, S.: The Physiology of the Grid: an Open Grid Services Architecture for Distributed Systems Integration. Technical Report, Global Grid Forum (2002) Foster, I., Kesselman, C., Nick, J., Tuecke, S.: The Physiology of the Grid: an Open Grid Services Architecture for Distributed Systems Integration. Technical Report, Global Grid Forum (2002)
7.
Zurück zum Zitat de La Lama, C.S., Jääskeläinen, P., Kultala, H., Takala, J.: Programmable and scalable architecture for graphics processing units. Trans. High Perform. Embed. Archit. Compil. 5(2019), 21–38 (2019) de La Lama, C.S., Jääskeläinen, P., Kultala, H., Takala, J.: Programmable and scalable architecture for graphics processing units. Trans. High Perform. Embed. Archit. Compil. 5(2019), 21–38 (2019)
9.
Zurück zum Zitat Ren, X., Lis, M.: CHOPIN: scalable graphics rendering in multi-GPU systems via parallel image composition. In: IEEE International Symposium on High-Performance Computer Architecture, HPCA, pp. 709–722. IEEE (2021) Ren, X., Lis, M.: CHOPIN: scalable graphics rendering in multi-GPU systems via parallel image composition. In: IEEE International Symposium on High-Performance Computer Architecture, HPCA, pp. 709–722. IEEE (2021)
10.
Zurück zum Zitat Zaidi, S.I., Fam, S.T., Lotfi, P., Indukuru, V.R., Pan, J., Sowerby, A.M., Duprat, J.-L.: Shader Profiler (2018) Zaidi, S.I., Fam, S.T., Lotfi, P., Indukuru, V.R., Pan, J., Sowerby, A.M., Duprat, J.-L.: Shader Profiler (2018)
12.
Zurück zum Zitat Gascon, H., Yamaguchi, F., Arp, D., Rieck, K.: Structural detection of android malware using embedded call graphs. In AISec (2013) Gascon, H., Yamaguchi, F., Arp, D., Rieck, K.: Structural detection of android malware using embedded call graphs. In AISec (2013)
13.
Zurück zum Zitat Alanazi, R., Gharibi, G., Lee, Y.: Facilitating program comprehension with call graph multilevel hierarchical abstractions. J. Syst. Softw. 176(2021), 110945 (2021)CrossRef Alanazi, R., Gharibi, G., Lee, Y.: Facilitating program comprehension with call graph multilevel hierarchical abstractions. J. Syst. Softw. 176(2021), 110945 (2021)CrossRef
14.
Zurück zum Zitat Tekin, U., Buzluca, F.: A graph mining approach for detecting identical design structures in object-oriented design models. Sci. Comput. Program. 95(2014), 406–425 (2014)CrossRef Tekin, U., Buzluca, F.: A graph mining approach for detecting identical design structures in object-oriented design models. Sci. Comput. Program. 95(2014), 406–425 (2014)CrossRef
16.
Zurück zum Zitat Allamanis, M., Brockschmidt, M., Khademi, M.: Learning to represent programs with graphs. In: ICLR (2018) Allamanis, M., Brockschmidt, M., Khademi, M.: Learning to represent programs with graphs. In: ICLR (2018)
17.
Zurück zum Zitat Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: ICLR (2017) Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: ICLR (2017)
20.
Zurück zum Zitat Leskovec, J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets, 2nd edn. Cambridge University Press (2014) Leskovec, J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets, 2nd edn. Cambridge University Press (2014)
21.
Zurück zum Zitat Shervashidze, N., Schweitzer, P., Leeuwen, E.J.V., Mehlhorn, K., Borgwardt, K.M.: Weisfeiler-Lehman graph kernels. J. Mach. Learn. Res. 12(2011), 2539–2561 (2011) Shervashidze, N., Schweitzer, P., Leeuwen, E.J.V., Mehlhorn, K., Borgwardt, K.M.: Weisfeiler-Lehman graph kernels. J. Mach. Learn. Res. 12(2011), 2539–2561 (2011)
22.
Zurück zum Zitat Siglidis, G., Nikolentzos, G., Limnios, S., Giatsidis, C., Skianis, K., Vazirgiannis, M.: GraKeL: a graph kernel library in Python. J. Mach. Learn. Res. 21(54), 1–5 (2020) Siglidis, G., Nikolentzos, G., Limnios, S., Giatsidis, C., Skianis, K., Vazirgiannis, M.: GraKeL: a graph kernel library in Python. J. Mach. Learn. Res. 21(54), 1–5 (2020)
23.
Zurück zum Zitat Hamilton, W.L., Ying, Z., Leskovec, J.: Inductive representation learning on large graphs. In: NeurIPS (2017) Hamilton, W.L., Ying, Z., Leskovec, J.: Inductive representation learning on large graphs. In: NeurIPS (2017)
24.
Zurück zum Zitat Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: visual explanations from deep networks via gradient-based localization. In: ICCV (2017) Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: visual explanations from deep networks via gradient-based localization. In: ICCV (2017)
25.
Zurück zum Zitat Yan, X., Han, J.: gSpan: graph-based substructure pattern mining. In: ICDM (2002) Yan, X., Han, J.: gSpan: graph-based substructure pattern mining. In: ICDM (2002)
26.
Zurück zum Zitat Lin, X., Yang, X., Li, Y.: A deep clustering algorithm based on gaussian mixture model. J. Phys.: Conf. Ser. 1302, 032012 (August 2019) Lin, X., Yang, X., Li, Y.: A deep clustering algorithm based on gaussian mixture model. J. Phys.: Conf. Ser. 1302, 032012 (August 2019)
27.
Zurück zum Zitat Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: a new data clustering algorithm and its applications. Data Min. Knowl. Discov. 1, 141–182 (June 1997) Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: a new data clustering algorithm and its applications. Data Min. Knowl. Discov. 1, 141–182 (June 1997)
28.
Zurück zum Zitat Schaa, D., Kaeli, D.R.: Exploring the multiple-GPU design space. In: IPDPS (2009) Schaa, D., Kaeli, D.R.: Exploring the multiple-GPU design space. In: IPDPS (2009)
29.
Zurück zum Zitat Moya, V., Gonzalez, C., Roca, J., Fernandez, A., Espasa, R.: Shader performance analysis on a modern GPU architecture. In: MICRO (2005) Moya, V., Gonzalez, C., Roca, J., Fernandez, A., Espasa, R.: Shader performance analysis on a modern GPU architecture. In: MICRO (2005)
32.
Zurück zum Zitat Welton, B., Miller, B.P.: Diogenes: Looking for an Honest CPU/GPU Performance Measurement Tool (n.d.) Welton, B., Miller, B.P.: Diogenes: Looking for an Honest CPU/GPU Performance Measurement Tool (n.d.)
33.
Zurück zum Zitat Zhou, K., Adhianto, L., Anderson, J., Cherian, A., Grubisic, D., Krentel, M., Liu, Y., Meng, X., Mellor-Crummey, J.: Measurement and analysis of GPU accelerated applications with HPCToolkit. Parallel Comput. 108(2021), 102837 (2021)CrossRef Zhou, K., Adhianto, L., Anderson, J., Cherian, A., Grubisic, D., Krentel, M., Liu, Y., Meng, X., Mellor-Crummey, J.: Measurement and analysis of GPU accelerated applications with HPCToolkit. Parallel Comput. 108(2021), 102837 (2021)CrossRef
34.
Zurück zum Zitat Baddouh, C.A., Khairy, M., Green, R.N., Payer, M., Rogers, T.G.: Principal kernel analysis: a tractable methodology to simulate scaled GPU workloads. In: MICRO (2021) Baddouh, C.A., Khairy, M., Green, R.N., Payer, M., Rogers, T.G.: Principal kernel analysis: a tractable methodology to simulate scaled GPU workloads. In: MICRO (2021)
35.
Zurück zum Zitat Huang, J.-C., Nai, L., Kim, H., Lee, H.-H.S.: TBPoint: reducing simulation time for large-scale GPGPU kernels. In: IPDPS (2014) Huang, J.-C., Nai, L., Kim, H., Lee, H.-H.S.: TBPoint: reducing simulation time for large-scale GPGPU kernels. In: IPDPS (2014)
36.
Zurück zum Zitat Kambadur, M., Hong, S., Cabral, J., Patil, H., Luk, C., Sajid, S., Kim, M.A.: Fast computational GPU design with GT-pin. In: IISWC (2015) Kambadur, M., Hong, S., Cabral, J., Patil, H., Luk, C., Sajid, S., Kim, M.A.: Fast computational GPU design with GT-pin. In: IISWC (2015)
37.
Zurück zum Zitat Yu, Z., Eeckhout, L., Goswami, N., Li, T., John, L.K., Jin, H., Xu, C.-Z., Wu, J.: GPGPU-MiniBench: accelerating GPGPU micro-architecture simulation. IEEE Trans. Computers 64(11), 3153–3166 (2015) Yu, Z., Eeckhout, L., Goswami, N., Li, T., John, L.K., Jin, H., Xu, C.-Z., Wu, J.: GPGPU-MiniBench: accelerating GPGPU micro-architecture simulation. IEEE Trans. Computers 64(11), 3153–3166 (2015)
38.
Zurück zum Zitat Sadyrin, D., Dergachev, A., Loginov, I., Korenkov, I.N., Ilina, A.: Application of graph databases for static code analysis of web-applications. In: MICSECS (2019) Sadyrin, D., Dergachev, A., Loginov, I., Korenkov, I.N., Ilina, A.: Application of graph databases for static code analysis of web-applications. In: MICSECS (2019)
39.
Zurück zum Zitat Liu, J.: Enabling Static Program Analysis Using a Graph Database. Ph.D. Dissertation. Wright State University (2020) Liu, J.: Enabling Static Program Analysis Using a Graph Database. Ph.D. Dissertation. Wright State University (2020)
40.
Zurück zum Zitat Nair, A., Roy, A., Meinke, K.: FuncGNN: a graph neural network approach to program similarity. In: ESEM (2020) Nair, A., Roy, A., Meinke, K.: FuncGNN: a graph neural network approach to program similarity. In: ESEM (2020)
41.
Zurück zum Zitat Phan, A.V., Nguyen, M.L., Bui, L.T.: Convolutional neural networks over control flow graphs for software defect prediction. In: ICTAI (2017) Phan, A.V., Nguyen, M.L., Bui, L.T.: Convolutional neural networks over control flow graphs for software defect prediction. In: ICTAI (2017)
42.
Zurück zum Zitat Wang, A.Z., You, J., Leskovec, J.: Frequent subgraph mining by walking in order embedding space. In: GRL Workshop (2020) Wang, A.Z., You, J., Leskovec, J.: Frequent subgraph mining by walking in order embedding space. In: GRL Workshop (2020)
43.
Zurück zum Zitat Thorndike, R.L.: Who belongs in the family? Psychometrika 18(1953), 267–276 (1953)CrossRef Thorndike, R.L.: Who belongs in the family? Psychometrika 18(1953), 267–276 (1953)CrossRef
45.
Zurück zum Zitat Zhao, L., Khan, A., Luo, R.: ShaderNet: graph-based shader code analysis to accelerate GPU’s performance improvement (Demonstration). In: GRADES-NDA workshop (2022) Zhao, L., Khan, A., Luo, R.: ShaderNet: graph-based shader code analysis to accelerate GPU’s performance improvement (Demonstration). In: GRADES-NDA workshop (2022)
Metadaten
Titel
Graph Mining and Machine Learning for Shader Codes Analysis to Accelerate GPU Tuning
verfasst von
Lin Zhao
Arijit Khan
Robby Luo
Chai Kiat Yeo
Copyright-Jahr
2023
DOI
https://doi.org/10.1007/978-3-031-21127-0_35

Premium Partner