Skip to main content
Erschienen in: Cluster Computing 4/2020

17.03.2020

Computer vision algorithms acceleration using graphic processors NVIDIA CUDA

verfasst von: Mouna Afif, Yahia Said, Mohamed Atri

Erschienen in: Cluster Computing | Ausgabe 4/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Using graphic processing units (GPUs) in parallel with central processing unit in order to accelerate algorithms and applications demanding extensive computational resources has been a new trend used for the last few years. In this paper, we propose a GPU-accelerated method to parallelize different Computer vision tasks. We will report on parallelism and acceleration in computer vision applications, we provide an overview about the CUDA NVIDIA GPU programming language used. After that we will dive on GPU Architecture and acceleration used for time consuming optimization. We introduce a high-speed computer vision algorithm using graphic processing unit by using the NVIDIA’s programming framework compute unified device architecture (CUDA). We realize high and significant accelerations for our computer vision algorithms and we demonstrate that using CUDA as a GPU programming language can improve Efficiency and speedups. Especially we demonstrate the efficiency of our implementations of our computer vision algorithms by speedups obtained for all our implementations especially for some tasks and for some image sizes that come up to 8061 and 5991 and 722 acceleration times.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Jubertie, S.: NVIDIA CUDA Compute Unified Device Architecture. Laboratory of Informatique Fondamentale in Orléans, Orleans (2011) Jubertie, S.: NVIDIA CUDA Compute Unified Device Architecture. Laboratory of Informatique Fondamentale in Orléans, Orleans (2011)
2.
Zurück zum Zitat NVIDIA: NVIDIA CUDA Programming Guide v2.0. NVIDIA, Santa Clara (2008) NVIDIA: NVIDIA CUDA Programming Guide v2.0. NVIDIA, Santa Clara (2008)
3.
Zurück zum Zitat Afif, M., Said, Y., Bahri, H., Atri, M.: Efficient implementation of sobel filter based on GPUs cards. In: 2016 International Image Processing, Applications and Systems (IPAS), pp. 1–4. IEEE (2016) Afif, M., Said, Y., Bahri, H., Atri, M.: Efficient implementation of sobel filter based on GPUs cards. In: 2016 International Image Processing, Applications and Systems (IPAS), pp. 1–4. IEEE (2016)
5.
Zurück zum Zitat Vidyarthi, A., Mittal, N.: Texture based feature extraction method for classification of brain tumor MRI. J. Intell. Fuzzy Syst. 32(4), 2807–2818 (2017)CrossRef Vidyarthi, A., Mittal, N.: Texture based feature extraction method for classification of brain tumor MRI. J. Intell. Fuzzy Syst. 32(4), 2807–2818 (2017)CrossRef
6.
Zurück zum Zitat Tsai, H.-Y., Zhang, H., Hung, C.-L., et al.: GPU-accelerated features extraction from magnetic resonance images. IEEE Access 5, 22634–22646 (2017)CrossRef Tsai, H.-Y., Zhang, H., Hung, C.-L., et al.: GPU-accelerated features extraction from magnetic resonance images. IEEE Access 5, 22634–22646 (2017)CrossRef
7.
Zurück zum Zitat Sahah, K.: Performance analysis of Sobel edge detection filter on GPU using CUDA and OpenCL. Int. J. Res. Appl. Sci. Eng. Technol. (IJRASET) 1(3), 22–26 (2013) Sahah, K.: Performance analysis of Sobel edge detection filter on GPU using CUDA and OpenCL. Int. J. Res. Appl. Sci. Eng. Technol. (IJRASET) 1(3), 22–26 (2013)
9.
Zurück zum Zitat Jiang, Y., Xu, Y., Liu, Y.: Performance evaluation of feature detection and matching in stereo visual odometry. Neurocomputing 120, 380–390 (2013)CrossRef Jiang, Y., Xu, Y., Liu, Y.: Performance evaluation of feature detection and matching in stereo visual odometry. Neurocomputing 120, 380–390 (2013)CrossRef
10.
Zurück zum Zitat Cheng, L., Li, M., Liu, Y., et al.: Remote sensing image matching by integrating affine invariant feature extraction and RANSAC. Comput. Electr. Eng. 38(4), 1023–1032 (2012)CrossRef Cheng, L., Li, M., Liu, Y., et al.: Remote sensing image matching by integrating affine invariant feature extraction and RANSAC. Comput. Electr. Eng. 38(4), 1023–1032 (2012)CrossRef
11.
Zurück zum Zitat Fadaifard, H., Wolberg, G., Haralick, R.: Multiscale 3D feature extraction and matching with an application to 3D face recognition. Graph. Models 75(4), 157–176 (2013)CrossRef Fadaifard, H., Wolberg, G., Haralick, R.: Multiscale 3D feature extraction and matching with an application to 3D face recognition. Graph. Models 75(4), 157–176 (2013)CrossRef
12.
Zurück zum Zitat Alsadik, B., Remondino, F., Menna, F., et al.: Robust extraction of image correspondences exploiting the image scene geometry and approximate camera orientation. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 5, W1 (2013)CrossRef Alsadik, B., Remondino, F., Menna, F., et al.: Robust extraction of image correspondences exploiting the image scene geometry and approximate camera orientation. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 5, W1 (2013)CrossRef
13.
Zurück zum Zitat Fangi, G., Nardinocchi, C.: Photogrammetric processing of spherical panoramas. Photogramm. Rec. 28(143), 293–311 (2013)CrossRef Fangi, G., Nardinocchi, C.: Photogrammetric processing of spherical panoramas. Photogramm. Rec. 28(143), 293–311 (2013)CrossRef
14.
Zurück zum Zitat Fassold, H.: Computer vision on the GPU—tools, algorithms and frameworks. In: 2016 IEEE 20th Jubilee International Conference on Intelligent Engineering Systems (INES), pp. 245–250. IEEE (2016) Fassold, H.: Computer vision on the GPU—tools, algorithms and frameworks. In: 2016 IEEE 20th Jubilee International Conference on Intelligent Engineering Systems (INES), pp. 245–250. IEEE (2016)
15.
Zurück zum Zitat Fassold, H., Rosner, J., Schallauer, P., Bailer, W.: Realtime KLT feature point tracking for high definition video. In: Proceedings of GravisMa Workshop (2009) Fassold, H., Rosner, J., Schallauer, P., Bailer, W.: Realtime KLT feature point tracking for high definition video. In: Proceedings of GravisMa Workshop (2009)
16.
Zurück zum Zitat Fassold, H., Rosner, J.: A real-time GPU implementation of the SIFT algorithm for large-scale video analysis tasks. In: Proceedings of the Real-Time Image and Video Processing (2015) Fassold, H., Rosner, J.: A real-time GPU implementation of the SIFT algorithm for large-scale video analysis tasks. In: Proceedings of the Real-Time Image and Video Processing (2015)
17.
Zurück zum Zitat Werlberger, M., Trobin, W., Pock, T., Wedel, A., Cremers, D., Bischof, H.: Anisotropic Huber-L1 optical flow. In: Proceedings of the British Machine Vision Conference (BMVC), London, UK (2009) Werlberger, M., Trobin, W., Pock, T., Wedel, A., Cremers, D., Bischof, H.: Anisotropic Huber-L1 optical flow. In: Proceedings of the British Machine Vision Conference (BMVC), London, UK (2009)
18.
Zurück zum Zitat Harris, M., Sengupta, S., Owens, J.D., et al.: Parallel prefix sum (scan) with CUDA. GPU Gems 3. In: Laser Assisted Microtechnology. Springer (2007) Harris, M., Sengupta, S., Owens, J.D., et al.: Parallel prefix sum (scan) with CUDA. GPU Gems 3. In: Laser Assisted Microtechnology. Springer (2007)
19.
Zurück zum Zitat Afif, M., Said, Y., Atri, M.: Efficient implementation of integral image algorithm on NVIDIA CUDA. In: 2018 International Conference on Advanced Systems and Electric Technologies (IC_ASET) (pp. 1–5). IEEE (2018) Afif, M., Said, Y., Atri, M.: Efficient implementation of integral image algorithm on NVIDIA CUDA. In: 2018 International Conference on Advanced Systems and Electric Technologies (IC_ASET) (pp. 1–5). IEEE (2018)
20.
Zurück zum Zitat Chouchene, M., Sayadi, F.E., Atri, M., Tourki, R.: Integral image computation on GPU. In: 2013 10th International Multi-Conference on Systems, Signals & Devices (SSD) (pp. 1–4). IEEE (2013) Chouchene, M., Sayadi, F.E., Atri, M., Tourki, R.: Integral image computation on GPU. In: 2013 10th International Multi-Conference on Systems, Signals & Devices (SSD) (pp. 1–4). IEEE (2013)
21.
Zurück zum Zitat Ehsan, S., Clark, A., Rehman, N., et al.: Integral images: efficient algorithms for their computation and storage in resource-constrained embedded vision systems. Sensors 15(7), 16804–16830 (2015)CrossRef Ehsan, S., Clark, A., Rehman, N., et al.: Integral images: efficient algorithms for their computation and storage in resource-constrained embedded vision systems. Sensors 15(7), 16804–16830 (2015)CrossRef
22.
Zurück zum Zitat Bozkurt, F., Yaganoglu, M., Gunay, F.B.: Effective Gaussian blurring process on graphics processing unit with CUDA. Int. J. Mach. Learn. Comput. 5, 57 (2015)CrossRef Bozkurt, F., Yaganoglu, M., Gunay, F.B.: Effective Gaussian blurring process on graphics processing unit with CUDA. Int. J. Mach. Learn. Comput. 5, 57 (2015)CrossRef
23.
Zurück zum Zitat Daga, B., Bhute, A., Ghatol, A.: Implementation of parallel image processing using NVIDIA GPU framework. In: International Conference on Advances in Computing, Communication and Control, pp. 457–464. Springer, Berlin (2011) Daga, B., Bhute, A., Ghatol, A.: Implementation of parallel image processing using NVIDIA GPU framework. In: International Conference on Advances in Computing, Communication and Control, pp. 457–464. Springer, Berlin (2011)
24.
Zurück zum Zitat Dagum, L., Menon, R.: OpenMP: an industry-standard API for shared-memory programming. Computing in Science & Engineering 1, 46–55 (1998) Dagum, L., Menon, R.: OpenMP: an industry-standard API for shared-memory programming. Computing in Science & Engineering 1, 46–55 (1998)
25.
Zurück zum Zitat Chouchene, M., et al.: Image processing application on graphics processors. In: IEEE Conference on Computer Vision and Pattern Recognition International Journal of Image Processing (IJIP), vol. 8, No. 3 (2014) Chouchene, M., et al.: Image processing application on graphics processors. In: IEEE Conference on Computer Vision and Pattern Recognition International Journal of Image Processing (IJIP), vol. 8, No. 3 (2014)
26.
Zurück zum Zitat Dore, A., Lasrado, S.: Performance analysis of Sobel edge filter on heterogeneous system using OpenCL. Int. J. Res. Eng. Technol. (IJRET) 3(3), 53 (2014) Dore, A., Lasrado, S.: Performance analysis of Sobel edge filter on heterogeneous system using OpenCL. Int. J. Res. Eng. Technol. (IJRET) 3(3), 53 (2014)
27.
Zurück zum Zitat Tse, J.J.: Image processing with CUDA. University of Nevada, Theses (2012) Tse, J.J.: Image processing with CUDA. University of Nevada, Theses (2012)
Metadaten
Titel
Computer vision algorithms acceleration using graphic processors NVIDIA CUDA
verfasst von
Mouna Afif
Yahia Said
Mohamed Atri
Publikationsdatum
17.03.2020
Verlag
Springer US
Erschienen in
Cluster Computing / Ausgabe 4/2020
Print ISSN: 1386-7857
Elektronische ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-020-03090-6

Weitere Artikel der Ausgabe 4/2020

Cluster Computing 4/2020 Zur Ausgabe

Premium Partner