nach oben

The Journal of Supercomputing

Erschienen in:

21.05.2023

Real-time approximate and combined 2D convolvers for FPGA-based image processing

verfasst von: Ali Ramezanzad, Mehran Rezaei, Hooman Nikmehr, Mahdi Kalbasi

Erschienen in: The Journal of Supercomputing | Ausgabe 16/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Convolution widely has been used as the main part of the improvement in digital image processing applications. In convolutional computations, a large number of memory accesses and a huge amount of computations challenge its performance. Many of the related proposed convolvers are based on exact computations. Although exact convolvers keep the accuracy of the convolution operation at the top level, sometimes by missing a negligible amount of accuracy, the performance can be improved. Approximate computing is a new technique for solving computation overhead problems. In this paper, approximate 2D convolvers are presented which minimize the memory access rate and computations by a special factor of multiply-and-accumulate (MAC) terms. On the other hand, to preserve the flexibility for supporting different required accuracy, the proposed approximate convolvers are combined with the exact designs with real-time pre-processing stages by exploiting innovative methods which manage the hardware overhead. In comparison with conventional convolvers, the proposed designs improve the number of active resources which causes a significant reduction in power consumption. For 3 × 3 kernel size, the evaluation results on the Xilinx Virtex-7 (XC7V2000t) FPGA device show 34% and 20% power optimization of the proposed approximate and combined convolvers, respectively, in comparison with exact convolver (EC). Also, this improvement grows by increasing the kernel size. Finally, a comparison based on RMSE and PSNR for different sample images and filters reveals that the error rate and image quality reduction are acceptable for many real-time image processing applications.

Vorheriger Artikel Identify spatio-temporal properties of network traffic by model checking

Nächster Artikel CALYOLOv4: lightweight YOLOv4 target detection based on coordinated attention

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Masters BR, Gonzalez RC, Woods R (2009) Digital image processing. J Biomed Opt 14(2):029901CrossRef

Zhao Y, Wang M, Yang G, Chan JCW (2018) FOV Expansion of Bioinspired Multiband Polarimetric Imagers With Convolutional Neural Networks. IEEE Photonics J 10(1):1–14

Xu Q, Mytkowicz T, Kim NS (2016) Approximate Computing: A Survey. IEEE Design & Test 33(1):8–22CrossRef

Kalbasi M, Nikmehr H (2019) A fine-grained pipelined 2-D convolver for high-performance applications. IEEE Trans Circuits Syst II Express Briefs 66(1):146–150

Licciardo GD, Cappetta C, Benedetto LD (2016) FPGA optimization of convolution-based 2d filtering processor for image processing. In: 2016 8th Computer Science and Electronic Engineering (CEEC):180–185

Cabello F, Len J, Iano Y, Arthur R (2015) Implementation of a fixed-point 2d Gaussian filter for image processing based on FPGA. In: 2015 Signal processing: Algorithms, Architectures, Arrangements, and Applications (SPA) 28–33

Chen K, Fabrizio L, Jie H (2016) Design and analysis of an approximate 2D convolver. In: 2016 IEEE international symposium on defect and fault tolerance in VLSI and nanotechnology systems (DFT). IEEE, 2016

Chen K et al (2018) Efficient implementations of reduced precision redundancy (RPR) multiply and accumulate (MAC). IEEE Trans Comput 68(5):784–790MathSciNetCrossRefMATH

Sborz G, Felipe V, Cesar Z (2020) Architectural exploration of an FPGA-based hardware accelerator for the gaussian filter using approximate computing. In: Anais Estendidos do X Simpósio Brasileiro de Engenharia de Sistemas Computacionais, SBC

10.

Menaka R, Janarthanan S, Deeba K (2020) FPGA implementation of low power and high speed image edge detection algorithm. Microprocess Microsyst 75:103053CrossRef

11.

Sangeetha D, Deepa P (2019) Fpga implementation of cost-effective robust Canny edge detection algorithm. J Real Time Image Process 16(4):957–970CrossRef

12.

Zhang H, Xia M, Hu G (2007) A multi-window partial buffering scheme for FPGA-based 2-D convolvers. IEEE Trans Circuits Syst II Express Briefs 54(2):200–204CrossRef

13.

Bosi B, Bois G, Savaria Y (1999) Reconfigurable pipelined 2-D convolvers for fast digital signal processing. IEEE Trans Very Larg Scale Integration (VLSI) Syst 7(3):299–308CrossRef

14.

Sunwoo MH, Oh SK (2004) A multiplier-less 2-D convolver chip for real-time image processing. J VLSI Signal Process Syst Signal, Image Video Technol 38(1):63–71. https://doi.org/10.1023/B:VLSI.0000028534.35761.a8CrossRef

15.

Toledo-Moreo FJ, Martnez-Alvarez JJ, Garrigs-Guerrero J, Ferrndez-Vicente JM (2012) FPGA-based architecture for the real-time computation of 2-d convolution with large kernel size. J Syst Archit 58(8):277–285CrossRef

16.

Zhang MZ, Ngo HT, Asari VK (2007) Multiplier-less VLSI architecture for real-time computation of multi-dimensional convolution. Microprocess Microsyst 31(1):25–37CrossRef

17.

Zhang MZ, Asari VK (2007) An efficient multiplier-less architecture for 2-D convolution with quadrant symmetric kernels. Integr VLSI J 40(4): 490 – 502. System-Level Interconnect Prediction. [Online] Available: http://www.sciencedirect.com/science/article/pii/S0167926006000666

18.

Ma ZB, Yang Y, Liu YX, Bharath AA (2016) Recurrently decomposable 2-D convolvers for FPGA-based digital image processing. IEEE Trans Circuits Syst II Express Briefs 63(10):979–983

19.

Fons F, Fons M, Cant E (2011) Run-time self-reconfigurable 2D convolver for adaptive image processing. Microelectron J 42(1):204–217CrossRef

20.

Yadav DK, Gupta AK, Mishra AK (2008) A fast and area efficient 2-d convolver for real time image processing. In: TENCON 2008 – 2008 IEEE Region 10 Conference. 1–6

21.

Artix-7 FPGAs Data Sheet, Xilinx, Inc., 2017, v1.22

22.

Kalbasi M, Nikmehr H (2020) A Classified and Comparative Study of 2-D Convolvers. In: 2020 International Conference on Machine Vision and Image Processing (MVIP). 1-5.https://doi.org/10.1109/MVIP49855.2020.9116874

23.

Dehghani A, Kavari A, Kalbasi M et al (2021) A new approach for design of an efficient FPGA-based reconfigurable convolver for image processing. J Supercomput. https://doi.org/10.1007/s11227-021-03963-6CrossRef

24.

Salomon D (2004) Data compression: the complete reference. Springer, NorthridgeMATH

25.

Wang Y, Lin J, Wang Z (2017) An energy-efficient architecture for binary weight convolutional neural networks. IEEE Trans Very Large Scale Integration (VLSI) Syst 26(2):280–293CrossRef

26.

Ahmed HO, Maged G, Mohamed D (2018) Concurrent MAC unit design using VHDL for deep learning networks on FPGA. In: 2018 IEEE Symposium on Computer Applications & Industrial Electronics (ISCAIE). IEEE

27.

Solovyev RA, et al. (2018) FPGA implementation of convolutional neural networks with fixed-point calculations. arXiv preprint arXiv:1808.09945

28.

Garland J, Gregg D (2018) Low complexity multiply-accumulate units for convolutional neural networks with weight-sharing. ACM Trans Archit Code Optim (TACO) 15(3):1–24CrossRef

Titel: Real-time approximate and combined 2D convolvers for FPGA-based image processing
verfasst von: Ali Ramezanzad
Mehran Rezaei
Hooman Nikmehr
Mahdi Kalbasi
Publikationsdatum: 21.05.2023
Verlag: Springer US
Erschienen in: The Journal of Supercomputing / Ausgabe 16/2023
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI: https://doi.org/10.1007/s11227-023-05377-y

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Weitere Artikel der Ausgabe 16/2023

SiMAIM: identifying sockpuppets and puppetmasters on a single forum-oriented social media site

Survivable SFC deployment method based on federated learning in multi-domain network

Energy-aware workflow scheduling in fog computing using a hybrid chaotic algorithm

Sketching the Krylov subspace: faster computation of the entire ridge regularization path

KG-MFEND: an efficient knowledge graph-based model for multi-domain fake news detection

An empirical study of major page faults for failure diagnosis in cluster systems

Premium Partner