Skip to main content
Top

2017 | OriginalPaper | Chapter

Optimizing CNN-Based Object Detection Algorithms on Embedded FPGA Platforms

Authors : Ruizhe Zhao, Xinyu Niu, Yajie Wu, Wayne Luk, Qiang Liu

Published in: Applied Reconfigurable Computing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Algorithms based on Convolutional Neural Network (CNN) have recently been applied to object detection applications, greatly improving their performance. However, many devices intended for these algorithms have limited computation resources and strict power consumption constraints, and are not suitable for algorithms designed for GPU workstations. This paper presents a novel method to optimise CNN-based object detection algorithms targeting embedded FPGA platforms. Given parameterised CNN hardware modules, an optimisation flow takes network architectures and resource constraints as input, and tunes hardware parameters with algorithm-specific information to explore the design space and achieve high performance. The evaluation shows that our design model accuracy is above 85% and, with optimised configuration, our design can achieve 49.6 times speed-up compared with software implementation.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bosi, B., et al.: Reconfigurable pipelined 2-D convolvers for fast digital signal processing. IEEE Trans. VLSI Syst. 7(3), 299–308 (1999)CrossRef Bosi, B., et al.: Reconfigurable pipelined 2-D convolvers for fast digital signal processing. IEEE Trans. VLSI Syst. 7(3), 299–308 (1999)CrossRef
2.
go back to reference Chakradhar, S., et al.: A dynamically configurable coprocessor for convolutional neural networks. In: ISCA (2010) Chakradhar, S., et al.: A dynamically configurable coprocessor for convolutional neural networks. In: ISCA (2010)
3.
4.
go back to reference Farabet, C., et al.: NeuFlow: a runtime-reconfigurable dataflow processor for vision. In: ECVW (2011) Farabet, C., et al.: NeuFlow: a runtime-reconfigurable dataflow processor for vision. In: ECVW (2011)
5.
7.
go back to reference Qiu, J., et al.: Going deeper with embedded FPGA platform for convolutional neural network. In: FPGA (2016) Qiu, J., et al.: Going deeper with embedded FPGA platform for convolutional neural network. In: FPGA (2016)
10.
go back to reference Ren, S., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS (2015) Ren, S., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS (2015)
11.
go back to reference Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. ImageNet Challenge (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. ImageNet Challenge (2014)
12.
go back to reference Suda, N., et al.: Scalable and modularized RTL compilation of convolutional neural networks onto FPGA. In: FPL (2016) Suda, N., et al.: Scalable and modularized RTL compilation of convolutional neural networks onto FPGA. In: FPL (2016)
13.
go back to reference Suda, N., et al.: Throughput-optimized OpenCL-based FPGA accelerator for large-scale convolutional neural networks. In: FPGA (2016) Suda, N., et al.: Throughput-optimized OpenCL-based FPGA accelerator for large-scale convolutional neural networks. In: FPGA (2016)
14.
go back to reference Szegedy, C., et al.: Going deeper with convolutions. In: CVPR (2015) Szegedy, C., et al.: Going deeper with convolutions. In: CVPR (2015)
15.
go back to reference Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). doi:10.1007/978-3-319-10590-1_53 Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). doi:10.​1007/​978-3-319-10590-1_​53
16.
go back to reference Zhang, C., et al.: Optimizing FPGA-based accelerator design for deep convolutional neural networks. In: FPGA (2015) Zhang, C., et al.: Optimizing FPGA-based accelerator design for deep convolutional neural networks. In: FPGA (2015)
Metadata
Title
Optimizing CNN-Based Object Detection Algorithms on Embedded FPGA Platforms
Authors
Ruizhe Zhao
Xinyu Niu
Yajie Wu
Wayne Luk
Qiang Liu
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-56258-2_22