
2021 | Original Paper | Book Chapter

An Evaluation on Effectiveness of Deep Learning in Detecting Small Object Within a Large Image

Authors: Nazirah Hassan, Kong Wai Ming, Choo Keng Wah

Published in: 17th International Conference on Biomedical Engineering

Publisher: Springer International Publishing


Abstract

Multiple deep learning (DL) algorithms have been developed recently and achieve very high accuracy in object detection. However, detecting small objects within a large image (e.g. above 2000 by 2000 pixels in resolution) remains challenging. Various methods using different detection algorithms have been proposed for small-object detection, but these approaches require high computational resources and are unsuitable for the edge computing devices used in practical applications such as pedestrian traffic light detection. We explored two detection methods to evaluate which is better at detecting small objects. The first is a two-part procedure: an HSV-based image-processing step followed by R-CNN detection that uses the Edge Boxes algorithm to extract region proposals. The second is Faster R-CNN object detection with instance segmentation, termed Mask R-CNN. A total of 4000 street images of Singapore containing pedestrian traffic lights were used as training data; their dimensions range from 1200 by 900 to 4000 by 3000. The small object to be detected is the green or red man within the pedestrian traffic light. We evaluated the methods on training time, detection time, accuracy, and suitability for deployment on edge computing devices. The results show that the HSV + R-CNN approach is preferred, as it achieves an accuracy of 95.5% and can be deployed on edge devices.
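The HSV colour-filtering stage of the first method can be sketched as below. This is an illustrative reconstruction, not the authors' code: the hue ranges, saturation/value thresholds, and the toy image are assumptions, and in the paper's pipeline the filtered regions would feed an Edge Boxes + R-CNN detector rather than the simple bounding-box reduction shown here.

```python
import numpy as np

def hsv_mask(hsv, hue_range, s_min=0.4, v_min=0.4):
    """Boolean mask of pixels whose hue (degrees, 0-360) lies in hue_range
    and whose saturation/value exceed minimal thresholds.
    Handles hue wrap-around (e.g. red: (330, 30))."""
    h, s, v = hsv[..., 0], hsv[..., 1], hsv[..., 2]
    lo, hi = hue_range
    in_hue = (h >= lo) & (h <= hi) if lo <= hi else (h >= lo) | (h <= hi)
    return in_hue & (s >= s_min) & (v >= v_min)

def mask_bbox(mask):
    """Tight bounding box (x0, y0, x1, y1) of the True pixels, or None."""
    ys, xs = np.nonzero(mask)
    if xs.size == 0:
        return None
    return int(xs.min()), int(ys.min()), int(xs.max()), int(ys.max())

# Toy 6x6 HSV image: a 2x2 "green man" patch (hue ~120 degrees)
# on a dark, unsaturated background.
img = np.zeros((6, 6, 3))
img[2:4, 3:5] = (120.0, 0.9, 0.9)

green = hsv_mask(img, (90, 150))   # candidate green-light pixels
print(mask_bbox(green))            # -> (3, 2, 4, 3)
```

Thresholding in HSV rather than RGB is what makes the cheap pre-filter viable on edge devices: the coloured signal aspect is isolated in the hue channel, so a single range comparison shrinks a multi-megapixel image to a handful of candidate regions before any network is run.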


Metadata
Copyright year
2021
DOI
https://doi.org/10.1007/978-3-030-62045-5_17
