Skip to main content
Erschienen in: The Journal of Supercomputing 7/2021

02.01.2021

Enhancing the identification accuracy of deep learning object detection using natural language processing

verfasst von: Ming-Fong Tsai, Hung-Ju Tseng

Erschienen in: The Journal of Supercomputing | Ausgabe 7/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In recent years, object detection technology with artificial intelligence has been applied in many fields. This study uses a deep learning method to train an identification model to classify and browse pictures of the 600 different kinds of birds in Taiwan. To enhance the accuracy of identification and classification of these birds, we propose an automatic extraction system that can obtain training data by visiting public social media pages. We also develop mobile apps that allow users to take pictures of birds and upload them to an identification server to enable real-time identification and provide training data. These mobile apps are sent candidate bird pictures by the identification server to allow users to confirm and give feedback when the confidence level of identification is within a critical range. The bird pictures are then used as training data, and the identification model is periodically retrained to optimise the model. We also use natural language processing technology to enhance the level of confidence in image identification. The features of the birds’ appearance are described in words and candidate birds are obtained through image identification and used to readjust the adopted weight values. The proposed identification system gives a relatively high identification accuracy due to the use of deep learning object detection.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Xiao D, Shan F, Li Z, Le B, Liu X, Li X (2019) A target detection model based on improved tiny-Yolov3 under the environment of mining truck. IEEE Access 7:123757–123764CrossRef Xiao D, Shan F, Li Z, Le B, Liu X, Li X (2019) A target detection model based on improved tiny-Yolov3 under the environment of mining truck. IEEE Access 7:123757–123764CrossRef
2.
Zurück zum Zitat Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, Real-time object detection. In: IEEE conference on computer vision and pattern recognition, pp 779–788 Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, Real-time object detection. In: IEEE conference on computer vision and pattern recognition, pp 779–788
3.
Zurück zum Zitat Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: IEEE conference on computer vision and pattern recognition, pp 1–10 Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: IEEE conference on computer vision and pattern recognition, pp 1–10
5.
Zurück zum Zitat Huang S, Huang J, Huang C, Chen J, Liao I, Hsieh S, Tsai M (2019) Equipment component recognition cloud platform with information security. In: Workshop on consumer electronics, pp 1–3 Huang S, Huang J, Huang C, Chen J, Liao I, Hsieh S, Tsai M (2019) Equipment component recognition cloud platform with information security. In: Workshop on consumer electronics, pp 1–3
6.
Zurück zum Zitat Li W, Ou Y, Tseng H, Lin C, Tsai M (2019) Method for improving the quality of efficiency of bird identification. In: Conference on information technology application, pp 1–6 Li W, Ou Y, Tseng H, Lin C, Tsai M (2019) Method for improving the quality of efficiency of bird identification. In: Conference on information technology application, pp 1–6
8.
Zurück zum Zitat Lin C, Lin Y, Chang C, Chen C, Tsai M (2018) The design of automatic bird data capture systems. In: IEEE international conference on consumer electronics-Taiwan, pp 1–2 Lin C, Lin Y, Chang C, Chen C, Tsai M (2018) The design of automatic bird data capture systems. In: IEEE international conference on consumer electronics-Taiwan, pp 1–2
9.
Zurück zum Zitat Li G, Song Z, Fu Q (2018) A new method of image detection for small datasets under the framework of YOLO network. In: IEEE advanced information technology, electronic and automation control conference, pp 1031–1035 Li G, Song Z, Fu Q (2018) A new method of image detection for small datasets under the framework of YOLO network. In: IEEE advanced information technology, electronic and automation control conference, pp 1031–1035
10.
Zurück zum Zitat Lan J, Dang J, Wang Y, Wang S (2018) Pedestrian detection based on YOLO network model. In: IEEE international conference on mechatronics and automation, pp 1547–1551 Lan J, Dang J, Wang Y, Wang S (2018) Pedestrian detection based on YOLO network model. In: IEEE international conference on mechatronics and automation, pp 1547–1551
11.
Zurück zum Zitat Towhid M, Rahman M (2017) Spectrogram segmentation for bird species classification based on temporal continuity. In: IEEE international conference of computer and information technology, pp 1–4 Towhid M, Rahman M (2017) Spectrogram segmentation for bird species classification based on temporal continuity. In: IEEE international conference of computer and information technology, pp 1–4
12.
Zurück zum Zitat Roslan R, Nazery N, Jamil N, Hamzah R (2017) Color-based bird image classification using support vector machine. In: IEEE global conference on consumer electronics, pp 1–5 Roslan R, Nazery N, Jamil N, Hamzah R (2017) Color-based bird image classification using support vector machine. In: IEEE global conference on consumer electronics, pp 1–5
13.
Zurück zum Zitat Kim Y (2014) Convolutional neural networks for sentence classification. In: IEEE international conference on empirical methods in natural language processing, pp 1746–1751 Kim Y (2014) Convolutional neural networks for sentence classification. In: IEEE international conference on empirical methods in natural language processing, pp 1746–1751
14.
Zurück zum Zitat Ou Y, Lin C, Huang T, Tsai M (2020) Machine learning-based object recognition technology for bird identification system. In: IEEE international conference on consumer electronics-Taiwan, pp 1–2 Ou Y, Lin C, Huang T, Tsai M (2020) Machine learning-based object recognition technology for bird identification system. In: IEEE international conference on consumer electronics-Taiwan, pp 1–2
15.
Zurück zum Zitat Li S, Xiao T, Li H, Zhou B, Yue D, Wang X (2017) Person search with natural language description. In: IEEE conference on computer vision and pattern recognition, pp 5187–5196 Li S, Xiao T, Li H, Zhou B, Yue D, Wang X (2017) Person search with natural language description. In: IEEE conference on computer vision and pattern recognition, pp 5187–5196
16.
Zurück zum Zitat Redmon J, Farhadi A (2018) Yolov3: an incremental improvement. In: IEEE conference on computer vision and pattern recognition, pp 1–6 Redmon J, Farhadi A (2018) Yolov3: an incremental improvement. In: IEEE conference on computer vision and pattern recognition, pp 1–6
17.
Zurück zum Zitat Ren S, He K, Girshick R, Sun J (2016) Faster R-CNN: towards real-time object detection with region proposal networks, incremental improvement. In: IEEE conference on computer vision and pattern recognition, pp 1–9 Ren S, He K, Girshick R, Sun J (2016) Faster R-CNN: towards real-time object detection with region proposal networks, incremental improvement. In: IEEE conference on computer vision and pattern recognition, pp 1–9
Metadaten
Titel
Enhancing the identification accuracy of deep learning object detection using natural language processing
verfasst von
Ming-Fong Tsai
Hung-Ju Tseng
Publikationsdatum
02.01.2021
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 7/2021
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03525-2

Weitere Artikel der Ausgabe 7/2021

The Journal of Supercomputing 7/2021 Zur Ausgabe