
2019 | Original Paper | Book Chapter

Image Classification Using Deep Neural Networks: Transfer Learning and the Handling of Unknown Images

Authors: Vedang Chauhan, Keyur D. Joshi, Brian Surgenor

Published in: Engineering Applications of Neural Networks

Publisher: Springer International Publishing


Abstract

Deep learning is a subset of machine learning that is powerful at recognizing patterns and is used extensively for image classification. However, training a network from scratch typically requires a large amount of data and is computationally expensive. The ImageNet database contains millions of images across many categories, collected over years of effort, and assembling a comparable database for every application is difficult and time consuming. Transfer learning is an alternative to conventional training that makes training a network much faster and easier. This research set out to evaluate the effect of transfer learning on the performance of a Deep Neural Network (DNN). A pre-trained AlexNet was selected, modified and retrained for 3 image classification applications (gears, connectors and coins), each with a modest database. With transfer learning, this approach gave 99% classification accuracy. To test the robustness of the network, unknown images were added to one of the classes, and a probability threshold was used to reinforce the accuracy. This approach succeeded in compensating for the effect of the unknown images on the accuracy.
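The probability threshold mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not describe how the authors implemented it, so everything here is an assumption for illustration: the function names `softmax` and `classify_with_rejection`, the placeholder class names, the example scores and the threshold value of 0.9 are all hypothetical. The idea shown is simply that a prediction is accepted only when the top class probability reaches the threshold, and is otherwise labeled as unknown.

```python
import math


def softmax(logits):
    """Convert raw network scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_with_rejection(logits, class_names, threshold=0.9):
    """Return (label, confidence); the label is 'unknown' below the threshold.

    The threshold of 0.9 is a hypothetical value, not one taken from the paper.
    """
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    confidence = probs[best]
    if confidence < threshold:
        return "unknown", confidence
    return class_names[best], confidence


# Hypothetical class names and scores, used only to exercise the function.
classes = ["class_a", "class_b", "class_c"]
confident = classify_with_rejection([8.0, 1.0, 0.5], classes)
ambiguous = classify_with_rejection([1.2, 1.0, 0.9], classes)
print(confident[0], ambiguous[0])
```

In this sketch the first score vector has one dominant class and is accepted, while the second spreads its probability across all three classes and is rejected as unknown. In practice the threshold would have to be tuned on validation data, since networks can assign high confidence to images they have never seen.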


Metadata
Title
Image Classification Using Deep Neural Networks: Transfer Learning and the Handling of Unknown Images
Authors
Vedang Chauhan
Keyur D. Joshi
Brian Surgenor
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-20257-6_23