Published in: Neural Computing and Applications 12/2023

23-07-2022 | S.I.: AI based Techniques and Applications for Intelligent IoT Systems

A maximum-entropy-attention-based convolutional neural network for image perception

Authors: Qili Chen, Ancai Zhang, Guangyuan Pan

Abstract

In recent years, deep learning has achieved significant success in image perception tasks such as enhancement, classification and object detection. However, under extreme real-world conditions, training a deep learning model often yields low accuracy, inefficient feature extraction and poor generalizability, owing to the inherently uncertain and uninterpretable nature of such models. In this paper, a maximum-entropy-attention-based convolutional neural network (MEA-CNN) framework is proposed. A maximum entropy algorithm is first used for image feature pre-extraction. An attention mechanism is then constructed by combining the extracted features with the original images. By applying this mechanism, the key areas of an image are enhanced and noisy areas can be ignored. Afterward, the processed images are passed to a region convolutional neural network, a well-known pre-trained CNN model, for further feature learning and extraction. Finally, two real-world experiments, on traffic sign recognition and road surface condition monitoring, are designed. The results show that the proposed framework achieves high testing accuracy, with improvements of 17% and 2.9% compared with some other existing methods. In addition, the features extracted by the model are more easily interpretable.
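
The abstract gives no formulas, so the following is only a minimal sketch of how the first two stages (maximum-entropy pre-extraction, then attention applied to the original image) might look. It assumes a Kapur-style maximum-entropy threshold on an 8-bit grayscale image and a simple soft mask. The function names, the `floor` parameter and the choice to treat brighter pixels as the key area are all hypothetical and are not taken from the paper.

```python
import numpy as np


def max_entropy_threshold(gray: np.ndarray) -> int:
    """Kapur-style maximum-entropy threshold for a uint8 grayscale image.

    Assumption: this stands in for the paper's "maximum entropy algorithm",
    whose exact form is not given in the abstract.
    """
    hist = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    p = hist / hist.sum()  # normalized gray-level histogram
    best_t, best_h = 0, -np.inf
    for t in range(255):
        w0 = p[: t + 1].sum()  # probability mass at or below t
        w1 = p[t + 1:].sum()   # probability mass above t
        if w0 <= 0.0 or w1 <= 0.0:
            continue  # one class is empty, entropy is undefined
        q0 = p[: t + 1] / w0
        q1 = p[t + 1:] / w1
        h0 = -np.sum(q0[q0 > 0] * np.log(q0[q0 > 0]))
        h1 = -np.sum(q1[q1 > 0] * np.log(q1[q1 > 0]))
        if h0 + h1 > best_h:  # keep the split with the largest total entropy
            best_h, best_t = h0 + h1, t
    return best_t


def apply_attention(gray: np.ndarray, floor: float = 0.2) -> np.ndarray:
    """Weight the original image by a mask derived from the threshold.

    Assumption: pixels above the threshold are the "key area" and keep
    full weight; all other pixels are attenuated to `floor`.
    """
    t = max_entropy_threshold(gray)
    mask = (gray > t).astype(np.float64)
    weights = floor + (1.0 - floor) * mask
    return weights * gray


# Toy image: two dark and two bright gray levels in equal proportion.
gray = np.array([[10, 20, 200, 210]] * 4, dtype=np.uint8)
attended = apply_attention(gray)
```

In a full pipeline the attended image would then be fed to the pre-trained CNN. That step is omitted here because the abstract does not specify the network configuration.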

Metadata
Title: A maximum-entropy-attention-based convolutional neural network for image perception
Authors: Qili Chen, Ancai Zhang, Guangyuan Pan
Publication date: 23-07-2022
Publisher: Springer London
Published in: Neural Computing and Applications, Issue 12/2023
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI: https://doi.org/10.1007/s00521-022-07564-z
