Skip to main content

2021 | OriginalPaper | Buchkapitel

Facial Expression Recognition via ResNet-18

verfasst von : Bin Li, Runda Li, Dimas Lima

Erschienen in: Multimedia Technology and Enhanced Learning

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

As an important part of human-computer interaction, facial expression recognition has become a hot research topic in the fields of computer vision, pattern recognition, artificial intelligence, etc., and plays an important role in our daily life. With the development of deep learning and convolutional neural network, the research of facial expression recognition has also made great progress. Moreover, in the current face emotion recognition research, there are problems such as poor generalization ability of network model. The extraction of traditional facial expression recognition features is complex and the effect is not ideal. In order to improve the effect of facial expression recognition, we propose a feature extraction method for deep residual network, and use deep residual network ResNet-18 to extract the features of the data set. Through the experimental simulation of the specified data set, it can be proved that this model is superior to state-of-the-art methods model.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Oji-Mmuo, C.N., Speer, R.R., Gardner, F.C., Marvin, M.M., Hozella, A.C., Doheny, K.K.: Prenatal opioid exposure heightens sympathetic arousal and facial expressions of pain/distress in term neonates at 24–48 hours post birth. J. Maternal-Fetal Neonatal Med. 33, 3879–3886 (2020)CrossRef Oji-Mmuo, C.N., Speer, R.R., Gardner, F.C., Marvin, M.M., Hozella, A.C., Doheny, K.K.: Prenatal opioid exposure heightens sympathetic arousal and facial expressions of pain/distress in term neonates at 24–48 hours post birth. J. Maternal-Fetal Neonatal Med. 33, 3879–3886 (2020)CrossRef
2.
Zurück zum Zitat Ali, H., Hariharan, M., Yaacob, S., Adom, A.H.: Facial Emotion recognition based on higher-order spectra using support vector machines. J. Med. Imaging Health Inf. 5, 1272–1277 (2015)CrossRef Ali, H., Hariharan, M., Yaacob, S., Adom, A.H.: Facial Emotion recognition based on higher-order spectra using support vector machines. J. Med. Imaging Health Inf. 5, 1272–1277 (2015)CrossRef
3.
Zurück zum Zitat Evans, F.: Haar wavelet transform based facial emotion recognition. Adv. Comput. Sci. Res. 61, 342–346 (2017) Evans, F.: Haar wavelet transform based facial emotion recognition. Adv. Comput. Sci. Res. 61, 342–346 (2017)
4.
Zurück zum Zitat Lu, H.M.: Facial emotion recognition based on biorthogonal wavelet entropy, fuzzy support vector machine, and stratified cross validation. IEEE Access 4, 8375–8385 (2016)CrossRef Lu, H.M.: Facial emotion recognition based on biorthogonal wavelet entropy, fuzzy support vector machine, and stratified cross validation. IEEE Access 4, 8375–8385 (2016)CrossRef
5.
Zurück zum Zitat Phillips, P.: Intelligent facial emotion recognition based on stationary wavelet entropy and Jaya algorithm. Neurocomputing 272, 668–676 (2018)CrossRef Phillips, P.: Intelligent facial emotion recognition based on stationary wavelet entropy and Jaya algorithm. Neurocomputing 272, 668–676 (2018)CrossRef
6.
Zurück zum Zitat Wang, S.-H., Yang, W., Dong, Z., Phillips, P., Zhang, Y.-D.: Facial emotion recognition via discrete wavelet transform, principal component analysis, and cat swarm optimization. In: Sun, Yi., Lu, H., Zhang, L., Yang, J., Huang, H. (eds.) IScIDE 2017. LNCS, vol. 10559, pp. 203–214. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67777-4_18 CrossRef Wang, S.-H., Yang, W., Dong, Z., Phillips, P., Zhang, Y.-D.: Facial emotion recognition via discrete wavelet transform, principal component analysis, and cat swarm optimization. In: Sun, Yi., Lu, H., Zhang, L., Yang, J., Huang, H. (eds.) IScIDE 2017. LNCS, vol. 10559, pp. 203–214. Springer, Cham (2017). https://​doi.​org/​10.​1007/​978-3-319-67777-4_​18 CrossRef
7.
Zurück zum Zitat Li, X.: Facial emotion recognition via stationary wavelet entropy and biogeography-based optimization. EAI Endorsed Trans. e-Learn. 6, Article ID: e4 (2020) Li, X.: Facial emotion recognition via stationary wavelet entropy and biogeography-based optimization. EAI Endorsed Trans. e-Learn. 6, Article ID: e4 (2020)
8.
Zurück zum Zitat Lv, Y.-D.: Alcoholism detection by data augmentation and convolutional neural network with stochastic pooling. J. Med. Syst. 42, Article ID: 2 (2018) Lv, Y.-D.: Alcoholism detection by data augmentation and convolutional neural network with stochastic pooling. J. Med. Syst. 42, Article ID: 2 (2018)
9.
Zurück zum Zitat Tang, C.: Twelve-layer deep convolutional neural network with stochastic pooling for tea category classification on GPU platform. Multimed. Tools Appl. 77, 22821–22839 (2018) Tang, C.: Twelve-layer deep convolutional neural network with stochastic pooling for tea category classification on GPU platform. Multimed. Tools Appl. 77, 22821–22839 (2018)
10.
Zurück zum Zitat Pan, C.: Abnormal breast identification by nine-layer convolutional neural network with parametric rectified linear unit and rank-based stochastic pooling. J. Comput. Sci. 27, 57–68 (2018)CrossRef Pan, C.: Abnormal breast identification by nine-layer convolutional neural network with parametric rectified linear unit and rank-based stochastic pooling. J. Comput. Sci. 27, 57–68 (2018)CrossRef
11.
Zurück zum Zitat Hasebe, T., Ueda, Y.: Unimodality for free multiplicative convolution with free normal distributions on the unit circle. J. Oper. Theory 85, 21–43 (2021)MathSciNet Hasebe, T., Ueda, Y.: Unimodality for free multiplicative convolution with free normal distributions on the unit circle. J. Oper. Theory 85, 21–43 (2021)MathSciNet
12.
Zurück zum Zitat Belinschi, S.T., Bercovici, H., Liu, W.H.: The atoms of operator-valued free convolutions. J. Oper. Theory 85, 303–320 (2021)MathSciNet Belinschi, S.T., Bercovici, H., Liu, W.H.: The atoms of operator-valued free convolutions. J. Oper. Theory 85, 303–320 (2021)MathSciNet
13.
Zurück zum Zitat Kumar, S., Mahadevappa, M., Dutta, P.K.: Lensless in-line holographic microscopy with light source of low spatio-temporal coherence. IEEE J. Sel. Top. Quantum Electron. 27, 8, Article ID: 6800608 (2021) Kumar, S., Mahadevappa, M., Dutta, P.K.: Lensless in-line holographic microscopy with light source of low spatio-temporal coherence. IEEE J. Sel. Top. Quantum Electron. 27, 8, Article ID: 6800608 (2021)
14.
Zurück zum Zitat Fujioka, T., Yashima, Y., Oyama, J., Mori, M., Kubota, K., Katsuta, L., et al.: Deep-learning approach with convolutional neural network for classification of maximum intensity projections of dynamic contrast-enhanced breast magnetic resonance imaging. Magn. Reson. Imaging 75, 1–8 (2021)CrossRef Fujioka, T., Yashima, Y., Oyama, J., Mori, M., Kubota, K., Katsuta, L., et al.: Deep-learning approach with convolutional neural network for classification of maximum intensity projections of dynamic contrast-enhanced breast magnetic resonance imaging. Magn. Reson. Imaging 75, 1–8 (2021)CrossRef
15.
Zurück zum Zitat Hou, X.-X.: Seven-layer deep neural network based on sparse autoencoder for voxelwise detection of cerebral microbleed. Multimed. Tools Appl. 77, 10521–10538 (2018) Hou, X.-X.: Seven-layer deep neural network based on sparse autoencoder for voxelwise detection of cerebral microbleed. Multimed. Tools Appl. 77, 10521–10538 (2018)
16.
Zurück zum Zitat Pan, C.: Multiple sclerosis identification by convolutional neural network with dropout and parametric ReLU. J. Comput. Sci. 28, 1–10 (2018)MathSciNetCrossRef Pan, C.: Multiple sclerosis identification by convolutional neural network with dropout and parametric ReLU. J. Comput. Sci. 28, 1–10 (2018)MathSciNetCrossRef
17.
Zurück zum Zitat Bercovici, H., Dykema, K., Nica, A.: Dan-virgil voiculescu at seventy. J. Oper. Theory 85, 5–20 (2021)MathSciNet Bercovici, H., Dykema, K., Nica, A.: Dan-virgil voiculescu at seventy. J. Oper. Theory 85, 5–20 (2021)MathSciNet
18.
Zurück zum Zitat Egger, H., Schmidt, K., Shashkov, V.: Multistep and Runge-Kutta convolution quadrature methods for coupled dynamical systems. J. Comput. Appl. Math. 387, 14, Article ID: 112618 (2021) Egger, H., Schmidt, K., Shashkov, V.: Multistep and Runge-Kutta convolution quadrature methods for coupled dynamical systems. J. Comput. Appl. Math. 387, 14, Article ID: 112618 (2021)
19.
Zurück zum Zitat Erbay, H.A., Erbay, S., Erkip, A.: A semi-discrete numerical method for convolution-type unidirectional wave equations. J. Comput. Appl. Math. 387, 13, Article ID: 112496 (2021) Erbay, H.A., Erbay, S., Erkip, A.: A semi-discrete numerical method for convolution-type unidirectional wave equations. J. Comput. Appl. Math. 387, 13, Article ID: 112496 (2021)
20.
Zurück zum Zitat Katsagounos, I., Thomakos, D.D., Litsiou, K., Nikolopoulos, K.: Superforecasting reality check: evidence from a small pool of experts and expedited identification. Eur. J. Oper. Res. 289, 107–117 (2021)MATHCrossRef Katsagounos, I., Thomakos, D.D., Litsiou, K., Nikolopoulos, K.: Superforecasting reality check: evidence from a small pool of experts and expedited identification. Eur. J. Oper. Res. 289, 107–117 (2021)MATHCrossRef
21.
Zurück zum Zitat Huang, C.: Multiple sclerosis identification by 14-layer convolutional neural network with batch normalization, dropout, and stochastic pooling. Front. Neurosci. 12, Article ID: 818 (2018) Huang, C.: Multiple sclerosis identification by 14-layer convolutional neural network with batch normalization, dropout, and stochastic pooling. Front. Neurosci. 12, Article ID: 818 (2018)
22.
Zurück zum Zitat Zhao, G.: Polarimetric synthetic aperture radar image segmentation by convolutional neural network using graphical processing units. J. Real-Time Image Proc. 15, 631–642 (2018) Zhao, G.: Polarimetric synthetic aperture radar image segmentation by convolutional neural network using graphical processing units. J. Real-Time Image Proc. 15, 631–642 (2018)
23.
Zurück zum Zitat Muhammad, K.: Image based fruit category classification by 13-layer deep convolutional neural network and data augmentation. Multimed. Tools Appl. 78, 3613–3632 (2019) Muhammad, K.: Image based fruit category classification by 13-layer deep convolutional neural network and data augmentation. Multimed. Tools Appl. 78, 3613–3632 (2019)
24.
Zurück zum Zitat Wang, S.-H., Sun, J.: Cerebral micro-bleeding identification based on a nine-layer convolutional neural network with stochastic pooling. Concurr. Comput. Pract. Exp. 32, e5130 (2020) Wang, S.-H., Sun, J.: Cerebral micro-bleeding identification based on a nine-layer convolutional neural network with stochastic pooling. Concurr. Comput. Pract. Exp. 32, e5130 (2020)
25.
Zurück zum Zitat Sangaiah, A.K.: Alcoholism identification via convolutional neural network based on parametric ReLU, dropout, and batch normalization. Neural Comput. Appl. 32, 665–680 (2020) Sangaiah, A.K.: Alcoholism identification via convolutional neural network based on parametric ReLU, dropout, and batch normalization. Neural Comput. Appl. 32, 665–680 (2020)
26.
Zurück zum Zitat Choi, S.H., Jung, S.H.: Stable acquisition of fine-grained segments using batch normalization and focal loss with L1 regularization in U-Net structure. Int. J. Fuzzy Logic Intell. Syst. 20, 59–68 (2020)CrossRef Choi, S.H., Jung, S.H.: Stable acquisition of fine-grained segments using batch normalization and focal loss with L1 regularization in U-Net structure. Int. J. Fuzzy Logic Intell. Syst. 20, 59–68 (2020)CrossRef
27.
Zurück zum Zitat Wang, S.-H.: DenseNet-201-based deep neural network with composite learning factor and precomputation for multiple sclerosis classification. ACM Trans. Multimed. Comput. Commun. Appl. 16, Article no. 60 (2020) Wang, S.-H.: DenseNet-201-based deep neural network with composite learning factor and precomputation for multiple sclerosis classification. ACM Trans. Multimed. Comput. Commun. Appl. 16, Article no. 60 (2020)
28.
Zurück zum Zitat Olimov, B., Karshiev, S., Jang, E., Din, S., Paul, A., Kim, J.: Weight initialization based-rectified linear unit activation function to improve the performance of a convolutional neural network model. Concurr. Comput. Pract. Exp. 11 (2021). (Article; Early Access). https://doi.org/10.1002/cpe.6143 Olimov, B., Karshiev, S., Jang, E., Din, S., Paul, A., Kim, J.: Weight initialization based-rectified linear unit activation function to improve the performance of a convolutional neural network model. Concurr. Comput. Pract. Exp. 11 (2021). (Article; Early Access). https://​doi.​org/​10.​1002/​cpe.​6143
29.
Zurück zum Zitat Zhang, Y.-D.: Advances in multimodal data fusion in neuroimaging: overview, challenges, and novel orientation. Inf. Fusion 64, 149–187 (2020)CrossRef Zhang, Y.-D.: Advances in multimodal data fusion in neuroimaging: overview, challenges, and novel orientation. Inf. Fusion 64, 149–187 (2020)CrossRef
30.
Zurück zum Zitat Wang, S.-H.: Covid-19 classification by FGCNet with deep feature fusion from graph convolutional network and convolutional neural network. Inf. Fusion 67, 208–229 (2021)CrossRef Wang, S.-H.: Covid-19 classification by FGCNet with deep feature fusion from graph convolutional network and convolutional neural network. Inf. Fusion 67, 208–229 (2021)CrossRef
32.
Zurück zum Zitat Kawahara, D., Tang, X.Y., Lee, C.K., Nagata, Y., Watanabe, Y.: Predicting the local response of metastatic brain tumor to gamma knife radiosurgery by radiomics with a machine learning method. Front. Oncol. 10, 8, Article ID: 569461 (2021) Kawahara, D., Tang, X.Y., Lee, C.K., Nagata, Y., Watanabe, Y.: Predicting the local response of metastatic brain tumor to gamma knife radiosurgery by radiomics with a machine learning method. Front. Oncol. 10, 8, Article ID: 569461 (2021)
33.
Zurück zum Zitat Dubey, S.R., Chakraborty, S.: Average biased ReLU based CNN descriptor for improved face retrieval. Multimed. Tools Appl., 26 (2021) Dubey, S.R., Chakraborty, S.: Average biased ReLU based CNN descriptor for improved face retrieval. Multimed. Tools Appl., 26 (2021)
34.
Zurück zum Zitat Yamaguchi, M., Iwamoto, G., Nishimura, Y., Tamukoh, H., Morie, T.: An energy-efficient time-domain analog CMOS BinaryConnect neural network processor based on a pulse-width modulation approach. IEEE Access 9, 2644–2654 (2021)CrossRef Yamaguchi, M., Iwamoto, G., Nishimura, Y., Tamukoh, H., Morie, T.: An energy-efficient time-domain analog CMOS BinaryConnect neural network processor based on a pulse-width modulation approach. IEEE Access 9, 2644–2654 (2021)CrossRef
35.
Zurück zum Zitat Farrell, M.H., Liang, T.Y., Misra, S.: Deep neural networks for estimation and inference. Econometrica 89, 181–213 (2021)MathSciNetCrossRef Farrell, M.H., Liang, T.Y., Misra, S.: Deep neural networks for estimation and inference. Econometrica 89, 181–213 (2021)MathSciNetCrossRef
36.
Zurück zum Zitat Tripathi, D., Edla, D.R., Kuppili, V., Bablani, A.: Evolutionary extreme learning machine with novel activation function for credit scoring. Eng. Appl. Artif. Intell. 96, 10, Article ID: 103980 (2020) Tripathi, D., Edla, D.R., Kuppili, V., Bablani, A.: Evolutionary extreme learning machine with novel activation function for credit scoring. Eng. Appl. Artif. Intell. 96, 10, Article ID: 103980 (2020)
37.
Zurück zum Zitat Satapathy, S.C.: A five-layer deep convolutional neural network with stochastic pooling for chest CT-based COVID-19 diagnosis. Mach. Vis. Appl. 32, Article ID: 14 (2021) Satapathy, S.C.: A five-layer deep convolutional neural network with stochastic pooling for chest CT-based COVID-19 diagnosis. Mach. Vis. Appl. 32, Article ID: 14 (2021)
38.
Zurück zum Zitat Moon, S.: ReLU network with bounded width is a universal approximator in view of an approximate identity. Appl. Sci. 11, 11, Article ID: 427 (2021) Moon, S.: ReLU network with bounded width is a universal approximator in view of an approximate identity. Appl. Sci. 11, 11, Article ID: 427 (2021)
39.
Zurück zum Zitat Bernardo, P.P., Gerum, C., Frischknecht, A., Lubeck, K., Bringmann, O.: UltraTrail: a configurable ultralow-power TC-ResNet AI accelerator for efficient keyword spotting. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 39, 4240–4251 (2020)CrossRef Bernardo, P.P., Gerum, C., Frischknecht, A., Lubeck, K., Bringmann, O.: UltraTrail: a configurable ultralow-power TC-ResNet AI accelerator for efficient keyword spotting. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 39, 4240–4251 (2020)CrossRef
40.
Zurück zum Zitat Alotaibi, B., Alotaibi, M.: A hybrid deep ResNet and inception model for hyperspectral image classification. PFG J. Photogramm. Remote Sens. Geoinf. Sci. 88, 463–476 (2020) Alotaibi, B., Alotaibi, M.: A hybrid deep ResNet and inception model for hyperspectral image classification. PFG J. Photogramm. Remote Sens. Geoinf. Sci. 88, 463–476 (2020)
41.
Zurück zum Zitat Hammad, M., Plawiak, P., Wang, K.Q., Acharya, U.R.: ResNet-attention model for human authentication using ECG signals. Expert Syst., 17, Article ID: e12547 (2020) Hammad, M., Plawiak, P., Wang, K.Q., Acharya, U.R.: ResNet-attention model for human authentication using ECG signals. Expert Syst., 17, Article ID: e12547 (2020)
Metadaten
Titel
Facial Expression Recognition via ResNet-18
verfasst von
Bin Li
Runda Li
Dimas Lima
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-82565-2_24