Published in: Wireless Personal Communications 1/2023

15.03.2023

Speech Emotion Recognition Systems: A Comprehensive Review on Different Methodologies

Authors: Audre Arlene Anthony, Chandreshekar Mohan Patil


Abstract

Speech is a common and natural way for humans to express themselves. Speech Emotion Recognition (SER) systems can be defined as an assortment of methods that process and classify speech signals to detect the associated emotions. Automatic emotion recognition is the technique of identifying human emotions from various signals such as speech, facial expressions and text. Collecting and labelling such signals is often tedious and requires expert knowledge. This paper reviews the different types of open-source speech emotion datasets in various languages and surveys recent literature in the area of speech emotion recognition that employs a number of machine learning approaches with the objective of enhancing classification accuracy. The paper aims to identify and synthesize contemporary, pertinent literature on SER systems with different methodologies or design components, thus providing researchers with an up-to-date understanding of the research topic in the field of SER.
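The "process and classify" pipeline that the abstract attributes to SER systems can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two features (mean energy and zero-crossing rate), the nearest-centroid classifier, the class names "calm" and "angry", and the synthetic noisy sine waves that stand in for recorded utterances. Systems covered by a review of this kind typically use far richer features and learned models.

```python
import math
import random


def extract_features(signal):
    """Return (mean energy, zero-crossing rate) for one signal."""
    n = len(signal)
    energy = sum(x * x for x in signal) / n
    crossings = sum(1 for a, b in zip(signal, signal[1:]) if (a < 0) != (b < 0))
    return (energy, crossings / (n - 1))


def synth_signal(freq_hz, amplitude, rng, sr=8000, duration=0.25):
    """Toy stand-in for an utterance: a noisy sine wave (NOT real speech)."""
    n = int(sr * duration)
    return [
        amplitude * math.sin(2 * math.pi * freq_hz * t / sr) + rng.gauss(0, 0.01)
        for t in range(n)
    ]


def train_centroids(examples):
    """Average the feature vectors per label: label -> centroid tuple."""
    sums, counts = {}, {}
    for feats, label in examples:
        acc = sums.setdefault(label, [0.0] * len(feats))
        for i, value in enumerate(feats):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: tuple(v / counts[label] for v in acc) for label, acc in sums.items()}


def classify(feats, centroids):
    """Assign the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(feats, centroids[label]))


def demo(seed=0):
    """Train on synthetic examples and classify one fresh signal per class."""
    rng = random.Random(seed)
    # Hypothetical classes: (pitch in Hz, amplitude). Chosen only so that
    # the two toy classes are separable; not an acoustic claim.
    specs = {"calm": (120.0, 0.2), "angry": (300.0, 0.8)}
    train = []
    for label, (freq, amp) in specs.items():
        for _ in range(10):
            sig = synth_signal(freq * rng.uniform(0.9, 1.1),
                               amp * rng.uniform(0.9, 1.1), rng)
            train.append((extract_features(sig), label))
    centroids = train_centroids(train)
    tests = {label: synth_signal(freq, amp, rng) for label, (freq, amp) in specs.items()}
    return {label: classify(extract_features(sig), centroids)
            for label, sig in tests.items()}
```

Calling `demo()` returns a dictionary that maps each toy class to the label predicted for a fresh synthetic signal of that class. The features are left unnormalized here for brevity; with real recordings, feature scaling and a held-out evaluation set would be needed before any accuracy figure meant anything.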


Metadata
Title
Speech Emotion Recognition Systems: A Comprehensive Review on Different Methodologies
Authors
Audre Arlene Anthony
Chandreshekar Mohan Patil
Publication date
15.03.2023
Publisher
Springer US
Published in
Wireless Personal Communications / Issue 1/2023
Print ISSN: 0929-6212
Electronic ISSN: 1572-834X
DOI
https://doi.org/10.1007/s11277-023-10296-5
