Skip to main content
Top

2024 | OriginalPaper | Chapter

MLP-Based Speech Emotion Recognition for Audio and Visual Features

Authors : G. Kothai, Prabhas Bhanu Boora, S. Muzammil, L. Venkata Subhash, B. Naga Raju

Published in: Micro-Electronics and Telecommunication Engineering

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Due to its potential applications in domains involving psychology, social smart machines, and human–computer interaction, speech emotion recognition has become an important and growing topic field in the study. That article offers the service comparative analysis of strategies in artificial intelligence classifiers for speech emotion recognition, including MLP, decision tree, SVM, and random forest. Using visual analysis methods like wave plot and spectrogram analysis as well as audio feature extraction, we evaluate these classifiers. According to our observations, MLP achieves better performance than the other classifiers, obtaining an accuracy of 85% when identifying different emotions. Moreover, we illustrate how audio feature extraction and visual analysis help to improve emotion recognition efficiency. Our research has applications for creating speech emotion recognition algorithms that may be applied in real-life scenarios.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Chattopadhyay S, Dey A, Basak H (2020) Optimizing speech emotion recognition using manta-ray-based feature selection. arXiv preprint arXiv:2009.08909 Chattopadhyay S, Dey A, Basak H (2020) Optimizing speech emotion recognition using manta-ray-based feature selection. arXiv preprint arXiv:​2009.​08909
2.
go back to reference Wani TM, Gunawan TS, Qadri SAA, Kartiwi M, Ambikairajah E (2021) A comprehensive review of speech emotion recognition systems. IEEE Access 9:47795–47814CrossRef Wani TM, Gunawan TS, Qadri SAA, Kartiwi M, Ambikairajah E (2021) A comprehensive review of speech emotion recognition systems. IEEE Access 9:47795–47814CrossRef
3.
go back to reference Al Dujaili MJ, Ebrahimi-Moghadam A, Fatlawi A (2021) Speech emotion recognition based on SVM and KNN classifications fusion. Int J Electr Comput Eng 11(2):1259 Al Dujaili MJ, Ebrahimi-Moghadam A, Fatlawi A (2021) Speech emotion recognition based on SVM and KNN classifications fusion. Int J Electr Comput Eng 11(2):1259
4.
go back to reference Fang Y, Yang H, Zhang X, Liu H, Tao B (2021) Multi-feature input deep forest for EEG-based emotion recognition. Front Neurorobot 14:617531CrossRef Fang Y, Yang H, Zhang X, Liu H, Tao B (2021) Multi-feature input deep forest for EEG-based emotion recognition. Front Neurorobot 14:617531CrossRef
5.
go back to reference Khan M et al (2011) Comparison between KNN and SVM method for speech emotion recognition. Int J Comput Sci Eng 3(2):607–611 Khan M et al (2011) Comparison between KNN and SVM method for speech emotion recognition. Int J Comput Sci Eng 3(2):607–611
6.
go back to reference Tripathi S et al (2019) Deep learning based emotion recognition system using speech features and transcriptions. arXiv preprint arXiv:1906.05681 Tripathi S et al (2019) Deep learning based emotion recognition system using speech features and transcriptions. arXiv preprint arXiv:​1906.​05681
7.
go back to reference Huang C et al (2014) A research of speech emotion recognition based on deep belief network and SVM. Math Probl Eng Huang C et al (2014) A research of speech emotion recognition based on deep belief network and SVM. Math Probl Eng
8.
go back to reference Kerkeni L et al (2019) Automatic speech emotion recognition using machine learning. In: Social media and machine learning. IntechOpen Kerkeni L et al (2019) Automatic speech emotion recognition using machine learning. In: Social media and machine learning. IntechOpen
9.
go back to reference Ingale AB, Chaudhari DS (2012) Speech emotion recognition. Int J Soft Comput Eng (IJSCE) 2(1):235–238 Ingale AB, Chaudhari DS (2012) Speech emotion recognition. Int J Soft Comput Eng (IJSCE) 2(1):235–238
10.
go back to reference Bertero D et al (2016) Real-time speech emotion and sentiment recognition for interactive dialogue systems. In: Proceedings of the 2016 conference on empirical methods in natural language processing Bertero D et al (2016) Real-time speech emotion and sentiment recognition for interactive dialogue systems. In: Proceedings of the 2016 conference on empirical methods in natural language processing
11.
go back to reference Mirsamadi S, Barsoum E, Zhang C (2017) Automatic speech emotion recognition using recurrent neural networks with local attention. In: 2017 IEEE International conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 2227–2233 Mirsamadi S, Barsoum E, Zhang C (2017) Automatic speech emotion recognition using recurrent neural networks with local attention. In: 2017 IEEE International conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 2227–2233
12.
go back to reference Khalil RA, Jones E, Babar MI, Jan T, Zafar MH, Alhussain T (2019) Speech emotion recognition using deep learning techniques: a review. IEEE Access 7:117327–117345CrossRef Khalil RA, Jones E, Babar MI, Jan T, Zafar MH, Alhussain T (2019) Speech emotion recognition using deep learning techniques: a review. IEEE Access 7:117327–117345CrossRef
13.
go back to reference Abbaschian BJ, Sierra-Sosa D, Elmaghraby A (2021) Deep learning techniques for speech emotion recognition, from databases to models. Sensors 21(4):1249CrossRef Abbaschian BJ, Sierra-Sosa D, Elmaghraby A (2021) Deep learning techniques for speech emotion recognition, from databases to models. Sensors 21(4):1249CrossRef
14.
go back to reference Fayek HM, Lech M, Cavedon L (2017) Evaluating deep learning architectures for speech emotion recognition. Neural Netw 92:60–68CrossRef Fayek HM, Lech M, Cavedon L (2017) Evaluating deep learning architectures for speech emotion recognition. Neural Netw 92:60–68CrossRef
15.
go back to reference Pandey SK, Shekhawat HS, Prasanna SM (2019) Deep learning techniques for speech emotion recognition: a review. In: 2019 29th International conference radioelektronika (RADIOELEKTRONIKA). IEEE, pp 1–6 Pandey SK, Shekhawat HS, Prasanna SM (2019) Deep learning techniques for speech emotion recognition: a review. In: 2019 29th International conference radioelektronika (RADIOELEKTRONIKA). IEEE, pp 1–6
16.
go back to reference Lieskovská E, Jakubec M, Jarina R, Chmulík M (2021) A review on speech emotion recognition using deep learning and attention mechanism. Electronics 10(10):1163CrossRef Lieskovská E, Jakubec M, Jarina R, Chmulík M (2021) A review on speech emotion recognition using deep learning and attention mechanism. Electronics 10(10):1163CrossRef
17.
go back to reference Lim W, Jang D, Lee T (2016) Speech emotion recognition using convolutional and recurrent neural networks. In: 2016 Asia-Pacific signal and information processing association annual summit and Conference (APSIPA). IEEE, pp 1–4 Lim W, Jang D, Lee T (2016) Speech emotion recognition using convolutional and recurrent neural networks. In: 2016 Asia-Pacific signal and information processing association annual summit and Conference (APSIPA). IEEE, pp 1–4
18.
go back to reference Gupta S, Mehra A (2015) Speech emotion recognition using SVM with thresholding fusion. In: 2015 2nd International conference on signal processing and integrated networks (SPIN). IEEE, pp 570–574 Gupta S, Mehra A (2015) Speech emotion recognition using SVM with thresholding fusion. In: 2015 2nd International conference on signal processing and integrated networks (SPIN). IEEE, pp 570–574
19.
go back to reference Kwon S (2020) CLSTM: Deep feature-based speech emotion recognition using the hierarchical ConvLSTM network. Mathematics 8(12):21 Kwon S (2020) CLSTM: Deep feature-based speech emotion recognition using the hierarchical ConvLSTM network. Mathematics 8(12):21
Metadata
Title
MLP-Based Speech Emotion Recognition for Audio and Visual Features
Authors
G. Kothai
Prabhas Bhanu Boora
S. Muzammil
L. Venkata Subhash
B. Naga Raju
Copyright Year
2024
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-99-9562-2_2