Skip to main content
Erschienen in: Mobile Networks and Applications 4/2018

29.06.2018

Detecting Human Emotions in a Large Size of Database by Using Ensemble Classification Model

verfasst von: Sathit Prasomphan, Surinee Doungwichain

Erschienen in: Mobile Networks and Applications | Ausgabe 4/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

One of the most challenging researches in the field of Human-Computer Interaction (HCI) is Speech Emotion Recognition (SER). Several factors affect to the classification result. For example, the accuracy of detecting emotion depends on type of emotion and number of emotion which is classified and quality of speech is also the importance feature. Four different emotion types (anger, happy, natural, and sad) from Thai speech was used in this research. All of theses speech were recorded from Thai drama show which were most similar with daily life speech. The ensemble classification method with majority weight voting was used. This proposed algorithms used the combination of Support Vector Machine, Neural Network and k-Nearest Neighbors for emotion classification. The experimental results show that emotion classification by using the ensemble classification method by using the majority weight voting can efficiency give the better accuracy results than the single model. The proposed method has better results when using with fundamental frequency (F0) and Mel-Frequency Cepstral Coefficients (MFCC) of speech which give the accuracy results at 70.69.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Ayadi MMHE, Kamel MS, Karray F (2011) Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn 44:572–587 Ayadi MMHE, Kamel MS, Karray F (2011) Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn 44:572–587
2.
Zurück zum Zitat Xu S, Liu Y, Liu X (2013) Speaker recognition and speech emotion recognition based on GMM. In: 3rd international conference on electric and electronics Xu S, Liu Y, Liu X (2013) Speaker recognition and speech emotion recognition based on GMM. In: 3rd international conference on electric and electronics
3.
Zurück zum Zitat Seehapoch T, Wongthanavasu S (2011) Speech emotion recognition using Support Vector Machines. In: The 5th international conference on knowledge and smart technology (KST), pp 219–223 Seehapoch T, Wongthanavasu S (2011) Speech emotion recognition using Support Vector Machines. In: The 5th international conference on knowledge and smart technology (KST), pp 219–223
4.
Zurück zum Zitat Stickel C, Ebner M, Steinbach-Nordmann S, Searle G, Holzinger A (2009) Emotion detection: application of the valence arousal space for rapid biological usability testing to enhance universal access in universal access in Human-Computer interaction addressing diversity. Lecture notes in computer science, LNCS 5614. Springer, Berlin, pp 615–624 Stickel C, Ebner M, Steinbach-Nordmann S, Searle G, Holzinger A (2009) Emotion detection: application of the valence arousal space for rapid biological usability testing to enhance universal access in universal access in Human-Computer interaction addressing diversity. Lecture notes in computer science, LNCS 5614. Springer, Berlin, pp 615–624
5.
Zurück zum Zitat Hong M, Jung JJ, Camacho D (2017) GRSAT: a novel method on group recommendation by social affinity and trustworthiness. Cybern Syst 48(3):140–161CrossRef Hong M, Jung JJ, Camacho D (2017) GRSAT: a novel method on group recommendation by social affinity and trustworthiness. Cybern Syst 48(3):140–161CrossRef
6.
Zurück zum Zitat Burkhardt F, Paeschke A, Rolfes M, Sendlmeier W, Weiss B (2005) A database of German emotional speech. In: Proceedings of the interspeech Burkhardt F, Paeschke A, Rolfes M, Sendlmeier W, Weiss B (2005) A database of German emotional speech. In: Proceedings of the interspeech
7.
Zurück zum Zitat Kasuriya S, Teeramunkong T, Wutiwiwatchai C (2013) Developing a Thai emotional speech corpus. In: International conference on Asian spoken language research and evaluation Kasuriya S, Teeramunkong T, Wutiwiwatchai C (2013) Developing a Thai emotional speech corpus. In: International conference on Asian spoken language research and evaluation
8.
Zurück zum Zitat Kasuriya S, Banchaditt T, Somboon N, Teeramunkong T, Wutiwiwatchai C (2013) Detecting emotional speech in thai drama. In: 2nd ICT international student project conference (ICT-ISPC) Kasuriya S, Banchaditt T, Somboon N, Teeramunkong T, Wutiwiwatchai C (2013) Detecting emotional speech in thai drama. In: 2nd ICT international student project conference (ICT-ISPC)
9.
Zurück zum Zitat Thamsiri D, Meesad P (2011) Ensemble data classification based on decision tree, artificial neuron network and support vector machine optimized by genetic algorithm. J King's Mongkut's Univ Technol North Bangkok 21 (2):293–303 Thamsiri D, Meesad P (2011) Ensemble data classification based on decision tree, artificial neuron network and support vector machine optimized by genetic algorithm. J King's Mongkut's Univ Technol North Bangkok 21 (2):293–303
10.
Zurück zum Zitat Hong M, Jung JJ, Piccialli F, Chianese A (2017) Social recommendation service for cultural heritage. Pers Ubiquit Comput 21(2):191–201CrossRef Hong M, Jung JJ, Piccialli F, Chianese A (2017) Social recommendation service for cultural heritage. Pers Ubiquit Comput 21(2):191–201CrossRef
11.
Zurück zum Zitat Rieger SA Jr, Muraleedharan R, Ramachandran RP (2014) Speech based emotion recognition using spectral feature extraction and an ensemble of kNN classifiers. In: 9th international symposium on Chinese spoken language processing (ISCSLP), pp 589– 593 Rieger SA Jr, Muraleedharan R, Ramachandran RP (2014) Speech based emotion recognition using spectral feature extraction and an ensemble of kNN classifiers. In: 9th international symposium on Chinese spoken language processing (ISCSLP), pp 589– 593
12.
Zurück zum Zitat Anagnostopoulos T, Skourlas C (2014) Ensemble majority voting classifier for speech emotion recognition and prediction. J Syst Inf Technol 16(3):222–232CrossRef Anagnostopoulos T, Skourlas C (2014) Ensemble majority voting classifier for speech emotion recognition and prediction. J Syst Inf Technol 16(3):222–232CrossRef
13.
Zurück zum Zitat Nicholson J, Takahashi K, Nakatsu R (1999) Emotion recognition in speech using neural networks. In: 6th international conference on neural information processing, vol 2, pp 495–501 Nicholson J, Takahashi K, Nakatsu R (1999) Emotion recognition in speech using neural networks. In: 6th international conference on neural information processing, vol 2, pp 495–501
14.
Zurück zum Zitat Mu X, Lu J, Watta P, Hassoun MH (2009) Weighted voting-based ensemble classifiers with application to human face recognition and voice recognition. In: Proceedings of international joint conference on neural networks. Atlanta, Georgia, USA, pp 2168–2171 Mu X, Lu J, Watta P, Hassoun MH (2009) Weighted voting-based ensemble classifiers with application to human face recognition and voice recognition. In: Proceedings of international joint conference on neural networks. Atlanta, Georgia, USA, pp 2168–2171
15.
Zurück zum Zitat Bui KN, Jung JJ (2018) Internet of agents framework for connected vehicles: a case study on distributed traffic control system. J Parallel Distrib Comput 116:89–95CrossRef Bui KN, Jung JJ (2018) Internet of agents framework for connected vehicles: a case study on distributed traffic control system. J Parallel Distrib Comput 116:89–95CrossRef
16.
Zurück zum Zitat Morrison D, Wang R, Silva LCD (2007) Ensemble methods for spoken emotion recognition in call-centres. J Speech Commun 49(2):98–112CrossRef Morrison D, Wang R, Silva LCD (2007) Ensemble methods for spoken emotion recognition in call-centres. J Speech Commun 49(2):98–112CrossRef
17.
Zurück zum Zitat Aha D, Kibler D (1991) Instance-based learning algorithms. Mach Learn 6:37–66MATH Aha D, Kibler D (1991) Instance-based learning algorithms. Mach Learn 6:37–66MATH
18.
Zurück zum Zitat Sharkey AJC (1999) Combining artificial neural nets ensemble and modular multi-net systems. Springer, LondonCrossRefMATH Sharkey AJC (1999) Combining artificial neural nets ensemble and modular multi-net systems. Springer, LondonCrossRefMATH
19.
Zurück zum Zitat Vasuki P (2015) Speech emotion recognition using adaptive ensemble of class specific classifiers, research journal of applied sciences. Eng Technol 9(12):1105–1114 Vasuki P (2015) Speech emotion recognition using adaptive ensemble of class specific classifiers, research journal of applied sciences. Eng Technol 9(12):1105–1114
20.
Zurück zum Zitat Sittidech P, Nai-arun N (2015) Bagging model with cost sensitive analysis on diabetes data. Inf Technol J, KMUTNB 11(1):82–90 Sittidech P, Nai-arun N (2015) Bagging model with cost sensitive analysis on diabetes data. Inf Technol J, KMUTNB 11(1):82–90
21.
Zurück zum Zitat Shen P, Changjun Z (2011) Automatic speech emotion recognition using support vector machine. In: International conference on electronic & mechanical engineering and information technology, pp 621–625 Shen P, Changjun Z (2011) Automatic speech emotion recognition using support vector machine. In: International conference on electronic & mechanical engineering and information technology, pp 621–625
22.
Zurück zum Zitat Orgaz GB, Jung JJ, Camacho D (2016) Social big data: recent achievements and new challenges. Inf Fusion 28:45–59CrossRef Orgaz GB, Jung JJ, Camacho D (2016) Social big data: recent achievements and new challenges. Inf Fusion 28:45–59CrossRef
Metadaten
Titel
Detecting Human Emotions in a Large Size of Database by Using Ensemble Classification Model
verfasst von
Sathit Prasomphan
Surinee Doungwichain
Publikationsdatum
29.06.2018
Verlag
Springer US
Erschienen in
Mobile Networks and Applications / Ausgabe 4/2018
Print ISSN: 1383-469X
Elektronische ISSN: 1572-8153
DOI
https://doi.org/10.1007/s11036-018-1074-3

Weitere Artikel der Ausgabe 4/2018

Mobile Networks and Applications 4/2018 Zur Ausgabe

Neuer Inhalt