Skip to main content
Erschienen in: Neural Computing and Applications 8/2018

24.11.2016 | New Trends in data pre-processing methods for signal and image classification

Application of fuzzy C-means clustering algorithm to spectral features for emotion classification from speech

Erschienen in: Neural Computing and Applications | Ausgabe 8/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In the present study, emotion recognition from speech signals was performed by using the fuzzy C-means algorithm. Spectral features obtained from speech signals were used as features. The spectral features used were Mel frequency cepstral coefficients and linear prediction coefficients. Certain statistical features were extracted from the spectral features obtained in the study. After the selection of the extracted features, cluster centers were identified by using type-1 fuzzy C-means (FCM) algorithm and used as input to the classifier. Supervised classifiers such as ANN, NB, kNN, and SVM were used for classification. In the study, all seven emotions of the EmoDB database were used. Of the features obtained, FCM clustering was applied to Mel coefficients and obtained clusters centers were used as input for classification. The results showed that using FCM for preprocessing aim increased the success rate. The comparison of the classification methods showed that the maximum success rate was obtained as 92.86% using the SVM classifier.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
2.
Zurück zum Zitat Ma J, Jin H, Yang LT, Tsai JJ-P (2006) Ubiquitous intelligence and computing: third international conference, UIC 2006, Wuhan, China, September 3–6 proceedings (LNCS). Springer, SecaucusCrossRef Ma J, Jin H, Yang LT, Tsai JJ-P (2006) Ubiquitous intelligence and computing: third international conference, UIC 2006, Wuhan, China, September 3–6 proceedings (LNCS). Springer, SecaucusCrossRef
3.
Zurück zum Zitat Nasukawa T, Nasukawa T, Yi J, Yi J (2003) Sentiment analysis: capturing favorability using natural language processing. In: Proceedings of the 2nd international conference on knowledge capture, pp 70–77. doi:10.1145/945645.945658 Nasukawa T, Nasukawa T, Yi J, Yi J (2003) Sentiment analysis: capturing favorability using natural language processing. In: Proceedings of the 2nd international conference on knowledge capture, pp 70–77. doi:10.​1145/​945645.​945658
4.
Zurück zum Zitat Sönmez E, Aalbayrak S (2016) A facial component-based system for emotion classification. Turkish J Electr Eng Comput Sci 24:1663–1673CrossRef Sönmez E, Aalbayrak S (2016) A facial component-based system for emotion classification. Turkish J Electr Eng Comput Sci 24:1663–1673CrossRef
7.
Zurück zum Zitat Zhao X, Zhang S (2015) Spoken emotion recognition via locality-constrained kernel sparse representation. Neural Comput Appl 26(3):735–744CrossRef Zhao X, Zhang S (2015) Spoken emotion recognition via locality-constrained kernel sparse representation. Neural Comput Appl 26(3):735–744CrossRef
9.
Zurück zum Zitat Karimi S, Sedaaghi MH (2016) How to categorize emotional speech signals with respect to the speaker’s degree of emotional intensity. Turkish J Electr Eng Comput Sci 24:1306–1324. doi:10.3906/elk-1312-196 CrossRef Karimi S, Sedaaghi MH (2016) How to categorize emotional speech signals with respect to the speaker’s degree of emotional intensity. Turkish J Electr Eng Comput Sci 24:1306–1324. doi:10.​3906/​elk-1312-196 CrossRef
12.
15.
Zurück zum Zitat Hanilçi C (2007) A comparative study of speaker recognition techniques, MSc, Uludag University, Bursa Hanilçi C (2007) A comparative study of speaker recognition techniques, MSc, Uludag University, Bursa
21.
Zurück zum Zitat Kotropoulos C (2003) A state of the art review on emotional speech databases. In: 1st Richmedia conference, pp 109–119 Kotropoulos C (2003) A state of the art review on emotional speech databases. In: 1st Richmedia conference, pp 109–119
22.
Zurück zum Zitat Burkhardt F, Paeschke A, Rolfes M et al (2005) A database of German emotional speech. In: 9th European conference on speech communication and technology, pp 3–6 Burkhardt F, Paeschke A, Rolfes M et al (2005) A database of German emotional speech. In: 9th European conference on speech communication and technology, pp 3–6
23.
Zurück zum Zitat Becchetti C, Ricotti LP (2004) Speech recognition: theory an C++ implementation, 3rd edn. Wiley, New York, pp 125–135 Becchetti C, Ricotti LP (2004) Speech recognition: theory an C++ implementation, 3rd edn. Wiley, New York, pp 125–135
24.
Zurück zum Zitat Dunn JC (1973) A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J Cybern 3:32–57MathSciNetCrossRefMATH Dunn JC (1973) A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J Cybern 3:32–57MathSciNetCrossRefMATH
25.
Zurück zum Zitat Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum Press, New York, p 4CrossRefMATH Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum Press, New York, p 4CrossRefMATH
26.
27.
Zurück zum Zitat Bezdek JC, Ehrlich R, Full W (1984) FCM: the fuzzy C-means clustering algorithm. Comput Geosci 10(2–3):191–203CrossRef Bezdek JC, Ehrlich R, Full W (1984) FCM: the fuzzy C-means clustering algorithm. Comput Geosci 10(2–3):191–203CrossRef
29.
Zurück zum Zitat Anderson D, Mcneill G (1992) Artificial neural networks technology. Kaman Sciences Corporation, Utica, New York Anderson D, Mcneill G (1992) Artificial neural networks technology. Kaman Sciences Corporation, Utica, New York
30.
Zurück zum Zitat Baluja S (1995) Artificial neural network evolution: learning to steer a land vehicle. CRC Press Inc Baluja S (1995) Artificial neural network evolution: learning to steer a land vehicle. CRC Press Inc
31.
Zurück zum Zitat Mitchell TM (1997) Machine learning. McGraw-Hill, Inc., New YorkMATH Mitchell TM (1997) Machine learning. McGraw-Hill, Inc., New YorkMATH
37.
38.
Zurück zum Zitat Rish I (2001) An empirical study of the naive Bayes classifier. In: Proceedings of IJCAI-01 workshop on Empirical Methods in AI, pp 41–46 Rish I (2001) An empirical study of the naive Bayes classifier. In: Proceedings of IJCAI-01 workshop on Empirical Methods in AI, pp 41–46
39.
Zurück zum Zitat Bouckaert RR, Frank E, Hall MA, Holmes G, Pfahringer B, Reutemann P, Witten IH (2010) WEKA-experiences with a java open-source project. J Mach Learn Res 11:2533–2541MATH Bouckaert RR, Frank E, Hall MA, Holmes G, Pfahringer B, Reutemann P, Witten IH (2010) WEKA-experiences with a java open-source project. J Mach Learn Res 11:2533–2541MATH
40.
Zurück zum Zitat Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software. ACM SIGKDD Explor Newsl 11:10–18CrossRef Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software. ACM SIGKDD Explor Newsl 11:10–18CrossRef
43.
Zurück zum Zitat Engberg IS, Hansen AV (1996) Documentation of the danish emotional speech database des. Intern AAU report, Cent Pers Kommun, p 22 Engberg IS, Hansen AV (1996) Documentation of the danish emotional speech database des. Intern AAU report, Cent Pers Kommun, p 22
Metadaten
Titel
Application of fuzzy C-means clustering algorithm to spectral features for emotion classification from speech
Publikationsdatum
24.11.2016
Erschienen in
Neural Computing and Applications / Ausgabe 8/2018
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-016-2712-y

Weitere Artikel der Ausgabe 8/2018

Neural Computing and Applications 8/2018 Zur Ausgabe

New Trends in data pre-processing methods for signal and image classification

Adaptive hexagonal fuzzy hybrid filter for Rician noise removal in MRI images