Skip to main content
Erschienen in: Cognitive Computation 4/2013

01.12.2013

Global Selection of Features for Nonlinear Dynamics Characterization of Emotional Speech

verfasst von: Patricia Henríquez Rodríguez, Jesús B. Alonso Hernández, Miguel A. Ferrer Ballester, Carlos M. Travieso González, Juan R. Orozco-Arroyave

Erschienen in: Cognitive Computation | Ausgabe 4/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper proposes the application of measures based on nonlinear dynamics for emotional speech characterization. Measures such as mutual information, dimension correlation, entropy correlation, Shannon’s entropy, Lempel–Ziv complexity and Hurst exponent are extracted from the samples of a database of emotional speech. Then, summary statistics such as mean, standard deviation, skewness and kurtosis are applied on the extracted measures. Experiments were conducted on the Berlin emotional speech database for a three-class problem (neutral, fear and anger as emotional states). Feature selection is accomplished and a methodology is proposed to find the best features. In order to evaluate the discrimination ability of the selected features, a neural network classifier is used. The global success rate is 93.78 ± 3.18 %.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Yildirim S, Narayanan S, Potamianos A. Detecting emotional state of a child in a conversational computer game. Comput Speech Lang. 2011;25:29–44.CrossRef Yildirim S, Narayanan S, Potamianos A. Detecting emotional state of a child in a conversational computer game. Comput Speech Lang. 2011;25:29–44.CrossRef
2.
Zurück zum Zitat Burkhardt F, Polzehl T, Stegmann J, Metze F, Huber R. Detecting real life anger. In: Proceedings of the IEEE international conference on acoustics, speech and signal process. Taipei: IEEE Press; 2009. p. 4761–4764. Burkhardt F, Polzehl T, Stegmann J, Metze F, Huber R. Detecting real life anger. In: Proceedings of the IEEE international conference on acoustics, speech and signal process. Taipei: IEEE Press; 2009. p. 4761–4764.
3.
Zurück zum Zitat Lefter I, Rothkrantz LJM, van Leeuwen DA, Wiggers P. Automatic stress detection in emergency (Telephone) calls. Int J Intell Def Support Syst. 2011;4(2):148–68.CrossRef Lefter I, Rothkrantz LJM, van Leeuwen DA, Wiggers P. Automatic stress detection in emergency (Telephone) calls. Int J Intell Def Support Syst. 2011;4(2):148–68.CrossRef
5.
Zurück zum Zitat Wu S, Falk TH, Wai-Yip, C. Automatic recognition of speech emotion using long-term spectro-temporal features. In: Proceedings of the 16th IEEE international conference on digital signal process. Santorini, Greece; 5–7 July 2009, p. 1–6. Wu S, Falk TH, Wai-Yip, C. Automatic recognition of speech emotion using long-term spectro-temporal features. In: Proceedings of the 16th IEEE international conference on digital signal process. Santorini, Greece; 5–7 July 2009, p. 1–6.
6.
Zurück zum Zitat Giannakopoulos T, Pikrakis A, Theodoridis SA. Dimensional approach to emotion recognition of speech from movies. In: Proceedings of the 34th IEEE international conference on acoustic, speech and signal process. (ICASSP 2009). Taipei, Taiwan; 19–24 April 2009, p. 65–68. Giannakopoulos T, Pikrakis A, Theodoridis SA. Dimensional approach to emotion recognition of speech from movies. In: Proceedings of the 34th IEEE international conference on acoustic, speech and signal process. (ICASSP 2009). Taipei, Taiwan; 19–24 April 2009, p. 65–68.
7.
Zurück zum Zitat Schuller B, Batliner A, Steidl S, Seppi D. Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge. Speech Comm. 2011;53(9–10):1062–87. doi:10.1016/j.specom.2011.01.011. Schuller B, Batliner A, Steidl S, Seppi D. Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge. Speech Comm. 2011;53(9–10):1062–87. doi:10.​1016/​j.​specom.​2011.​01.​011.
8.
Zurück zum Zitat Burkhardt F, Paeschke A, Rolfes M, Sendlmeier WF, Weiss B. A database of German emotional speech. In: Proceedings of the 6th annual conference of the international speech communication association (Interspeech 2005), Lisbon, Portugal; 4–8 September 2005, p. 1517–1520. http://pascal.kgw.tuberlin.de/emodb/. Burkhardt F, Paeschke A, Rolfes M, Sendlmeier WF, Weiss B. A database of German emotional speech. In: Proceedings of the 6th annual conference of the international speech communication association (Interspeech 2005), Lisbon, Portugal; 4–8 September 2005, p. 1517–1520. http://​pascal.​kgw.​tuberlin.​de/​emodb/​.​
9.
Zurück zum Zitat Wu S, Falk TH, Wai-Yip C. Automatic speech emotion recognition using modulation spectral features. Speech Comm. 2011;53:768–85.CrossRef Wu S, Falk TH, Wai-Yip C. Automatic speech emotion recognition using modulation spectral features. Speech Comm. 2011;53:768–85.CrossRef
10.
Zurück zum Zitat Henríquez P, Alonso JB, Ferrer MA, Travieso CM, Godino-Llorente JI, Díaz-de-María F. Characterization of healthy and pathological voice through measures based on nonlinear dynamics. IEEE Trans Audio Speech Lang Process. 2009;17(6):1186–95.CrossRef Henríquez P, Alonso JB, Ferrer MA, Travieso CM, Godino-Llorente JI, Díaz-de-María F. Characterization of healthy and pathological voice through measures based on nonlinear dynamics. IEEE Trans Audio Speech Lang Process. 2009;17(6):1186–95.CrossRef
11.
Zurück zum Zitat Alonso JB, Díaz-de-María F, Travieso CM, Ferrer MA. Using nonlinear features for voice disorder detection. In: Proceedings of 3rd international conference nonlinear speech process. Barcelona, Spain; 2005, p. 94–106. Alonso JB, Díaz-de-María F, Travieso CM, Ferrer MA. Using nonlinear features for voice disorder detection. In: Proceedings of 3rd international conference nonlinear speech process. Barcelona, Spain; 2005, p. 94–106.
12.
Zurück zum Zitat Vaziri G, Almasganj F, Jenabi MS. On the fractal self- similarity of laryngeal pathologies detection: the estimation of hurst parameter. In: Proceedings of the 5th International conference on Information Technology and Application in Biomedicine. Shenzhen, China; 2008, p. 383–386. Vaziri G, Almasganj F, Jenabi MS. On the fractal self- similarity of laryngeal pathologies detection: the estimation of hurst parameter. In: Proceedings of the 5th International conference on Information Technology and Application in Biomedicine. Shenzhen, China; 2008, p. 383–386.
13.
Zurück zum Zitat Vaziri G, Almasganj F, Behroozmand R. Pathological assessment of patients’ speech signals using nonlinear dynamical analysis. Comput Biol Med. 2010;40:54–63.PubMedCrossRef Vaziri G, Almasganj F, Behroozmand R. Pathological assessment of patients’ speech signals using nonlinear dynamical analysis. Comput Biol Med. 2010;40:54–63.PubMedCrossRef
14.
Zurück zum Zitat Tsanas A, Little MA, McSharry PE, Spielman J, Ramig LO. Novel speech signal processing algorithms for high-accuracy classification of Parkinson’s disease. IEEE Trans Biomed Eng. 2012;59(5):1264–71. Tsanas A, Little MA, McSharry PE, Spielman J, Ramig LO. Novel speech signal processing algorithms for high-accuracy classification of Parkinson’s disease. IEEE Trans Biomed Eng. 2012;59(5):1264–71.
15.
Zurück zum Zitat Little MA, McSharry PE, Hunter EJ, Spielman J, Ramig LO. Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE Trans Biomed Eng. 2009;56(4):1015–22.PubMedCrossRef Little MA, McSharry PE, Hunter EJ, Spielman J, Ramig LO. Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE Trans Biomed Eng. 2009;56(4):1015–22.PubMedCrossRef
16.
Zurück zum Zitat Little MA, McSharry PE, Roberts SJ, Costello DA, Moroz IM. Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection. Biomed Eng Online. 2007;6:23.PubMedCrossRef Little MA, McSharry PE, Roberts SJ, Costello DA, Moroz IM. Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection. Biomed Eng Online. 2007;6:23.PubMedCrossRef
17.
Zurück zum Zitat Takens F. Detecting strange attractors in turbulence. Lecture notes in math, vol. 898. New York: Springer; 1981. p. 366–81. Takens F. Detecting strange attractors in turbulence. Lecture notes in math, vol. 898. New York: Springer; 1981. p. 366–81.
18.
Zurück zum Zitat Fraser AM, Swinney HL. Independent coordinates for strange attractors from mutual information. Phys Rev A. 1986;33(2):1134–40.PubMedCrossRef Fraser AM, Swinney HL. Independent coordinates for strange attractors from mutual information. Phys Rev A. 1986;33(2):1134–40.PubMedCrossRef
19.
Zurück zum Zitat Kennel MB, Brown R, Abarbanel HDI. Determining embedding dimension for phase-space reconstruction using a geometrical construction. Phys Rev A. 1992;45(6):3403–11.PubMedCrossRef Kennel MB, Brown R, Abarbanel HDI. Determining embedding dimension for phase-space reconstruction using a geometrical construction. Phys Rev A. 1992;45(6):3403–11.PubMedCrossRef
20.
Zurück zum Zitat Kantz H, Schreiber T. Nonlinear time series analysis. 2nd ed. Cambridge: Cambridge University Press; 1997. Kantz H, Schreiber T. Nonlinear time series analysis. 2nd ed. Cambridge: Cambridge University Press; 1997.
21.
Zurück zum Zitat Theiler J. Lacunarity in a best estimator of fractal dimension. Phys Lett A. 1988;133:195–200.CrossRef Theiler J. Lacunarity in a best estimator of fractal dimension. Phys Lett A. 1988;133:195–200.CrossRef
22.
Zurück zum Zitat Kaspar F, Shuster HG. Easily calculable measure for complexity of spatiotemporal patterns. Phys Rev A. 1987;36:842–8.PubMedCrossRef Kaspar F, Shuster HG. Easily calculable measure for complexity of spatiotemporal patterns. Phys Rev A. 1987;36:842–8.PubMedCrossRef
23.
Zurück zum Zitat Lempel A, Ziv J. On the complexity of finite sequences. IEEE Trans Inform Theory. 1976;22:75–81.CrossRef Lempel A, Ziv J. On the complexity of finite sequences. IEEE Trans Inform Theory. 1976;22:75–81.CrossRef
24.
Zurück zum Zitat Hurst HE, Black RP, Simaika YM. Long-term storage: an experimental study. London: Constable; 1965. Hurst HE, Black RP, Simaika YM. Long-term storage: an experimental study. London: Constable; 1965.
25.
Zurück zum Zitat Pudil P, Novovicová J, Kittler J. Floating search methods in feature selection. Pattern Recognit Lett. 1994;15:1119–25.CrossRef Pudil P, Novovicová J, Kittler J. Floating search methods in feature selection. Pattern Recognit Lett. 1994;15:1119–25.CrossRef
26.
Zurück zum Zitat Ruelle D. Deterministic chaos: the science and the fiction. Proc R Soc Lond A. 1990;427:241–8.CrossRef Ruelle D. Deterministic chaos: the science and the fiction. Proc R Soc Lond A. 1990;427:241–8.CrossRef
27.
Zurück zum Zitat Kienast M, Sendlmeier WF. Acoustical analysis of spectral and temporal changes in emotional speech. In: Proceedings of the ISCA workshop on speech and emotion. Newcastle, UK; 5–7 September 2000, p. 92–97. Kienast M, Sendlmeier WF. Acoustical analysis of spectral and temporal changes in emotional speech. In: Proceedings of the ISCA workshop on speech and emotion. Newcastle, UK; 5–7 September 2000, p. 92–97.
Metadaten
Titel
Global Selection of Features for Nonlinear Dynamics Characterization of Emotional Speech
verfasst von
Patricia Henríquez Rodríguez
Jesús B. Alonso Hernández
Miguel A. Ferrer Ballester
Carlos M. Travieso González
Juan R. Orozco-Arroyave
Publikationsdatum
01.12.2013
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 4/2013
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-012-9157-0

Weitere Artikel der Ausgabe 4/2013

Cognitive Computation 4/2013 Zur Ausgabe

Premium Partner