2016 | OriginalPaper | Chapter

Stress, Arousal, and Stress Detector Trained on Acted Speech Database

Authors: Róbert Sabo, Milan Rusko, Andrej Ridzik, Jakub Rajčáni

Published in: Speech and Computer

Publisher: Springer International Publishing

Abstract

This paper reports on initial experiments in building a database suitable for training and testing systems that detect stress in speech, together with the first experimental results. Drawing on the psychological understanding of stress and emotion, we operationalized stress as a level of arousal that can be detected in speech. We describe a speech database with three levels of “acted stress” and three levels of soothing. In the first experiment performed on the database, we detected the different levels of stress using Gaussian mixture models (GMMs). The accuracy of detecting the three levels of stress was 89% for speakers included in the training database and 73% for speakers whose recordings were not used during adaptation of the GMMs.
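To make the classification idea concrete, here is a minimal sketch of level detection with Gaussian mixture models: one GMM per stress level, with a test utterance assigned to the level whose model gives the highest average log-likelihood. It is not the authors' system. The abstract mentions adaptation of GMMs, whereas this sketch simply trains each model from scratch with EM. The one-dimensional feature, the synthetic data, the level means in `LEVEL_MEANS`, and all function names are assumptions made for illustration.

```python
import math
import random


def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def logsumexp(values):
    """Numerically stable log(sum(exp(v)))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def frame_loglik(x, comps):
    """Log-likelihood of one frame under a GMM given as (weight, mean, var) tuples."""
    return logsumexp([math.log(w) + log_gauss(x, m, v) for w, m, v in comps])


def fit_gmm(data, k=2, iters=30):
    """Fit a 1-D GMM with plain EM (no adaptation from a background model)."""
    n = len(data)
    srt = sorted(data)
    # Initialise the means at evenly spaced quantiles and share the global variance.
    means = [srt[(2 * j + 1) * n // (2 * k)] for j in range(k)]
    centre = sum(data) / n
    var0 = sum((x - centre) ** 2 for x in data) / n
    comps = [(1.0 / k, m, var0) for m in means]
    for _ in range(iters):
        # E-step: posterior responsibility of each component for each frame.
        resp = []
        for x in data:
            logs = [math.log(w) + log_gauss(x, m, v) for w, m, v in comps]
            total = logsumexp(logs)
            resp.append([math.exp(l - total) for l in logs])
        # M-step: re-estimate weights, means, and variances.
        new_comps = []
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-9)
            mean = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var = sum(r[j] * (x - mean) ** 2 for r, x in zip(resp, data)) / nj
            new_comps.append((nj / n, mean, max(var, 1e-3)))
        comps = new_comps
    return comps


def classify(frames, models):
    """Return the label whose GMM gives the highest average frame log-likelihood."""
    def score(label):
        return sum(frame_loglik(x, models[label]) for x in frames) / len(frames)
    return max(models, key=score)


# Hypothetical synthetic data: one arousal-related feature whose mean
# rises with the stress level (values chosen only for illustration).
LEVEL_MEANS = {1: 0.0, 2: 3.0, 3: 6.0}
random.seed(0)
models = {
    level: fit_gmm([random.gauss(mu, 1.0) for _ in range(300)])
    for level, mu in LEVEL_MEANS.items()
}
```

Calling `classify(frames, models)` on a list of per-frame feature values returns the label of the best-scoring model. A real system would use multidimensional acoustic features and would evaluate seen and unseen speakers separately, as the abstract does.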

Metadata
Title
Stress, Arousal, and Stress Detector Trained on Acted Speech Database
Authors
Róbert Sabo
Milan Rusko
Andrej Ridzik
Jakub Rajčáni
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-43958-7_82
