Skip to main content

2017 | OriginalPaper | Buchkapitel

Detecting Depression in Speech Under Different Speaking Styles and Emotional Valences

verfasst von : Zhenyu Liu, Bin Hu, Xiaoyu Li, Fei Liu, Gang Wang, Jing Yang

Erschienen in: Brain Informatics

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Detecting depression in speech is a hot topic in recent years. Some inconsistent results in previous researches imply a few important influence factors are ignored. In this paper, we investigated a sample of 184 subjects (108 females, 76 males) to examine the influence of speaking style and emotional valence on depression detection. First, classification accuracy was used to measure the influence of these two factors. Then, two-way analysis of variance was employed to determine interactive acoustical features. Finally, normalized features by subtracting got higher classification accuracies. Results show that both speaking style and emotional valence are important factors. Spontaneous speech is better than automatic speech and neutral is the best choice among three emotional valences in depression detection. Normalized features improve the detection performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Toh, W.L., Thomas, N., Rossell, S.L.: Auditory verbal hallucinations in bipolar disorder (BD) and major depressive disorder (MDD): a systematic review. J. Affect. Disord. 184, 18–28 (2015)CrossRef Toh, W.L., Thomas, N., Rossell, S.L.: Auditory verbal hallucinations in bipolar disorder (BD) and major depressive disorder (MDD): a systematic review. J. Affect. Disord. 184, 18–28 (2015)CrossRef
2.
Zurück zum Zitat Zhang, Y., Zhang, C., Yuan, G., Yao, J., Cheng, Z., Liu, C., et al.: Effect of tryptophan hydroxylase-2 rs7305115 SNP on suicide attempts risk in major depression. Behav. Brain Funct. 6, 1 (2010)CrossRef Zhang, Y., Zhang, C., Yuan, G., Yao, J., Cheng, Z., Liu, C., et al.: Effect of tryptophan hydroxylase-2 rs7305115 SNP on suicide attempts risk in major depression. Behav. Brain Funct. 6, 1 (2010)CrossRef
3.
Zurück zum Zitat Angeleri, F., Angeleri, V.A., Foschi, N., Giaquinto, S., Nolfe, G.: The influence of depression, social activity, and family stress on functional outcome after stroke. Stroke 24, 1478–1483 (1993)CrossRef Angeleri, F., Angeleri, V.A., Foschi, N., Giaquinto, S., Nolfe, G.: The influence of depression, social activity, and family stress on functional outcome after stroke. Stroke 24, 1478–1483 (1993)CrossRef
5.
Zurück zum Zitat Zumg, W., Richards, C., Short, M.: Self-rating depression scale in an outpatient clinic: further validation of the SDS. Arch. Gen. Psychiatry 13, 508–515 (1965)CrossRef Zumg, W., Richards, C., Short, M.: Self-rating depression scale in an outpatient clinic: further validation of the SDS. Arch. Gen. Psychiatry 13, 508–515 (1965)CrossRef
6.
Zurück zum Zitat Cohn, J.F., Kruez, T.S., Matthews, I., Yang, Y., Nguyen, M.H., Padilla, M.T., et al.: Detecting depression from facial actions and vocal prosody. In: 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops. ACII 2009, pp. 1–7 (2009) Cohn, J.F., Kruez, T.S., Matthews, I., Yang, Y., Nguyen, M.H., Padilla, M.T., et al.: Detecting depression from facial actions and vocal prosody. In: 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops. ACII 2009, pp. 1–7 (2009)
7.
Zurück zum Zitat Cummins, N., Epps, J., Breakspear, M., Goecke, R.: An investigation of depressed speech detection: features and normalization. In: Interspeech, pp. 2997–3000 (2011) Cummins, N., Epps, J., Breakspear, M., Goecke, R.: An investigation of depressed speech detection: features and normalization. In: Interspeech, pp. 2997–3000 (2011)
8.
Zurück zum Zitat Mundt, J.C., Snyder, P.J., Cannizzaro, M.S., Chappie, K., Geralts, D.S.: Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J. Neurolinguist. 20, 50–64 (2007)CrossRef Mundt, J.C., Snyder, P.J., Cannizzaro, M.S., Chappie, K., Geralts, D.S.: Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J. Neurolinguist. 20, 50–64 (2007)CrossRef
9.
Zurück zum Zitat Scherer, S., Stratou, G., Morency, L.-P.: Audiovisual behavior descriptors for depression assessment. In: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, pp. 135–140 (2013) Scherer, S., Stratou, G., Morency, L.-P.: Audiovisual behavior descriptors for depression assessment. In: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, pp. 135–140 (2013)
10.
Zurück zum Zitat Kupfer, D., Foster, F.G.: Interval between onset of sleep and rapid-eye-movement sleep as an indicator of depression. Lancet 300, 684–686 (1972)CrossRef Kupfer, D., Foster, F.G.: Interval between onset of sleep and rapid-eye-movement sleep as an indicator of depression. Lancet 300, 684–686 (1972)CrossRef
11.
Zurück zum Zitat Davidson, R.J., Pizzagalli, D., Nitschke, J.B., Putnam, K.: Depression: perspectives from affective neuroscience. Annu. Rev. Psychol. 53, 545–574 (2002)CrossRef Davidson, R.J., Pizzagalli, D., Nitschke, J.B., Putnam, K.: Depression: perspectives from affective neuroscience. Annu. Rev. Psychol. 53, 545–574 (2002)CrossRef
12.
Zurück zum Zitat Ozdas, A., Shiavi, R.G., Silverman, S.E., Silverman, M.K., Wilkes, D.M.: Investigation of vocal jitter and glottal flow spectrum as possible cues for depression and near-term suicidal risk. IEEE Trans. Biomed. Eng. 51, 1530–1540 (2004)CrossRef Ozdas, A., Shiavi, R.G., Silverman, S.E., Silverman, M.K., Wilkes, D.M.: Investigation of vocal jitter and glottal flow spectrum as possible cues for depression and near-term suicidal risk. IEEE Trans. Biomed. Eng. 51, 1530–1540 (2004)CrossRef
13.
Zurück zum Zitat Mundt, J.C., Vogel, A.P., Feltner, D.E., Lenderking, W.R.: Vocal acoustic biomarkers of depression severity and treatment response. Biol. Psychiatry 72, 580–587 (2012)CrossRef Mundt, J.C., Vogel, A.P., Feltner, D.E., Lenderking, W.R.: Vocal acoustic biomarkers of depression severity and treatment response. Biol. Psychiatry 72, 580–587 (2012)CrossRef
14.
Zurück zum Zitat Nilsonne, A., Sundberg, J., Ternstrom, S., Askenfelt, A.: Measuring the rate of change of voice fundamental frequency in fluent speech during mental depression. J. Acoust. Soc. Am. 83, 716–728 (1988)CrossRef Nilsonne, A., Sundberg, J., Ternstrom, S., Askenfelt, A.: Measuring the rate of change of voice fundamental frequency in fluent speech during mental depression. J. Acoust. Soc. Am. 83, 716–728 (1988)CrossRef
15.
Zurück zum Zitat Moore, E., Clements, M.A., Peifer, J.W., Weisser, L.: Critical analysis of the impact of glottal features in the classification of clinical depression in speech. IEEE Trans. Biomed. Eng. 55, 96–107 (2008)CrossRef Moore, E., Clements, M.A., Peifer, J.W., Weisser, L.: Critical analysis of the impact of glottal features in the classification of clinical depression in speech. IEEE Trans. Biomed. Eng. 55, 96–107 (2008)CrossRef
16.
Zurück zum Zitat Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Gedeon, T., Breakspear, M., et al.: A comparative study of different classifiers for detecting depression from spontaneous speech. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8022–8026 (2013) Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Gedeon, T., Breakspear, M., et al.: A comparative study of different classifiers for detecting depression from spontaneous speech. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8022–8026 (2013)
17.
Zurück zum Zitat Ooi, K.E.B., Lech, M., Allen, N.B.: Multichannel weighted speech classification system for prediction of major depression in adolescents. IEEE Trans. Biomed. Eng. 60, 497–506 (2013)CrossRef Ooi, K.E.B., Lech, M., Allen, N.B.: Multichannel weighted speech classification system for prediction of major depression in adolescents. IEEE Trans. Biomed. Eng. 60, 497–506 (2013)CrossRef
18.
Zurück zum Zitat Cummins, N., Epps, J., Ambikairajah, E.: Spectro-temporal analysis of speech affected by depression and psychomotor retardation. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 7542–7546 (2013) Cummins, N., Epps, J., Ambikairajah, E.: Spectro-temporal analysis of speech affected by depression and psychomotor retardation. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 7542–7546 (2013)
19.
Zurück zum Zitat Kinnunen, T., Lee, K.-A., Li, H.: Dimension reduction of the modulation spectrogram for speaker verification. In: Odyssey, p. 30 (2008) Kinnunen, T., Lee, K.-A., Li, H.: Dimension reduction of the modulation spectrogram for speaker verification. In: Odyssey, p. 30 (2008)
20.
Zurück zum Zitat Breznitz, Z., Share, D.L.: Effects of accelerated reading rate on memory for text. J. Educ. Psychol. 84, 193 (1992)CrossRef Breznitz, Z., Share, D.L.: Effects of accelerated reading rate on memory for text. J. Educ. Psychol. 84, 193 (1992)CrossRef
21.
Zurück zum Zitat Alpert, M., Pouget, E.R., Silva, R.R.: Reflections of depression in acoustic measures of the patient’s speech. J. Affect. Disord. 66, 59–69 (2001)CrossRef Alpert, M., Pouget, E.R., Silva, R.R.: Reflections of depression in acoustic measures of the patient’s speech. J. Affect. Disord. 66, 59–69 (2001)CrossRef
22.
Zurück zum Zitat Calev, A., Nigal, D., Chazan, S.: Retrieval from semantic memory using meaningful and meaningless constructs by depressed, stable bipolar and manic patients. Br. J. Clin. Psychol. 28, 67–73 (1989)CrossRef Calev, A., Nigal, D., Chazan, S.: Retrieval from semantic memory using meaningful and meaningless constructs by depressed, stable bipolar and manic patients. Br. J. Clin. Psychol. 28, 67–73 (1989)CrossRef
23.
Zurück zum Zitat Vanger, P., Summerfield, A.B., Rosen, B., Watson, J.: Effects of communication content on speech behavior of depressives. Compr. Psychiatry 33, 39–41 (1992)CrossRef Vanger, P., Summerfield, A.B., Rosen, B., Watson, J.: Effects of communication content on speech behavior of depressives. Compr. Psychiatry 33, 39–41 (1992)CrossRef
24.
Zurück zum Zitat Shankayi, R., Vali, M., Salimi, M., Malekshahi, M.: Identifying depressed from healthy cases using speech processing. In: 19th Iranian Conference of Biomedical Engineering (ICBME), pp. 191–194 (2012) Shankayi, R., Vali, M., Salimi, M., Malekshahi, M.: Identifying depressed from healthy cases using speech processing. In: 19th Iranian Conference of Biomedical Engineering (ICBME), pp. 191–194 (2012)
25.
Zurück zum Zitat Kroenke, K., Spitzer, R.L., Williams, J.B.: The PHQ-9: validity of a brief depression severity measure. J. Gen. Intern. Med. 16, 606–613 (2001)CrossRef Kroenke, K., Spitzer, R.L., Williams, J.B.: The PHQ-9: validity of a brief depression severity measure. J. Gen. Intern. Med. 16, 606–613 (2001)CrossRef
26.
Zurück zum Zitat Hamilton, M.: A rating scale for depression. J. Neurol. Neurosurg. Psychiatry 23, 56–62 (1960)CrossRef Hamilton, M.: A rating scale for depression. J. Neurol. Neurosurg. Psychiatry 23, 56–62 (1960)CrossRef
29.
Zurück zum Zitat Gong, X., Huang, Y., Wang, Y., Luo, Y.: Revision of the Chinese facial affective picture system. Chin. Ment. Health J. 25, 40–46 (2011) Gong, X., Huang, Y., Wang, Y., Luo, Y.: Revision of the Chinese facial affective picture system. Chin. Ment. Health J. 25, 40–46 (2011)
30.
Zurück zum Zitat Martinot, M.-L.P., Bragulat, V., Artiges, E., Dollé, F., Hinnen, F., Jouvent, R., et al.: Decreased presynaptic dopamine function in the left caudate of depressed patients with affective flattening and psychomotor retardation. Am. J. Psychiatry 158, 314–316 (2001)CrossRef Martinot, M.-L.P., Bragulat, V., Artiges, E., Dollé, F., Hinnen, F., Jouvent, R., et al.: Decreased presynaptic dopamine function in the left caudate of depressed patients with affective flattening and psychomotor retardation. Am. J. Psychiatry 158, 314–316 (2001)CrossRef
31.
Zurück zum Zitat Clark, L., Chamberlain, S.R., Sahakian, B.J.: Neurocognitive mechanisms in depression: implications for treatment. Annu. Rev. Neurosci. 32, 57–74 (2009)CrossRef Clark, L., Chamberlain, S.R., Sahakian, B.J.: Neurocognitive mechanisms in depression: implications for treatment. Annu. Rev. Neurosci. 32, 57–74 (2009)CrossRef
32.
Zurück zum Zitat Hönig, F., Batliner, A., Nöth, E., Schnieder, S., Krajewski, J.: Automatic modelling of depressed speech: relevant features and relevance of gender. In: Fifteenth Annual Conference of the International Speech Communication Association, pp. 1248–1252 (2014) Hönig, F., Batliner, A., Nöth, E., Schnieder, S., Krajewski, J.: Automatic modelling of depressed speech: relevant features and relevance of gender. In: Fifteenth Annual Conference of the International Speech Communication Association, pp. 1248–1252 (2014)
33.
Zurück zum Zitat Smolak, L., Munstertieger, B.F.: The relationship of gender and voice to depression and eating disorders. Psychol. Women Q. 26, 234–241 (2002)CrossRef Smolak, L., Munstertieger, B.F.: The relationship of gender and voice to depression and eating disorders. Psychol. Women Q. 26, 234–241 (2002)CrossRef
34.
Zurück zum Zitat Low, L.S.A., Maddage, N.C., Lech, M., Sheeber, L.B., Allen, N.B.: Detection of clinical depression in adolescents’ speech during family interactions. IEEE Trans. Biomed. Eng. 58, 574–586 (2011)CrossRef Low, L.S.A., Maddage, N.C., Lech, M., Sheeber, L.B., Allen, N.B.: Detection of clinical depression in adolescents’ speech during family interactions. IEEE Trans. Biomed. Eng. 58, 574–586 (2011)CrossRef
35.
Zurück zum Zitat Ellgring, H., Scherer, K.R.: Vocal indicators of mood change in depression. J. Nonverbal Behav. 20, 83–110 (1996)CrossRef Ellgring, H., Scherer, K.R.: Vocal indicators of mood change in depression. J. Nonverbal Behav. 20, 83–110 (1996)CrossRef
Metadaten
Titel
Detecting Depression in Speech Under Different Speaking Styles and Emotional Valences
verfasst von
Zhenyu Liu
Bin Hu
Xiaoyu Li
Fei Liu
Gang Wang
Jing Yang
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-70772-3_25