Skip to main content
Top

2014 | OriginalPaper | Chapter

Physiological and Cognitive Status Monitoring on the Base of Acoustic-Phonetic Speech Parameters

Authors : Gábor Kiss, Klára Vicsi

Published in: Statistical Language and Speech Processing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper the development of an online monitoring system is shown in order to track physiological and cognitive condition of crew members of the Concordia Research Station in Antarctica, with specific regard to depression. Follow-up studies were carried out on recorded speech material in such a way that segmental and supra-segmental speech parameters were measured for individual researchers weakly, and the changes of these parameters were detected over time. Two kind of speech were recorded weekly by crew members in their mother tongue: a diary and a tale (“North Wind and The Sun”). An automatic language independent program was used to segment the records in phoneme level for the measurements. Such a way Concordia Speech Databases were constructed. Those acoustic-phonetic parameters were selected for the follow up study at Concordia, which parameters were statistically selected during a research on the base of the analysis of Seasonal Affective Disorder Databases gathered separately in Europe.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Goberman, A.M.: Correlation between acoustic speech characteristics and non-speech motor performance in Parkinson disease. Med. Sci. Monit. 11(3), 109–116 (2005) Goberman, A.M.: Correlation between acoustic speech characteristics and non-speech motor performance in Parkinson disease. Med. Sci. Monit. 11(3), 109–116 (2005)
2.
go back to reference Goberman, A.M., McMillan, J.: Relative speech timing in Parkinson’s disease. Commun. Sci. Disord. 32, 22–29 (2005) Goberman, A.M., McMillan, J.: Relative speech timing in Parkinson’s disease. Commun. Sci. Disord. 32, 22–29 (2005)
3.
go back to reference Metter, E.J., Hanson, W.R.: Clinical and acoustical variability in hypokinetic dysarthria. J. Commun. Disord. 19(5), 347–366 (1986)CrossRef Metter, E.J., Hanson, W.R.: Clinical and acoustical variability in hypokinetic dysarthria. J. Commun. Disord. 19(5), 347–366 (1986)CrossRef
4.
go back to reference Doyle, P., Raade, A., Pierre, A., Desai, S.: Fundamental frequency and acoustic variability associated with production of sustained vowels by speakers with hypokinetic dysarthria. J. Med. Speech. Lang. Pathol. 3, 41–50 (1995) Doyle, P., Raade, A., Pierre, A., Desai, S.: Fundamental frequency and acoustic variability associated with production of sustained vowels by speakers with hypokinetic dysarthria. J. Med. Speech. Lang. Pathol. 3, 41–50 (1995)
5.
go back to reference McNeil, M.R., Rosenbeck, J.C., Aronson, A.E.: The Dysarthrias: Physiology, Acoustics, Perception, Management. College Hill Press, San Diego (1984) McNeil, M.R., Rosenbeck, J.C., Aronson, A.E.: The Dysarthrias: Physiology, Acoustics, Perception, Management. College Hill Press, San Diego (1984)
6.
go back to reference Logemann, J.A., Fisher, H.B.: Vocal tract control in Parkinson’s disease: phonetic feature analysis of misarticulations. J. Speech Hear. Disord. 46(4), 348–352 (1981)CrossRef Logemann, J.A., Fisher, H.B.: Vocal tract control in Parkinson’s disease: phonetic feature analysis of misarticulations. J. Speech Hear. Disord. 46(4), 348–352 (1981)CrossRef
7.
go back to reference Ackermann, H., Konczak, J., Hertrich, I.: The temporal control of repetitive articulatory movements in Parkinson’s disease. Brain Lang. 56(2), 312–319 (1997)CrossRef Ackermann, H., Konczak, J., Hertrich, I.: The temporal control of repetitive articulatory movements in Parkinson’s disease. Brain Lang. 56(2), 312–319 (1997)CrossRef
8.
go back to reference Kent, R.D.: Acoustic Analysis of Speech, 2nd edn. Singular, San Diego (2001) Kent, R.D.: Acoustic Analysis of Speech, 2nd edn. Singular, San Diego (2001)
9.
go back to reference Lieberman, P., Morey, A., Hochstadt, J., Larson, M., Mather, S.: Mount Everest: a space analogue for speech, monitoring of cognitive deficits and stress. Aviat. Space Environ. Med. 76(6, Section II), 198–207 (2005) Lieberman, P., Morey, A., Hochstadt, J., Larson, M., Mather, S.: Mount Everest: a space analogue for speech, monitoring of cognitive deficits and stress. Aviat. Space Environ. Med. 76(6, Section II), 198–207 (2005)
10.
go back to reference Ivry, R.B., Justus, T.C., Middleton, C.: The cerebellum, timing, and language: implications for the study of dyslexia. In: Wolf, M. (ed.) Dyslexia Fluency and the Brain, pp. 198–211. York Press, Timonium (2001) Ivry, R.B., Justus, T.C., Middleton, C.: The cerebellum, timing, and language: implications for the study of dyslexia. In: Wolf, M. (ed.) Dyslexia Fluency and the Brain, pp. 198–211. York Press, Timonium (2001)
11.
go back to reference Esposito, A., Bourbakis, N.: The role of timing in speech perception and speech production processes and its effects on language impaired individuals. In: Sixth Symposium on BioInformatics and BioEngineering (BIBE’06), pp. 348–356. IEEE Computer Society (2006) Esposito, A., Bourbakis, N.: The role of timing in speech perception and speech production processes and its effects on language impaired individuals. In: Sixth Symposium on BioInformatics and BioEngineering (BIBE’06), pp. 348–356. IEEE Computer Society (2006)
12.
go back to reference Vicsi, K., Sztahó, D.: Problems of the automatic emotion recognitions in spontaneous speech; an example for the recognition in a dispatcher center. In: Esposito, A., Esposito, A.M., Martone, R., Müller, V.C., Scarpetta, G. (eds.) COST 2010. LNCS, vol. 6456, pp. 331–339. Springer, Heidelberg (2011)CrossRef Vicsi, K., Sztahó, D.: Problems of the automatic emotion recognitions in spontaneous speech; an example for the recognition in a dispatcher center. In: Esposito, A., Esposito, A.M., Martone, R., Müller, V.C., Scarpetta, G. (eds.) COST 2010. LNCS, vol. 6456, pp. 331–339. Springer, Heidelberg (2011)CrossRef
13.
go back to reference Tóth, S.L., Sztahó, D., Vicsi, K.: Speech emotion perception by human and machine. In: Esposito, A., Bourbakis, N.G., Avouris, N., Hatzilygeroudis, I. (eds.) HH and HM Interaction. LNCS (LNAI), vol. 5042, pp. 213–224. Springer, Heidelberg (2008)CrossRef Tóth, S.L., Sztahó, D., Vicsi, K.: Speech emotion perception by human and machine. In: Esposito, A., Bourbakis, N.G., Avouris, N., Hatzilygeroudis, I. (eds.) HH and HM Interaction. LNCS (LNAI), vol. 5042, pp. 213–224. Springer, Heidelberg (2008)CrossRef
14.
go back to reference France, D.J., Shiavi, R.G., Silverman, S., Silverman, M., Wilkes, D.M.: Acoustical properties of speech as indicators of depression and suicidal risk. IEEE Trans. Biomed. Eng. 47, 829–837 (2000)CrossRef France, D.J., Shiavi, R.G., Silverman, S., Silverman, M., Wilkes, D.M.: Acoustical properties of speech as indicators of depression and suicidal risk. IEEE Trans. Biomed. Eng. 47, 829–837 (2000)CrossRef
15.
go back to reference Cannizzaro, M., Harel, B., Reilly, N., Chappell, P., Snyder, P.J.: Voice acoustical measurement of the severity of major depression. Brain Cogn. 56, 30–35 (2004)CrossRef Cannizzaro, M., Harel, B., Reilly, N., Chappell, P., Snyder, P.J.: Voice acoustical measurement of the severity of major depression. Brain Cogn. 56, 30–35 (2004)CrossRef
16.
go back to reference Cannizzaro, M., Reilly, N., Mundt, J.C., Snyder, P.J.: Remote capture of human voice acoustical data by telephone: a methods study. Clin. Linguist. Phon. 19, 649–658 (2005)CrossRef Cannizzaro, M., Reilly, N., Mundt, J.C., Snyder, P.J.: Remote capture of human voice acoustical data by telephone: a methods study. Clin. Linguist. Phon. 19, 649–658 (2005)CrossRef
17.
go back to reference Garcia-toro, M., Talavera, J.A., Saiz-Ruiz, J., Gonzalez, A.: Prosody impairment in depression measured through acoustic analysis. J. Nerv. Ment. Dis. 188, 824–829 (2000)CrossRef Garcia-toro, M., Talavera, J.A., Saiz-Ruiz, J., Gonzalez, A.: Prosody impairment in depression measured through acoustic analysis. J. Nerv. Ment. Dis. 188, 824–829 (2000)CrossRef
18.
go back to reference Abela, J.R.Z., D’Allesandro, D.U.: Beck’s cognitive theory of depression: the diathesis-stress and causal mediation components. Br. J. Clin. Psychol. 41, 111–128 (2002)CrossRef Abela, J.R.Z., D’Allesandro, D.U.: Beck’s cognitive theory of depression: the diathesis-stress and causal mediation components. Br. J. Clin. Psychol. 41, 111–128 (2002)CrossRef
19.
go back to reference Kiss G., Sztahó D., Vicsi K.: Language independent automatic speech segmentation into phoneme-like units on the base of acoustic distinctive features. In: 4th IEEE International Conference on Cognitive Infococommunications - CogInfoCom 2013, Budapest, Hungary, 2–6 Dec 2013, pp. 579-582. IEEE Press, Piscataway (2013). ISBN: 978-1-4799-1-1543-9, IEEE Catalog Number: CFP1326R-PRT Kiss G., Sztahó D., Vicsi K.: Language independent automatic speech segmentation into phoneme-like units on the base of acoustic distinctive features. In: 4th IEEE International Conference on Cognitive Infococommunications - CogInfoCom 2013, Budapest, Hungary, 2–6 Dec 2013, pp. 579-582. IEEE Press, Piscataway (2013). ISBN: 978-1-4799-1-1543-9, IEEE Catalog Number: CFP1326R-PRT
20.
go back to reference Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Breakspear, M., Parker, G.: Detecting depression – a comparison between spontaneous and read speech. In: 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2013) Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Breakspear, M., Parker, G.: Detecting depression – a comparison between spontaneous and read speech. In: 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2013)
21.
go back to reference Helfer, B.S., Quatieri, T.F., Williamson, J.R., Mehta, D.D., Horwitz, R., Yu, B.: Classification of depression state based on articulatory precision. In: 14th Annual Conference of the International Speech Communication Association (2013) Helfer, B.S., Quatieri, T.F., Williamson, J.R., Mehta, D.D., Horwitz, R., Yu, B.: Classification of depression state based on articulatory precision. In: 14th Annual Conference of the International Speech Communication Association (2013)
22.
go back to reference Mundt, J.C., Snyder, P.J., Cannizzaro, M.S., Chappie, K., Geralts, D.S.: Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J. Neurolinguist. 20, 50–64 (2007)CrossRef Mundt, J.C., Snyder, P.J., Cannizzaro, M.S., Chappie, K., Geralts, D.S.: Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J. Neurolinguist. 20, 50–64 (2007)CrossRef
Metadata
Title
Physiological and Cognitive Status Monitoring on the Base of Acoustic-Phonetic Speech Parameters
Authors
Gábor Kiss
Klára Vicsi
Copyright Year
2014
DOI
https://doi.org/10.1007/978-3-319-11397-5_9

Premium Partner