Published in: Cognitive Computation 4/2013

1 December 2013

Improving Automatic Detection of Obstructive Sleep Apnea Through Nonlinear Analysis of Sustained Speech

Authors: José Luis Blanco, Luis A. Hernández, Rubén Fernández, Daniel Ramos

Abstract

We present a novel approach for detecting severe obstructive sleep apnea (OSA) from patients’ voices, introducing nonlinear measures to describe the dynamics of sustained speech. Nonlinear features were combined with state-of-the-art speech recognition systems that apply statistical modeling techniques (Gaussian mixture models, GMMs) to a cepstral parameterization (MFCC) of both continuous and sustained speech. Tests were performed on a database of speech recordings from both severe OSA patients and control speakers. Combining MFCC-GMM and nonlinear features gave a 10% relative reduction in classification error for sustained speech, and fusing nonlinear features with both sustained and continuous MFCC-GMM gave a 33% relative reduction. Accuracy reached 88.5%, allowing the system to be used in early detection of OSA. Tests showed that nonlinear features and MFCCs are weakly correlated on sustained speech but uncorrelated on continuous speech. The results also suggest the existence of nonlinear effects in OSA patients’ voices, which should be found in continuous speech.
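
The relative reductions in classification error quoted in the abstract follow a standard definition: the drop in error rate divided by the baseline error rate. The minimal Python sketch below shows that arithmetic. The function names and the example error rates are hypothetical and chosen only for illustration; they are not results or code from the paper.

```python
def error_from_accuracy(accuracy: float) -> float:
    """Convert an accuracy in [0, 1] to the corresponding error rate."""
    if not 0.0 <= accuracy <= 1.0:
        raise ValueError("accuracy must lie in [0, 1]")
    return 1.0 - accuracy


def relative_error_reduction(baseline_error: float, new_error: float) -> float:
    """Fraction by which new_error is lower than baseline_error."""
    if baseline_error <= 0.0:
        raise ValueError("baseline_error must be positive")
    return (baseline_error - new_error) / baseline_error


if __name__ == "__main__":
    # Hypothetical error rates for illustration only (not taken from the paper).
    baseline = 0.20  # assumed error of a baseline classifier
    combined = 0.18  # assumed error after adding extra features
    reduction = relative_error_reduction(baseline, combined)
    print(f"relative error reduction: {reduction:.0%}")
```

Note that a relative reduction is always stated with respect to a specific baseline, so the 10% and 33% figures in the abstract can only be converted to absolute error rates if the corresponding baseline errors are known.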

Metadata
Title
Improving Automatic Detection of Obstructive Sleep Apnea Through Nonlinear Analysis of Sustained Speech
Authors
José Luis Blanco
Luis A. Hernández
Rubén Fernández
Daniel Ramos
Publication date
1 December 2013
Publisher
Springer US
Published in
Cognitive Computation / Issue 4/2013
Print ISSN: 1866-9956
Electronic ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-012-9168-x
