Skip to main content
Top

2020 | OriginalPaper | Chapter

Dysphonia Measurements Detection Using CQT’s and MFCC’s Methods

Authors : Mario Lopez-Rodríguez, Mireya Sarai García-Vázquez, Luis Miguel Zamudio-Fuentes, Alejandro Ramírez-Acosta

Published in: Future Trends in Biomedical and Health Informatics and Cybersecurity in Medical Devices

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Dysphonia is a vocal impediment that appears as a symptom of Parkinson’s disease, and can be used for its diagnosis. Among the important measurements for dysphonia detection are jitter, shimmer, fundamental frequency (F0), Harmonics to noise ratio (HNR) and noise to harmonics ratio (NHR). The frequency space of the speech signal is used to detect these five dysphonia measurements, through this space the acoustic markers jitter, shimmer and F0 are calculated. In this article, an evaluation of the detection of acoustic markers is presented through the mathematical methods of the Constant Q Transform (CQT) and the Mel Frequencies Cepstral Coefficients (MFCC) in speech signals of patients with Parkinson’s disease. The classifier method Support Vector Machine (SVM) is used to detect the Biomarkers. According to the results, the CQT method and MFCC method (57% and 62% precision respectively) which is a promising results for Parkinson’s disease diagnosis by the detection of Dysphonia measurements.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Tsanas, A., Little, M.A., McSharry, P.E., et al.: Novel speech signal processing algorithms for high-accuracy classification of Parkinson’s disease. IEEE Trans. Biomed. Eng. 59(5), 1264–1271 (2012)CrossRef Tsanas, A., Little, M.A., McSharry, P.E., et al.: Novel speech signal processing algorithms for high-accuracy classification of Parkinson’s disease. IEEE Trans. Biomed. Eng. 59(5), 1264–1271 (2012)CrossRef
2.
go back to reference Hornykiewicz, O.: Biochemical aspects of Parkinson’s disease. Neurology 51(2 Suppl 2), S2–S9 (1998)CrossRef Hornykiewicz, O.: Biochemical aspects of Parkinson’s disease. Neurology 51(2 Suppl 2), S2–S9 (1998)CrossRef
3.
go back to reference Frid, A., Kantor, A., Svechin, D., et al.: Diagnosis of Parkinson’s disease from continuous speech using deep convolutional networks without manual selection of features. In: IEEE International Conference on the Science of Electrical Engineering (ICSEE). IEEE (1998) Frid, A., Kantor, A., Svechin, D., et al.: Diagnosis of Parkinson’s disease from continuous speech using deep convolutional networks without manual selection of features. In: IEEE International Conference on the Science of Electrical Engineering (ICSEE). IEEE (1998)
4.
go back to reference Dorsey, E.R., George, B.P., Leff, B., et al.: The coming crisis Obtaining care for the growing burden of neurodegenerative conditions. Neurology 10-1212 (2013) Dorsey, E.R., George, B.P., Leff, B., et al.: The coming crisis Obtaining care for the growing burden of neurodegenerative conditions. Neurology 10-1212 (2013)
5.
go back to reference Frid, A., Safra, E.J., Hazan, H., Lokey, L.L., et al.: Computational diagnosis of Parkinson’s Disease directly from natural speech using machine learning techniques. In: 2014 IEEE International Conference on Software Science, Technology and Engineering (SWSTE), pp. 50–53. IEEE, June 2014 Frid, A., Safra, E.J., Hazan, H., Lokey, L.L., et al.: Computational diagnosis of Parkinson’s Disease directly from natural speech using machine learning techniques. In: 2014 IEEE International Conference on Software Science, Technology and Engineering (SWSTE), pp. 50–53. IEEE, June 2014
6.
go back to reference Little, M.A., McSharry, P.E., Hunter, E.J., et al.: Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE Trans. Biomed. Eng. 56(4), 1015–1022 (2009)CrossRef Little, M.A., McSharry, P.E., Hunter, E.J., et al.: Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE Trans. Biomed. Eng. 56(4), 1015–1022 (2009)CrossRef
7.
go back to reference Little, M., McSharry, P., Moroz, I., et al.: Nonlinear, biophysically-informed speech pathology detection. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006 Proceedings, vol. 2, pp. II–II (2006) Little, M., McSharry, P., Moroz, I., et al.: Nonlinear, biophysically-informed speech pathology detection. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006 Proceedings, vol. 2, pp. II–II (2006)
8.
go back to reference Tsanas, A., Little, M.A., Fox, C., et al.: Objective automatic assessment of rehabilitative speech treatment in parkinson’s disease. IEEE Trans. Neural Syst. Rehabil. Eng. 22(1), 181–190 (2014)CrossRef Tsanas, A., Little, M.A., Fox, C., et al.: Objective automatic assessment of rehabilitative speech treatment in parkinson’s disease. IEEE Trans. Neural Syst. Rehabil. Eng. 22(1), 181–190 (2014)CrossRef
9.
go back to reference Tsanas, A., Little, M.A., McSharry, P.E., et al.: Accurate telemonitoring of Parkinson’s disease progression by noninvasive speech tests. IEEE Trans. Biomed. Eng. 57(4), 884–893 (2010)CrossRef Tsanas, A., Little, M.A., McSharry, P.E., et al.: Accurate telemonitoring of Parkinson’s disease progression by noninvasive speech tests. IEEE Trans. Biomed. Eng. 57(4), 884–893 (2010)CrossRef
10.
go back to reference Tsanas, A., Little, M.A., McSharry, P.E., et al.: Enhanced classical dysphonia measures and sparse regression for telemonitoring of Parkinson’s disease progression. In: 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), pp. 594–597 (2010) Tsanas, A., Little, M.A., McSharry, P.E., et al.: Enhanced classical dysphonia measures and sparse regression for telemonitoring of Parkinson’s disease progression. In: 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), pp. 594–597 (2010)
11.
go back to reference Benba, A., Jilbab, A., Hammouch, A.: Voice analysis for detecting persons with Parkinson’s disease using MFCC and VQ. J. Theor. Appl. Inf. Technol. 70(3) (2014) Benba, A., Jilbab, A., Hammouch, A.: Voice analysis for detecting persons with Parkinson’s disease using MFCC and VQ. J. Theor. Appl. Inf. Technol. 70(3) (2014)
12.
go back to reference Benba, A., Jilbab, A., Hammouch, A.: Voice analysis for detecting patients with Parkinson’s disease using the hybridization of the best acoustic features. Int. J. Electr. Eng. Inf. 8(1), 108 (2016) Benba, A., Jilbab, A., Hammouch, A.: Voice analysis for detecting patients with Parkinson’s disease using the hybridization of the best acoustic features. Int. J. Electr. Eng. Inf. 8(1), 108 (2016)
13.
go back to reference Tsanas, A., Little, M.A., McSharry, P.E., et al.: Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson’s disease symptom severity. J. Roy. Soc. Interface 8(59), 842–855 (2011)CrossRef Tsanas, A., Little, M.A., McSharry, P.E., et al.: Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson’s disease symptom severity. J. Roy. Soc. Interface 8(59), 842–855 (2011)CrossRef
14.
go back to reference Rakotomamonjy, A., Gasso, G.: Histogram of gradients of time-frequency representations for audio scene classification. IEEE/ACM Trans. Audio Speech Lang. Process. (TASLP) 23(1), 142–153 (2015) Rakotomamonjy, A., Gasso, G.: Histogram of gradients of time-frequency representations for audio scene classification. IEEE/ACM Trans. Audio Speech Lang. Process. (TASLP) 23(1), 142–153 (2015)
15.
go back to reference Brown, J.C.: Calculation of a constant Q spectral transform. J. Acoust. Soc. Am. 89(1), 425–434 (1991)CrossRef Brown, J.C.: Calculation of a constant Q spectral transform. J. Acoust. Soc. Am. 89(1), 425–434 (1991)CrossRef
16.
go back to reference Orozco-Arroyave, J.R., Arias-Londoño, J.D., Vargas-Bonilla, J.F., et al.: New speech corpus database for the analysis of people suffering from Parkinson’s Disease. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC), Reykjavik, Iceland, pp. 342–347 (2014) Orozco-Arroyave, J.R., Arias-Londoño, J.D., Vargas-Bonilla, J.F., et al.: New speech corpus database for the analysis of people suffering from Parkinson’s Disease. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC), Reykjavik, Iceland, pp. 342–347 (2014)
18.
go back to reference Suárez, E.J.C.: Tutorial sobre máquinas de vectores soporte (sVM). Tutorial sobre Máquinas de Vectores Soporte (SVM) (2014) Suárez, E.J.C.: Tutorial sobre máquinas de vectores soporte (sVM). Tutorial sobre Máquinas de Vectores Soporte (SVM) (2014)
19.
go back to reference Nalini, N., Palanivel, S.: Music emotion recognition: the combined evidence of MFCC and residual phase. Egypt. Inf. J. 17(1), 1–10 (2016)CrossRef Nalini, N., Palanivel, S.: Music emotion recognition: the combined evidence of MFCC and residual phase. Egypt. Inf. J. 17(1), 1–10 (2016)CrossRef
Metadata
Title
Dysphonia Measurements Detection Using CQT’s and MFCC’s Methods
Authors
Mario Lopez-Rodríguez
Mireya Sarai García-Vázquez
Luis Miguel Zamudio-Fuentes
Alejandro Ramírez-Acosta
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-30636-6_48