Skip to main content
Top
Published in: Neural Computing and Applications 20/2020

04-04-2018 | S.I.: Advances in Bio-Inspired Intelligent Systems

Towards robust voice pathology detection

Investigation of supervised deep learning, gradient boosting, and anomaly detection approaches across four databases

Authors: Pavol Harar, Zoltan Galaz, Jesus B. Alonso-Hernandez, Jiri Mekyska, Radim Burget, Zdenek Smekal

Published in: Neural Computing and Applications | Issue 20/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Automatic objective non-invasive detection of pathological voice based on computerized analysis of acoustic signals can play an important role in early diagnosis, progression tracking, and even effective treatment of pathological voices. In search towards such a robust voice pathology detection system, we investigated three distinct classifiers within supervised learning and anomaly detection paradigms. We conducted a set of experiments using a variety of input data such as raw waveforms, spectrograms, mel-frequency cepstral coefficients (MFCC), and conventional acoustic (dysphonic) features (AF). In comparison with previously published works, this article is the first to utilize combination of four different databases comprising normophonic and pathological recordings of sustained phonation of the vowel /a/ unrestricted to a subset of vocal pathologies. Furthermore, to our best knowledge, this article is the first to explore gradient-boosted trees and deep learning for this application. The following best classification performances measured by F1 score on dedicated test set were achieved: XGBoost (0.733) using AF and MFCC, DenseNet (0.621) using MFCC, and Isolation Forest (0.610) using AF. Even though these results are of exploratory character, conducted experiments do show promising potential of gradient boosting and deep learning methods to robustly detect voice pathologies.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Ali Z, Alsulaiman M, Muhammad G, Elamvazuthi I, Al-nasheri A, Mesallam TA, Farahat M, Malki KH (2017) Intra-and inter-database study for arabic, english, and german databases: do conventional speech features detect voice pathology? J Voice 31(3):e381–e386CrossRef Ali Z, Alsulaiman M, Muhammad G, Elamvazuthi I, Al-nasheri A, Mesallam TA, Farahat M, Malki KH (2017) Intra-and inter-database study for arabic, english, and german databases: do conventional speech features detect voice pathology? J Voice 31(3):e381–e386CrossRef
2.
go back to reference Ali Z, Muhammad G, Alhamid MF (2017) An automatic health monitoring system for patients suffering from voice complications in smart cities. IEEE Access 5:3900–3908CrossRef Ali Z, Muhammad G, Alhamid MF (2017) An automatic health monitoring system for patients suffering from voice complications in smart cities. IEEE Access 5:3900–3908CrossRef
3.
go back to reference Al-nasheri A, Muhammad G, Alsulaiman M, Ali Z (2017) Investigation of voice pathology detection and classification on different frequency regions using correlation functions. J Voice 31(1):3–15CrossRef Al-nasheri A, Muhammad G, Alsulaiman M, Ali Z (2017) Investigation of voice pathology detection and classification on different frequency regions using correlation functions. J Voice 31(1):3–15CrossRef
5.
go back to reference Al-nasheri A, Muhammad G, Alsulaiman M, Ali Z, Mesallam TA, Farahat M, Malki KH, Bencherif MA (2017) An investigation of multidimensional voice program parameters in three different databases for voice pathology detection and classification. J Voice 31(1):113–e119CrossRef Al-nasheri A, Muhammad G, Alsulaiman M, Ali Z, Mesallam TA, Farahat M, Malki KH, Bencherif MA (2017) An investigation of multidimensional voice program parameters in three different databases for voice pathology detection and classification. J Voice 31(1):113–e119CrossRef
6.
go back to reference Al-nasheri A, Ali Z, Muhammad G, Alsulaiman M (2014) Voice pathology detection using auto-correlation of different filters bank. In: 2014 IEEE/ACS 11th international conference on computer systems and applications (AICCSA), pp 50–55. IEEE Al-nasheri A, Ali Z, Muhammad G, Alsulaiman M (2014) Voice pathology detection using auto-correlation of different filters bank. In: 2014 IEEE/ACS 11th international conference on computer systems and applications (AICCSA), pp 50–55. IEEE
7.
go back to reference Amami R, Smiti A (2017) An incremental method combining density clustering and support vector machines for voice pathology detection. Comput Electr Eng 57:257–265CrossRef Amami R, Smiti A (2017) An incremental method combining density clustering and support vector machines for voice pathology detection. Comput Electr Eng 57:257–265CrossRef
8.
go back to reference Arias-Londoño JD, Godino-Llorente JI, Markaki M, Stylianou Y (2011) On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices. Logop Phoniatr Vocol 36(2):60–69CrossRef Arias-Londoño JD, Godino-Llorente JI, Markaki M, Stylianou Y (2011) On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices. Logop Phoniatr Vocol 36(2):60–69CrossRef
9.
go back to reference Armstrong D, Gosling A, Weinman J, Marteau T (1997) The place of inter-rater reliability in qualitative research: an empirical study. Sociology 31(3):597–606CrossRef Armstrong D, Gosling A, Weinman J, Marteau T (1997) The place of inter-rater reliability in qualitative research: an empirical study. Sociology 31(3):597–606CrossRef
10.
go back to reference Brabenec L, Mekyska J, Galaz Z, Rektorova I (2017) Speech disorders in parkinsons disease: early diagnostics and effects of medication and brain stimulation. J Neural Transm 124(3):303–334CrossRef Brabenec L, Mekyska J, Galaz Z, Rektorova I (2017) Speech disorders in parkinsons disease: early diagnostics and effects of medication and brain stimulation. J Neural Transm 124(3):303–334CrossRef
12.
go back to reference Chen T, Guestrin C (2016) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 785–794. ACM Chen T, Guestrin C (2016) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 785–794. ACM
14.
go back to reference Dahmani M, Guerti M (2017) Vocal folds pathologies classification using naïve bayes networks. In: 2017 6th international conference on systems and control (ICSC), pp 426–432. IEEE Dahmani M, Guerti M (2017) Vocal folds pathologies classification using naïve bayes networks. In: 2017 6th international conference on systems and control (ICSC), pp 426–432. IEEE
15.
go back to reference De Bodt MS, Wuyts FL, Van de Heyning PH, Croux C (1997) Test–retest study of the grbas scale: influence of experience and professional background on perceptual rating of voice quality. J Voice 11(1):74–80CrossRef De Bodt MS, Wuyts FL, Van de Heyning PH, Croux C (1997) Test–retest study of the grbas scale: influence of experience and professional background on perceptual rating of voice quality. J Voice 11(1):74–80CrossRef
16.
go back to reference Dejonckere PH, Bradley P, Clemente P, Cornut G, Crevier-Buchman L, Friedrich G, Van De Heyning P, Remacle M, Woisard V (2001) A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Eur Arch Otorhinolaryngol 258(2):77–82CrossRef Dejonckere PH, Bradley P, Clemente P, Cornut G, Crevier-Buchman L, Friedrich G, Van De Heyning P, Remacle M, Woisard V (2001) A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Eur Arch Otorhinolaryngol 258(2):77–82CrossRef
18.
go back to reference Eye M, Infirmary E (1994) Voice disorders database, version. 1.03 (cd-rom). Kay Elemetrics Corporation, Lincoln Park Eye M, Infirmary E (1994) Voice disorders database, version. 1.03 (cd-rom). Kay Elemetrics Corporation, Lincoln Park
19.
go back to reference Gerratt BR, Kreiman J, Antonanzas-Barroso N, Berke GS (1993) Comparing internal and external standards in voice quality judgments. J Speech Hear Res 36(1):14–20CrossRef Gerratt BR, Kreiman J, Antonanzas-Barroso N, Berke GS (1993) Comparing internal and external standards in voice quality judgments. J Speech Hear Res 36(1):14–20CrossRef
20.
go back to reference Godino-Llorente JI, Gómez-Vilda P, Cruz-Roldán F, Blanco-Velasco M, Fraile R (2010) Pathological likelihood index as a measurement of the degree of voice normality and perceived hoarseness. J Voice 24(6):667–677CrossRef Godino-Llorente JI, Gómez-Vilda P, Cruz-Roldán F, Blanco-Velasco M, Fraile R (2010) Pathological likelihood index as a measurement of the degree of voice normality and perceived hoarseness. J Voice 24(6):667–677CrossRef
21.
go back to reference Gwet KL (2014) Handbook of inter-rater reliability: the definitive guide to measuring the extent of agreement among raters. Advanced Analytics LLC, Montgomery Gwet KL (2014) Handbook of inter-rater reliability: the definitive guide to measuring the extent of agreement among raters. Advanced Analytics LLC, Montgomery
22.
go back to reference Harar P, Alonso-Hernandezy JB, Mekyska J, Galaz Z, Burget R, Smekal Z (2017) Voice pathology detection using deep learning: a preliminary study. In: 2017 international conference and workshop on bioinspired intelligence (IWOBI), pp 1–4. IEEE Harar P, Alonso-Hernandezy JB, Mekyska J, Galaz Z, Burget R, Smekal Z (2017) Voice pathology detection using deep learning: a preliminary study. In: 2017 international conference and workshop on bioinspired intelligence (IWOBI), pp 1–4. IEEE
23.
go back to reference Hartigan JA, Wong MA (1979) Algorithm as 136: a k-means clustering algorithm. J Roy Stat Soc Ser C (Appl Stat) 28(1):100–108MATH Hartigan JA, Wong MA (1979) Algorithm as 136: a k-means clustering algorithm. J Roy Stat Soc Ser C (Appl Stat) 28(1):100–108MATH
24.
go back to reference Hearst MA, Dumais ST, Osuna E, Platt J, Scholkopf B (1998) Support vector machines. IEEE Intell Syst Their Appl 13(4):18–28CrossRef Hearst MA, Dumais ST, Osuna E, Platt J, Scholkopf B (1998) Support vector machines. IEEE Intell Syst Their Appl 13(4):18–28CrossRef
25.
go back to reference Hemmerling D (2017) Voice pathology distinction using autoassociative neural networks. In: 2017 25th European signal processing conference (EUSIPCO), pp 1844–1847. IEEE Hemmerling D (2017) Voice pathology distinction using autoassociative neural networks. In: 2017 25th European signal processing conference (EUSIPCO), pp 1844–1847. IEEE
26.
go back to reference Hemmerling D, Skalski A, Gajda J (2016) Voice data mining for laryngeal pathology assessment. Comput Biol Med 69:270–276CrossRef Hemmerling D, Skalski A, Gajda J (2016) Voice data mining for laryngeal pathology assessment. Comput Biol Med 69:270–276CrossRef
27.
go back to reference Hillenbrand J, Houde RA (1996) Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res 39(2):311–321CrossRef Hillenbrand J, Houde RA (1996) Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res 39(2):311–321CrossRef
28.
go back to reference Hossain MS, Muhammad G (2016) Healthcare big data voice pathology assessment framework. IEEE Access 4:7806–7815CrossRef Hossain MS, Muhammad G (2016) Healthcare big data voice pathology assessment framework. IEEE Access 4:7806–7815CrossRef
29.
go back to reference Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1):489–501CrossRef Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1):489–501CrossRef
30.
32.
go back to reference Kreiman J, Gerratt BR, Kempster GB, Erman A, Berke GS (1993) Perceptual evaluation of voice quality: review, tutorial, and a framework for future research. J Speech Hear Res 36(1):21–40CrossRef Kreiman J, Gerratt BR, Kempster GB, Erman A, Berke GS (1993) Perceptual evaluation of voice quality: review, tutorial, and a framework for future research. J Speech Hear Res 36(1):21–40CrossRef
33.
go back to reference Little M, McSharry P, Hunter E, Spielman J, Ramig L (2009) Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE T Bio-Med Eng 56(4):1015–1022CrossRef Little M, McSharry P, Hunter E, Spielman J, Ramig L (2009) Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE T Bio-Med Eng 56(4):1015–1022CrossRef
34.
go back to reference Liu FT, Ting KM, Zhou ZH (2008) Isolation forest. In: Eighth IEEE international conference on data mining, 2008. ICDM’08, pp 413–422. IEEE Liu FT, Ting KM, Zhou ZH (2008) Isolation forest. In: Eighth IEEE international conference on data mining, 2008. ICDM’08, pp 413–422. IEEE
35.
go back to reference Liu FT, Ting KM, Zhou ZH (2012) Isolation-based anomaly detection. ACM Trans Knowl Discov Data (TKDD) 6(1):3 Liu FT, Ting KM, Zhou ZH (2012) Isolation-based anomaly detection. ACM Trans Knowl Discov Data (TKDD) 6(1):3
36.
go back to reference Martínez D, Lleida E, Ortega A, Miguel A, Villalba J (2012) Voice pathology detection on the saarbrücken voice database with calibration and fusion of scores using multifocal toolkit. In: Advances in speech and language technologies for Iberian Languages, pp 99–109. Springer Martínez D, Lleida E, Ortega A, Miguel A, Villalba J (2012) Voice pathology detection on the saarbrücken voice database with calibration and fusion of scores using multifocal toolkit. In: Advances in speech and language technologies for Iberian Languages, pp 99–109. Springer
37.
go back to reference Mehta DD, Hillman RE (2008) Voice assessment: updates on perceptual, acoustic, aerodynamic, and endoscopic imaging methods. Curr Opin Otolaryngol Head Neck Surg 16(3):211CrossRef Mehta DD, Hillman RE (2008) Voice assessment: updates on perceptual, acoustic, aerodynamic, and endoscopic imaging methods. Curr Opin Otolaryngol Head Neck Surg 16(3):211CrossRef
38.
go back to reference Mekyska J, Janousova E, Gomez-Vilda P, Smekal Z, Rektorova I, Eliasova I, Kostalova M, Mrackova M, Alonso-Hernandez JB, Faundez-Zanuy M et al (2015) Robust and complex approach of pathological speech signal analysis. Neurocomputing 167:94–111CrossRef Mekyska J, Janousova E, Gomez-Vilda P, Smekal Z, Rektorova I, Eliasova I, Kostalova M, Mrackova M, Alonso-Hernandez JB, Faundez-Zanuy M et al (2015) Robust and complex approach of pathological speech signal analysis. Neurocomputing 167:94–111CrossRef
40.
go back to reference Mekyska J, Smekal Z, Galaz Z, Mzourek Z, Rektorova I, Faundez-Zanuy M, López-de Ipiña K (2016) Recent advances in nonlinear speech processing, chap. Perceptual features as markers of Parkinson’s disease: the issue of clinical interpretability, pp 83–91. Springer, Cham. https://doi.org/10.1007/978-3-319-28109-4_9 Mekyska J, Smekal Z, Galaz Z, Mzourek Z, Rektorova I, Faundez-Zanuy M, López-de Ipiña K (2016) Recent advances in nonlinear speech processing, chap. Perceptual features as markers of Parkinson’s disease: the issue of clinical interpretability, pp 83–91. Springer, Cham. https://​doi.​org/​10.​1007/​978-3-319-28109-4_​9
42.
go back to reference Michaelis D, Gramss T, Strube HW (1997) Glottal-to-noise excitation ratio-a new measure for describing pathological voices. Acta Acust United Acust 83(4):700–706 Michaelis D, Gramss T, Strube HW (1997) Glottal-to-noise excitation ratio-a new measure for describing pathological voices. Acta Acust United Acust 83(4):700–706
43.
go back to reference Muhammad G, Alhamid MF, Hossain MS, Almogren AS, Vasilakos AV (2017) Enhanced living by assessing voice pathology using a co-occurrence matrix. Sensors 17(2):267CrossRef Muhammad G, Alhamid MF, Hossain MS, Almogren AS, Vasilakos AV (2017) Enhanced living by assessing voice pathology using a co-occurrence matrix. Sensors 17(2):267CrossRef
44.
go back to reference Muhammad G, Alsulaiman M, Ali Z, Mesallam TA, Farahat M, Malki KH, Al-nasheri A, Bencherif MA (2017) Voice pathology detection using interlaced derivative pattern on glottal source excitation. Biomed Signal Process Control 31:156–164CrossRef Muhammad G, Alsulaiman M, Ali Z, Mesallam TA, Farahat M, Malki KH, Al-nasheri A, Bencherif MA (2017) Voice pathology detection using interlaced derivative pattern on glottal source excitation. Biomed Signal Process Control 31:156–164CrossRef
45.
go back to reference Murphy KP (2006) Naive bayes classifiers. University of British Columbia Murphy KP (2006) Naive bayes classifiers. University of British Columbia
46.
go back to reference Oates J (2009) Auditory-perceptual evaluation of disordered voice quality. Folia Phoniatr Logop 61(1):49–56CrossRef Oates J (2009) Auditory-perceptual evaluation of disordered voice quality. Folia Phoniatr Logop 61(1):49–56CrossRef
47.
go back to reference Parsa V, Jamieson DG (2003) Identification of pathological voices using glottal noise measures. J Speech Lang Hear Res 23(2):469–485CrossRef Parsa V, Jamieson DG (2003) Identification of pathological voices using glottal noise measures. J Speech Lang Hear Res 23(2):469–485CrossRef
48.
go back to reference Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830MathSciNetMATH Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830MathSciNetMATH
49.
go back to reference Pimentel MA, Clifton DA, Clifton L, Tarassenko L (2014) A review of novelty detection. Sig Process 99:215–249CrossRef Pimentel MA, Clifton DA, Clifton L, Tarassenko L (2014) A review of novelty detection. Sig Process 99:215–249CrossRef
50.
go back to reference Reynolds D (2015) Gaussian mixture models. Encyclopedia of biometrics, pp 827–832 Reynolds D (2015) Gaussian mixture models. Encyclopedia of biometrics, pp 827–832
51.
go back to reference Sabir B, Rouda F, Khazri Y, Touri B, Moussetad M (2017) Improved algorithm for pathological and normal voices identification. Int J Electr Comput Eng (IJECE) 7(1):238–243CrossRef Sabir B, Rouda F, Khazri Y, Touri B, Moussetad M (2017) Improved algorithm for pathological and normal voices identification. Int J Electr Comput Eng (IJECE) 7(1):238–243CrossRef
52.
go back to reference Saldanha JC, Ananthakrishna T, Pinto R (2014) Vocal fold pathology assessment using mel-frequency cepstral coefficients and linear predictive cepstral coefficients features. J Med Imaging Health Inf 4(2):168–173CrossRef Saldanha JC, Ananthakrishna T, Pinto R (2014) Vocal fold pathology assessment using mel-frequency cepstral coefficients and linear predictive cepstral coefficients features. J Med Imaging Health Inf 4(2):168–173CrossRef
53.
go back to reference Schalkoff RJ (1997) Artificial neural networks, vol 1. McGraw-Hill, New YorkMATH Schalkoff RJ (1997) Artificial neural networks, vol 1. McGraw-Hill, New YorkMATH
54.
go back to reference Song P (2013) Assessment of vocal cord function and voice disorders. In: Principles and practice of interventional pulmonology, pp 137–149. Springer Song P (2013) Assessment of vocal cord function and voice disorders. In: Principles and practice of interventional pulmonology, pp 137–149. Springer
55.
go back to reference Souissi N, Cherif A (2015) Dimensionality reduction for voice disorders identification system based on mel frequency cepstral coefficients and support vector machine. In: 2015 7th international conference on modelling, identification and control (ICMIC), pp 1–6. IEEE Souissi N, Cherif A (2015) Dimensionality reduction for voice disorders identification system based on mel frequency cepstral coefficients and support vector machine. In: 2015 7th international conference on modelling, identification and control (ICMIC), pp 1–6. IEEE
56.
go back to reference Souissi N, Cherif A (2016) Speech recognition system based on short-term cepstral parameters, feature reduction method and artificial neural networks. In: 2016 2nd international conference on advanced technologies for signal and image processing (ATSIP), pp 667–671. IEEE Souissi N, Cherif A (2016) Speech recognition system based on short-term cepstral parameters, feature reduction method and artificial neural networks. In: 2016 2nd international conference on advanced technologies for signal and image processing (ATSIP), pp 667–671. IEEE
57.
go back to reference Stathopoulos ET, Huber JE, Sussman JE (2011) Changes in acoustic characteristics of the voice across the life span: measures from individuals 4–93 years of age. J Speech Lang Hear Res 54(4):1011–1021CrossRef Stathopoulos ET, Huber JE, Sussman JE (2011) Changes in acoustic characteristics of the voice across the life span: measures from individuals 4–93 years of age. J Speech Lang Hear Res 54(4):1011–1021CrossRef
58.
go back to reference Teager H (1980) Some observations on oral air flow during phonation. IEEE Trans Acoust Speech Signal Process 28(5):599–601CrossRef Teager H (1980) Some observations on oral air flow during phonation. IEEE Trans Acoust Speech Signal Process 28(5):599–601CrossRef
59.
go back to reference Titze IR (1994) Principles of voice production. Prentice-Hall, Englewood Cliffs Titze IR (1994) Principles of voice production. Prentice-Hall, Englewood Cliffs
60.
go back to reference Tsanas A, Little MA, McSharry PE, Ramig LO (2010) Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson’s disease symptom severity. J R Soc Interface 8(59):842–855CrossRef Tsanas A, Little MA, McSharry PE, Ramig LO (2010) Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson’s disease symptom severity. J R Soc Interface 8(59):842–855CrossRef
61.
go back to reference Uloza V, Vegiene A, Saferis V (2015) Correlation between the quantitative video laryngostroboscopic measurements and parameters of multidimensional voice assessment. Biomed Signal Process Control 17(Suppl C):3–10CrossRef Uloza V, Vegiene A, Saferis V (2015) Correlation between the quantitative video laryngostroboscopic measurements and parameters of multidimensional voice assessment. Biomed Signal Process Control 17(Suppl C):3–10CrossRef
62.
go back to reference Woldert-Jokisz B (2007) Saarbruecken voice database Woldert-Jokisz B (2007) Saarbruecken voice database
63.
Metadata
Title
Towards robust voice pathology detection
Investigation of supervised deep learning, gradient boosting, and anomaly detection approaches across four databases
Authors
Pavol Harar
Zoltan Galaz
Jesus B. Alonso-Hernandez
Jiri Mekyska
Radim Burget
Zdenek Smekal
Publication date
04-04-2018
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 20/2020
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-018-3464-7

Other articles of this Issue 20/2020

Neural Computing and Applications 20/2020 Go to the issue

Recent Advances in Deep Learning for Medical Image Processing

Brain tumor detection: a long short-term memory (LSTM)-based learning model

S.I. : Applying Artificial Intelligence to the Internet of Things

Deep neural network-based clustering technique for secure IIoT

Premium Partner