Skip to main content

2018 | OriginalPaper | Buchkapitel

Voice Pathology Detection Using Artificial Neural Networks and Support Vector Machines Powered by a Multicriteria Optimization Algorithm

verfasst von : Henry Jhoán Areiza-Laverde, Andrés Eduardo Castro-Ospina, Diego Hernán Peluffo-Ordóñez

Erschienen in: Applied Computer Sciences in Engineering

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Computer-aided diagnosis (CAD) systems have allowed to enhance the performance of conventional, medical diagnosis procedures in different scenarios. Particularly, in the context of voice pathology detection, the use of machine learning algorithms has proved to be a promising and suitable alternative. This work proposes the implementation of two well known classification algorithms, namely artificial neural networks (ANN) and support vector machines (SVM), optimized by particle swarm optimization (PSO) algorithm, aimed at classifying voice signals between healthy and pathologic ones. Three different configurations of the Saarbrucken voice database (SVD) are used. The effect of using balanced and unbalanced versions of this dataset is proved as well as the usefulness of the considered optimization algorithm to improve the final performance outcomes. Also, proposed approach is comparable with state-of-the-art methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Acharya, U.R., Fujita, H., Oh, S.L., Hagiwara, Y., Tan, J.H., Adam, M.: Application of deep convolutional neural network for automated detection of myocardial infarction using ecg signals. Inf. Sci. 415, 190–198 (2017)CrossRef Acharya, U.R., Fujita, H., Oh, S.L., Hagiwara, Y., Tan, J.H., Adam, M.: Application of deep convolutional neural network for automated detection of myocardial infarction using ecg signals. Inf. Sci. 415, 190–198 (2017)CrossRef
2.
Zurück zum Zitat Al-nasheri, A., Muhammad, G., Alsulaiman, M., Ali, Z.: Investigation of voice pathology detection and classification on different frequency regions using correlation functions. J. Voice 31(1), 3–15 (2017)CrossRef Al-nasheri, A., Muhammad, G., Alsulaiman, M., Ali, Z.: Investigation of voice pathology detection and classification on different frequency regions using correlation functions. J. Voice 31(1), 3–15 (2017)CrossRef
3.
Zurück zum Zitat Al-nasheri, A., et al.: An investigation of multidimensional voice program parameters in three different databases for voice pathology detection and classification. J. Voice 31(1), 113–e9 (2017)CrossRef Al-nasheri, A., et al.: An investigation of multidimensional voice program parameters in three different databases for voice pathology detection and classification. J. Voice 31(1), 113–e9 (2017)CrossRef
4.
Zurück zum Zitat Ali, F.: Voice recognition anatomy, processing, uses and application in C (2017) Ali, F.: Voice recognition anatomy, processing, uses and application in C (2017)
5.
Zurück zum Zitat AlZubaidi, A.K., Sideseq, F.B., Faeq, A., Basil, M.: Computer aided diagnosis in digital pathology application: review and perspective approach in lung cancer classification. In: 2017 Annual Conference on New Trends in Information & Communications Technology Applications (NTICT), pp. 219–224. IEEE (2017) AlZubaidi, A.K., Sideseq, F.B., Faeq, A., Basil, M.: Computer aided diagnosis in digital pathology application: review and perspective approach in lung cancer classification. In: 2017 Annual Conference on New Trends in Information & Communications Technology Applications (NTICT), pp. 219–224. IEEE (2017)
7.
Zurück zum Zitat Béranger, J.: Big Data and Ethics: The Medical Datasphere. Elsevier, New York City (2016) Béranger, J.: Big Data and Ethics: The Medical Datasphere. Elsevier, New York City (2016)
8.
Zurück zum Zitat Castro-Ospina, A., Castro-Hoyos, C., Peluffo-Ordonez, D., Castellanos-Dominguez, G.: Novel heuristic search for ventricular arrhythmia detection using normalized cut clustering. In: 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 7076–7079. IEEE (2013) Castro-Ospina, A., Castro-Hoyos, C., Peluffo-Ordonez, D., Castellanos-Dominguez, G.: Novel heuristic search for ventricular arrhythmia detection using normalized cut clustering. In: 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 7076–7079. IEEE (2013)
9.
10.
Zurück zum Zitat Harar, P., Alonso-Hernandezy, J.B., Mekyska, J., Galaz, Z., Burget, R., Smekal, Z.: Voice pathology detection using deep learning: a preliminary study. In: 2017 International Conference and Workshop on Bioinspired Intelligence (IWOBI), pp. 1–4. IEEE (2017) Harar, P., Alonso-Hernandezy, J.B., Mekyska, J., Galaz, Z., Burget, R., Smekal, Z.: Voice pathology detection using deep learning: a preliminary study. In: 2017 International Conference and Workshop on Bioinspired Intelligence (IWOBI), pp. 1–4. IEEE (2017)
11.
Zurück zum Zitat Hemmerling, D., Skalski, A., Gajda, J.: Voice data mining for laryngeal pathology assessment. Comput. Biol. Med. 69, 270–276 (2016)CrossRef Hemmerling, D., Skalski, A., Gajda, J.: Voice data mining for laryngeal pathology assessment. Comput. Biol. Med. 69, 270–276 (2016)CrossRef
12.
Zurück zum Zitat Ibrahim, S., Djemal, R., Alsuwailem, A.: Electroencephalography (EEG) signal processing for epilepsy and autism spectrum disorder diagnosis. Biocybern. Biomed. Eng. 38(1), 16–26 (2018)CrossRef Ibrahim, S., Djemal, R., Alsuwailem, A.: Electroencephalography (EEG) signal processing for epilepsy and autism spectrum disorder diagnosis. Biocybern. Biomed. Eng. 38(1), 16–26 (2018)CrossRef
13.
Zurück zum Zitat Lytras, M.D., Papadopoulou, P.: Applying Big Data Analytics in Bioinformatics and Medicine. IGI Global, Pennsylvania (2017) Lytras, M.D., Papadopoulou, P.: Applying Big Data Analytics in Bioinformatics and Medicine. IGI Global, Pennsylvania (2017)
14.
Zurück zum Zitat Martínez, D., Lleida, E., Ortega, A., Miguel, A., Villalba, J.: Voice pathology detection on the Saarbrücken voice database with calibration and fusion of scores using MultiFocal toolkit. In: Torre Toledano, D., et al. (eds.) IberSPEECH 2012. CCIS, vol. 328, pp. 99–109. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35292-8_11CrossRef Martínez, D., Lleida, E., Ortega, A., Miguel, A., Villalba, J.: Voice pathology detection on the Saarbrücken voice database with calibration and fusion of scores using MultiFocal toolkit. In: Torre Toledano, D., et al. (eds.) IberSPEECH 2012. CCIS, vol. 328, pp. 99–109. Springer, Heidelberg (2012). https://​doi.​org/​10.​1007/​978-3-642-35292-8_​11CrossRef
16.
Zurück zum Zitat Muhammad, G., Alhamid, M.F., Hossain, M.S., Almogren, A.S., Vasilakos, A.V.: Enhanced living by assessing voice pathology using a co-occurrence matrix. Sensors 17(2), 267 (2017)CrossRef Muhammad, G., Alhamid, M.F., Hossain, M.S., Almogren, A.S., Vasilakos, A.V.: Enhanced living by assessing voice pathology using a co-occurrence matrix. Sensors 17(2), 267 (2017)CrossRef
17.
Zurück zum Zitat Muhammad, G., et al.: Voice pathology detection using interlaced derivative pattern on glottal source excitation. Biomed. Signal Process. Control 31, 156–164 (2017)CrossRef Muhammad, G., et al.: Voice pathology detection using interlaced derivative pattern on glottal source excitation. Biomed. Signal Process. Control 31, 156–164 (2017)CrossRef
18.
Zurück zum Zitat Muhammad, G., et al.: Automatic voice pathology detection and classification using vocal tract area irregularity. Biocybern. Biomed. Eng. 36(2), 309–317 (2016)CrossRef Muhammad, G., et al.: Automatic voice pathology detection and classification using vocal tract area irregularity. Biocybern. Biomed. Eng. 36(2), 309–317 (2016)CrossRef
19.
Zurück zum Zitat Orozco-Naranjo, A.J., Muñoz-Gutiérrez, P.A.: Detection of pathological and normal heartbeat using wavelet packet, support vector machines and multilayer perceptron. Tecno Lógicas 31, 73–91 (2013) Orozco-Naranjo, A.J., Muñoz-Gutiérrez, P.A.: Detection of pathological and normal heartbeat using wavelet packet, support vector machines and multilayer perceptron. Tecno Lógicas 31, 73–91 (2013)
20.
21.
Zurück zum Zitat Schalkoff, R.J.: Artificial Neural Networks, vol. 1. McGraw-Hill, New York (1997)MATH Schalkoff, R.J.: Artificial Neural Networks, vol. 1. McGraw-Hill, New York (1997)MATH
22.
Zurück zum Zitat Schilling, R.J., Harris, S.L.: Fundamentals of Digital Signal Processing Using MATLAB. Cengage Learning, Boston (2011) Schilling, R.J., Harris, S.L.: Fundamentals of Digital Signal Processing Using MATLAB. Cengage Learning, Boston (2011)
23.
Zurück zum Zitat Semmlow, J.L., Griffel, B.: Biosignal and Medical Image Processing. CRC Press, Boca Raton (2014) Semmlow, J.L., Griffel, B.: Biosignal and Medical Image Processing. CRC Press, Boca Raton (2014)
24.
Zurück zum Zitat Shinohara, S., et al.: Multilingual evaluation of voice disability index using pitch rate. ASTESJ 2(3), 765–772 (2017)CrossRef Shinohara, S., et al.: Multilingual evaluation of voice disability index using pitch rate. ASTESJ 2(3), 765–772 (2017)CrossRef
25.
Zurück zum Zitat Shriberg, L.D., et al.: A diagnostic marker to discriminate childhood apraxia of speech from speech delay: II. Validity studies of the pause marker. J. Speech Lang. Hear. Res. 60(4), S1118–S1134 (2017)CrossRef Shriberg, L.D., et al.: A diagnostic marker to discriminate childhood apraxia of speech from speech delay: II. Validity studies of the pause marker. J. Speech Lang. Hear. Res. 60(4), S1118–S1134 (2017)CrossRef
26.
Zurück zum Zitat Summers, R.M.: Deep learning and computer-aided diagnosis for medical image processing: a personal perspective. In: Lu, L., Zheng, Y., Carneiro, G., Yang, L. (eds.) Deep Learning and Convolutional Neural Networks for Medical Image Computing. ACVPR, pp. 3–10. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-42999-1_1CrossRef Summers, R.M.: Deep learning and computer-aided diagnosis for medical image processing: a personal perspective. In: Lu, L., Zheng, Y., Carneiro, G., Yang, L. (eds.) Deep Learning and Convolutional Neural Networks for Medical Image Computing. ACVPR, pp. 3–10. Springer, Cham (2017). https://​doi.​org/​10.​1007/​978-3-319-42999-1_​1CrossRef
27.
Zurück zum Zitat von Tscharner, V.: Time-frequency and principal-component methods for the analysis of emgs recorded during a mildly fatiguing exercise on a cycle ergometer. J. Electromyogr. Kinesiol. 12(6), 479–492 (2002)CrossRef von Tscharner, V.: Time-frequency and principal-component methods for the analysis of emgs recorded during a mildly fatiguing exercise on a cycle ergometer. J. Electromyogr. Kinesiol. 12(6), 479–492 (2002)CrossRef
29.
Zurück zum Zitat Verde, L., De Pietro, G., Sannino, G.: Voice disorder identification by using machine learning techniques. IEEE Access 6, 16246–16255 (2018)CrossRef Verde, L., De Pietro, G., Sannino, G.: Voice disorder identification by using machine learning techniques. IEEE Access 6, 16246–16255 (2018)CrossRef
30.
Zurück zum Zitat Wojcicki, K.: HTK MFCC MATLAB. MATLAB Central File Exchange (2011) Wojcicki, K.: HTK MFCC MATLAB. MATLAB Central File Exchange (2011)
Metadaten
Titel
Voice Pathology Detection Using Artificial Neural Networks and Support Vector Machines Powered by a Multicriteria Optimization Algorithm
verfasst von
Henry Jhoán Areiza-Laverde
Andrés Eduardo Castro-Ospina
Diego Hernán Peluffo-Ordóñez
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-00350-0_13