Skip to main content
Erschienen in: Cognitive Computation 4/2019

13.02.2019

Cognitively Inspired Feature Extraction and Speech Recognition for Automated Hearing Loss Testing

verfasst von: Shibli Nisar, Muhammad Tariq, Ahsan Adeel, Mandar Gogate, Amir Hussain

Erschienen in: Cognitive Computation | Ausgabe 4/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Hearing loss, a partial or total inability to hear, is one of the most commonly reported disabilities. A hearing test can be carried out by an audiologist to assess a patient’s auditory system. However, the procedure requires an appointment, which can result in delays and practitioner fees. In addition, there are often challenges associated with the unavailability of equipment and qualified practitioners, particularly in remote areas. This paper presents a novel idea that automatically identifies any hearing impairment based on a cognitively inspired feature extraction and speech recognition approach. The proposed system uses an adaptive filter bank with weighted Mel-frequency cepstral coefficients for feature extraction. The adaptive filter bank implementation is inspired by the principle of spectrum sensing in cognitive radio that is aware of its environment and adapts to statistical variations in the input stimuli by learning from the environment. Comparative performance evaluation demonstrates the potential of our automated hearing test method to achieve comparable results to the clinical ground truth, established by the expert audiologist’s tests. The overall absolute error of the proposed model when compared with the expert audiologist test is less than 4.9 dB and 4.4 dB for the pure tone and speech audiometry tests, respectively. The overall accuracy achieved is 96.67% with a hidden Markov model (HMM). The proposed method potentially offers a second opinion to audiologists, and serves as a cost-effective pre-screening test to predict hearing loss at an early stage. In future work, authors intend to explore the application of advanced deep learning and optimization approaches to further enhance the performance of the automated testing prototype considering imperfect datasets with real-world background noise.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Organization WH, et al. 2013. Millions of People in the World have Hearing Loss that can be Treated or Prevented. Awareness is the Key to Prevention. Organization WH, et al. 2013. Millions of People in the World have Hearing Loss that can be Treated or Prevented. Awareness is the Key to Prevention.
2.
Zurück zum Zitat Dalton DS, Cruickshanks KJ, Klein BE, Klein R, Wiley TL, Nondahl DM. The impact of hearing loss on quality of life in older adults. Gerontol 2003;43(5):661–668.CrossRef Dalton DS, Cruickshanks KJ, Klein BE, Klein R, Wiley TL, Nondahl DM. The impact of hearing loss on quality of life in older adults. Gerontol 2003;43(5):661–668.CrossRef
3.
Zurück zum Zitat Davis A, Smith P, Ferguson M, Stephens D, Gianopoulos I. Acceptability, benefit and costs of early screening for hearing disability: a study of potential screening tests and models. Health Technology Assessment-Southampton-. 2007;11(42). Davis A, Smith P, Ferguson M, Stephens D, Gianopoulos I. Acceptability, benefit and costs of early screening for hearing disability: a study of potential screening tests and models. Health Technology Assessment-Southampton-. 2007;11(42).
4.
Zurück zum Zitat Fagan J. 2014. Open access guide to audiology and hearing aids for otolaryngologists. Fagan J. 2014. Open access guide to audiology and hearing aids for otolaryngologists.
5.
Zurück zum Zitat Association ASLH, et al. 2005. Guidelines for manual pure-tone threshold audiometry. Association ASLH, et al. 2005. Guidelines for manual pure-tone threshold audiometry.
6.
Zurück zum Zitat Hudgins CV, Hawkins J, Kaklin J, Stevens S. The development of recorded auditory tests for measuring hearing loss for speech. Laryngoscope 1947;57(1):57–89.PubMedCrossRef Hudgins CV, Hawkins J, Kaklin J, Stevens S. The development of recorded auditory tests for measuring hearing loss for speech. Laryngoscope 1947;57(1):57–89.PubMedCrossRef
7.
Zurück zum Zitat Probst R, Lonsbury-Martin B, Martin G, Coats A. Otoacoustic emissions in ears with hearing loss. Amer J Otolaryngol 1987;8(2):73–81.CrossRef Probst R, Lonsbury-Martin B, Martin G, Coats A. Otoacoustic emissions in ears with hearing loss. Amer J Otolaryngol 1987;8(2):73–81.CrossRef
8.
Zurück zum Zitat Wilson DF, Hodgson RS, Gustafson MF. Auditory brainstem response testing. Laryngoscope 1993;103 (5):580–581.CrossRef Wilson DF, Hodgson RS, Gustafson MF. Auditory brainstem response testing. Laryngoscope 1993;103 (5):580–581.CrossRef
9.
Zurück zum Zitat Schlauch RS, Han HJ, Tzu-Ling JY, Carney E. Pure-tone–spondee threshold relationships in functional hearing loss: a test of loudness contribution. J Speech Language Hear Res 2017;60(1):136–143.CrossRef Schlauch RS, Han HJ, Tzu-Ling JY, Carney E. Pure-tone–spondee threshold relationships in functional hearing loss: a test of loudness contribution. J Speech Language Hear Res 2017;60(1):136–143.CrossRef
10.
Zurück zum Zitat Martin FN, Clark JG. Introduction to audiology. Boston: Allyn and Bacon; 1997. Martin FN, Clark JG. Introduction to audiology. Boston: Allyn and Bacon; 1997.
11.
Zurück zum Zitat Brandy WT. Speech audiometry. Handb Clin Audiol 2002;5:96–110. Brandy WT. Speech audiometry. Handb Clin Audiol 2002;5:96–110.
12.
Zurück zum Zitat Franks JR. Hearing measurement. National Institute for Occupational Safety and Health. 2001; p. 183–232. Franks JR. Hearing measurement. National Institute for Occupational Safety and Health. 2001; p. 183–232.
13.
Zurück zum Zitat Carhart R. Clinical application of bone conduction audiometry. Arch Otolaryngol 1950;51(6):798–808.PubMedCrossRef Carhart R. Clinical application of bone conduction audiometry. Arch Otolaryngol 1950;51(6):798–808.PubMedCrossRef
14.
Zurück zum Zitat Stapells DR, Oates P. Estimation of the pure-tone audiogram by the auditory brainstem response: A review. Audiol Neurotol 1997;2(5):257–280.CrossRef Stapells DR, Oates P. Estimation of the pure-tone audiogram by the auditory brainstem response: A review. Audiol Neurotol 1997;2(5):257–280.CrossRef
15.
Zurück zum Zitat Loss CH. 2012. Sensorineural hearing loss. Diseases Ear Nose Throat. Loss CH. 2012. Sensorineural hearing loss. Diseases Ear Nose Throat.
16.
Zurück zum Zitat Pensak ML, Adelman RA. 1993. Conductive hearing loss. Otolaryngology-head and neck surgery St Louis: Mosby Year Book. Pensak ML, Adelman RA. 1993. Conductive hearing loss. Otolaryngology-head and neck surgery St Louis: Mosby Year Book.
17.
Zurück zum Zitat Ramsay HA, Linthicum JF. Mixed hearing loss in otosclerosis: indication for long-term follow-up. Amer J Otol 1994;15(4):536–539. Ramsay HA, Linthicum JF. Mixed hearing loss in otosclerosis: indication for long-term follow-up. Amer J Otol 1994;15(4):536–539.
18.
Zurück zum Zitat Sreedhar J, Venkatesh L, Nagaraja M, Srinivasan P. Development and evaluation of paired words for testing of speech recognition threshold in Telugu A preliminary report. J Indian Speech Lang Hear Assoc 2011;25 (2):128–136. Sreedhar J, Venkatesh L, Nagaraja M, Srinivasan P. Development and evaluation of paired words for testing of speech recognition threshold in Telugu A preliminary report. J Indian Speech Lang Hear Assoc 2011;25 (2):128–136.
19.
Zurück zum Zitat Van Tasell DJ, Yanz JL. Speech recognition threshold in noise: effects of hearing loss, frequency response, and speech materials. J Speech Lang Hear Res 1987;30(3):377–386.CrossRef Van Tasell DJ, Yanz JL. Speech recognition threshold in noise: effects of hearing loss, frequency response, and speech materials. J Speech Lang Hear Res 1987;30(3):377–386.CrossRef
20.
Zurück zum Zitat Association ASLH, et al. 1988. Determining threshold level for speech. Association ASLH, et al. 1988. Determining threshold level for speech.
21.
Zurück zum Zitat Martin FN, Champlin CA, Chambers JA. Seventh survey of audiometric practices in the United States. J-Amer Acad Audiol 1998;9:95–104. Martin FN, Champlin CA, Chambers JA. Seventh survey of audiometric practices in the United States. J-Amer Acad Audiol 1998;9:95–104.
22.
23.
Zurück zum Zitat Schoepflin JR. 2015. Back to basics: speech audiometry. Schoepflin JR. 2015. Back to basics: speech audiometry.
24.
Zurück zum Zitat Boothroyd A. Developments in speech audiometry. Br J Audiol 1968;2(1):3–10.CrossRef Boothroyd A. Developments in speech audiometry. Br J Audiol 1968;2(1):3–10.CrossRef
25.
Zurück zum Zitat Renda L, Selċuk ÖT, Eyigör H, Osma Ü, Yılmaz MD. Smartphone based audiometric test for confirming the level of hearing; Is it useable in underserved areas? J Int Adv Otol 2016;12(1):61–6.PubMedCrossRef Renda L, Selċuk ÖT, Eyigör H, Osma Ü, Yılmaz MD. Smartphone based audiometric test for confirming the level of hearing; Is it useable in underserved areas? J Int Adv Otol 2016;12(1):61–6.PubMedCrossRef
26.
Zurück zum Zitat Szudek J, Ostevik A, Dziegielewski P, Robinson-Anagor J, Gomaa N, Hodgetts B, et al. Can Uhear me now? Validation of an iPod-based hearing loss screening test. Journal of Otolaryngology–Head & Neck Surgery. 2012; p. 41. Szudek J, Ostevik A, Dziegielewski P, Robinson-Anagor J, Gomaa N, Hodgetts B, et al. Can Uhear me now? Validation of an iPod-based hearing loss screening test. Journal of Otolaryngology–Head & Neck Surgery. 2012; p. 41.
27.
Zurück zum Zitat Wong TW, Yu T, Chen W, Chiu Y, Wong C, Wong A. Agreement between hearing thresholds measured in non-soundproof work environments and a soundproof booth. Occup Environ Med 2003;60(9):667–671.PubMedPubMedCentralCrossRef Wong TW, Yu T, Chen W, Chiu Y, Wong C, Wong A. Agreement between hearing thresholds measured in non-soundproof work environments and a soundproof booth. Occup Environ Med 2003;60(9):667–671.PubMedPubMedCentralCrossRef
28.
Zurück zum Zitat Kam ACS, Gao H, Li LKC, Zhao H, Qiu S, Tong MCF. Automated hearing screening for children: a pilot study in China. Int J Audiol 2013;52(12):855–860.PubMedCrossRef Kam ACS, Gao H, Li LKC, Zhao H, Qiu S, Tong MCF. Automated hearing screening for children: a pilot study in China. Int J Audiol 2013;52(12):855–860.PubMedCrossRef
29.
Zurück zum Zitat Foulad A, Bui P, Djalilian H. Automated audiometry using Apple iOS-based application technology. Otolaryngol–Head Neck Surg 2013;149(5):700–706.PubMedCrossRef Foulad A, Bui P, Djalilian H. Automated audiometry using Apple iOS-based application technology. Otolaryngol–Head Neck Surg 2013;149(5):700–706.PubMedCrossRef
30.
Zurück zum Zitat Ananthi S, Dhanalakshmi P. SVM and HMM modeling techniques for speech recognition using LPCC and MFCC features. Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA) 2014. Springer; 2015. p. 519–526. Ananthi S, Dhanalakshmi P. SVM and HMM modeling techniques for speech recognition using LPCC and MFCC features. Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA) 2014. Springer; 2015. p. 519–526.
31.
Zurück zum Zitat Chen Ch. Handbook of pattern recognition and computer vision. Singapore: World Scientific; 2015. Chen Ch. Handbook of pattern recognition and computer vision. Singapore: World Scientific; 2015.
32.
Zurück zum Zitat Anagnostopoulos CN, Iliou T, Giannoukos I. Features and classifiers for emotion recognition from speech: a survey from 2000 to 2011. Artif Intell Rev 2015;43(2):155–177.CrossRef Anagnostopoulos CN, Iliou T, Giannoukos I. Features and classifiers for emotion recognition from speech: a survey from 2000 to 2011. Artif Intell Rev 2015;43(2):155–177.CrossRef
33.
Zurück zum Zitat Rabiner LR. A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 1989;77(2):257–286.CrossRef Rabiner LR. A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 1989;77(2):257–286.CrossRef
34.
Zurück zum Zitat Carhart R, Jerger J. 1959. Preferred method for clinical determination of pure-tone thresholds. Journal of Speech & Hearing Disorders. Carhart R, Jerger J. 1959. Preferred method for clinical determination of pure-tone thresholds. Journal of Speech & Hearing Disorders.
35.
Zurück zum Zitat Franks JR. Hearing measurement. National Institute for Occupational Safety and Health. 2001; p. 183–232. Franks JR. Hearing measurement. National Institute for Occupational Safety and Health. 2001; p. 183–232.
36.
Zurück zum Zitat Ezeiza A, de Ipiña KL, Hernández C, Barroso N. Enhancing the feature extraction process for automatic speech recognition with fractal dimensions. Cogn Comput 2013;5(4):545–550.CrossRef Ezeiza A, de Ipiña KL, Hernández C, Barroso N. Enhancing the feature extraction process for automatic speech recognition with fractal dimensions. Cogn Comput 2013;5(4):545–550.CrossRef
37.
Zurück zum Zitat Alam MJ, Kenny P, O’shaughnessy D. Low-variance multitaper mel-frequency cepstral coefficient features for speech and speaker recognition systems. Cogn Comput 2013;5(4):533–544.CrossRef Alam MJ, Kenny P, O’shaughnessy D. Low-variance multitaper mel-frequency cepstral coefficient features for speech and speaker recognition systems. Cogn Comput 2013;5(4):533–544.CrossRef
38.
Zurück zum Zitat Hei Y, Li W, Li M, Qiu Z, Fu W. Optimization of multiuser MIMO cooperative spectrum sensing in cognitive radio networks. Cogn Comput 2015;7(3):359–368.CrossRef Hei Y, Li W, Li M, Qiu Z, Fu W. Optimization of multiuser MIMO cooperative spectrum sensing in cognitive radio networks. Cogn Comput 2015;7(3):359–368.CrossRef
39.
Zurück zum Zitat Nisar S, Khan OU, Tariq M. An efficient adaptive window size selection method for improving spectrogram visualization. Computational intelligence and neuroscience. 2016. Nisar S, Khan OU, Tariq M. An efficient adaptive window size selection method for improving spectrogram visualization. Computational intelligence and neuroscience. 2016.
40.
Zurück zum Zitat Dobie RA, Van Hemel S, Council NR, et al. 2004. Basics of Sound, the Ear, and Hearing. Dobie RA, Van Hemel S, Council NR, et al. 2004. Basics of Sound, the Ear, and Hearing.
41.
Zurück zum Zitat Schoepflin JR. 2015. Back to Basics: Speech Audiometry. Schoepflin JR. 2015. Back to Basics: Speech Audiometry.
42.
Zurück zum Zitat Kapul A, Zubova E, Torgaev SN, Drobchik V, Vol. 881. Pure-tone audiometer. In: Journal of Physics: Conference Series. UK: IOP Publishing; 2017, p. 012010. Kapul A, Zubova E, Torgaev SN, Drobchik V, Vol. 881. Pure-tone audiometer. In: Journal of Physics: Conference Series. UK: IOP Publishing; 2017, p. 012010.
43.
Zurück zum Zitat Behgam M, Grant SL. Echo cancellation for bone conduction transducers. 2014 48th Asilomar Conference on Signals, Systems and Computers. IEEE; 2014. p. 1629–1632. Behgam M, Grant SL. Echo cancellation for bone conduction transducers. 2014 48th Asilomar Conference on Signals, Systems and Computers. IEEE; 2014. p. 1629–1632.
44.
Zurück zum Zitat Zhong W, Kong X, You X, Wang B. 2015. Recording Device Identification Based on Cepstral Mixed Features. Zhong W, Kong X, You X, Wang B. 2015. Recording Device Identification Based on Cepstral Mixed Features.
45.
Zurück zum Zitat Hsu CW, Chang CC, Lin CJ, et al. 2003. A practical guide to support vector classification. Hsu CW, Chang CC, Lin CJ, et al. 2003. A practical guide to support vector classification.
46.
Zurück zum Zitat Shady Y, Zayed SHH. Speaker independent Arabic speech recognition using support vector machine. Department of Electrical Engineering, Shoubra Faculty of Engineering. Cairo: Benha University; 2009. Shady Y, Zayed SHH. Speaker independent Arabic speech recognition using support vector machine. Department of Electrical Engineering, Shoubra Faculty of Engineering. Cairo: Benha University; 2009.
47.
Zurück zum Zitat Priya TL, Raajan N, Raju N, Preethi P, Mathini S. Speech and non-speech identification and classification using KNN Algorithm. Proced Eng 2012;38:952–958.CrossRef Priya TL, Raajan N, Raju N, Preethi P, Mathini S. Speech and non-speech identification and classification using KNN Algorithm. Proced Eng 2012;38:952–958.CrossRef
49.
Zurück zum Zitat Breiman L. Bagging predictors. Mach Learn 1996;24(2):123–140. Breiman L. Bagging predictors. Mach Learn 1996;24(2):123–140.
50.
Zurück zum Zitat Freund Y, Schapire RE. Game theory, on-line prediction and boosting. Proceedings of the ninth annual conference on Computational learning theory. ACM; 1996. p. 325–332. Freund Y, Schapire RE. Game theory, on-line prediction and boosting. Proceedings of the ninth annual conference on Computational learning theory. ACM; 1996. p. 325–332.
51.
Zurück zum Zitat Freund Y, Schapire RE, et al. Experiments with a new boosting algorithm. icml; 1996. p. 148–156. Freund Y, Schapire RE, et al. Experiments with a new boosting algorithm. icml; 1996. p. 148–156.
52.
Zurück zum Zitat Rokach L. Ensemble-based classifiers. Artif Intell Rev 2010;33(1):1–39.CrossRef Rokach L. Ensemble-based classifiers. Artif Intell Rev 2010;33(1):1–39.CrossRef
53.
Zurück zum Zitat Dietterich TG. Ensemble methods in machine learning. International workshop on multiple classifier systems. Springer; 2000. p. 1–15. Dietterich TG. Ensemble methods in machine learning. International workshop on multiple classifier systems. Springer; 2000. p. 1–15.
54.
Zurück zum Zitat Vimala C, Radha V. Isolated speech recognition system for Tamil language using statistical pattern matching and machine learning techniques. J Eng Sci Technol (JESTEC) 2015;10(5):617–632. Vimala C, Radha V. Isolated speech recognition system for Tamil language using statistical pattern matching and machine learning techniques. J Eng Sci Technol (JESTEC) 2015;10(5):617–632.
55.
Zurück zum Zitat Juang BH, Rabiner LR. Hidden Markov models for speech recognition. Technometrics 1991;33(3):251–272.CrossRef Juang BH, Rabiner LR. Hidden Markov models for speech recognition. Technometrics 1991;33(3):251–272.CrossRef
57.
Zurück zum Zitat Eddins DA, Walton JP, Dziorny AE, Frisina RD. Comparison of pure tone thresholds obtained via automated audiometry and standard pure tone audiometry. J Acoust Soc Amer 2012;131(4):3518–3518.CrossRef Eddins DA, Walton JP, Dziorny AE, Frisina RD. Comparison of pure tone thresholds obtained via automated audiometry and standard pure tone audiometry. J Acoust Soc Amer 2012;131(4):3518–3518.CrossRef
Metadaten
Titel
Cognitively Inspired Feature Extraction and Speech Recognition for Automated Hearing Loss Testing
verfasst von
Shibli Nisar
Muhammad Tariq
Ahsan Adeel
Mandar Gogate
Amir Hussain
Publikationsdatum
13.02.2019
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 4/2019
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-018-9607-4

Weitere Artikel der Ausgabe 4/2019

Cognitive Computation 4/2019 Zur Ausgabe