Skip to main content
Top

2022 | OriginalPaper | Chapter

Cough Sound Identification: An Approach Based on Ensemble Learning

Authors : Christian Salamea-Palacios, Javier Guaña-Moya, Tarquino Sanchez, Xavier Calderón, David Naranjo

Published in: Marketing and Smart Technologies

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Cough identification using DSP techniques in an audio signal is a complex task; thus, an artificial intelligence approach is proposed by applying machine learning, deep learning, and HMMs algorithms. Later, an ensemble learning model has been used to differentiate cough from other environmental sounds, putting those algorithms together and choosing the best result as the performance of the system. The final system consists of a preprocessing stage where the audio signals are adjusted through data augmentation, normalization, removal of silent fragments, and the transformation to Mel spectrograms, while, on back-end stage, three models have been evaluated: a convolutional neural network, a random forest, and a classifier based on hidden Markov models. We assembled a hard voting classifier (VC) model from the three models to obtain a more robust and reliable model. The VC model reached the highest precision and F1-score values without false-negative and up to 75% of true-positive values.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
7.
go back to reference Arturo, G.M.: Reconocimiento de voz basado en MFCC, SBC y Espectrogramas. Ingenius 12–20. (2013) Arturo, G.M.: Reconocimiento de voz basado en MFCC, SBC y Espectrogramas. Ingenius 12–20. (2013)
8.
go back to reference Grama, L.: Choosing an accurate number of mel frequency cepstral coefficients for audio classification purpose. In: International Symposium on Image and Signal Processing and Analysis, ISPA, Ispa, pp. 225–230 (2017). https://doi.org/10.1109/ISPA.2017 Grama, L.: Choosing an accurate number of mel frequency cepstral coefficients for audio classification purpose. In: International Symposium on Image and Signal Processing and Analysis, ISPA, Ispa, pp. 225–230 (2017). https://​doi.​org/​10.​1109/​ISPA.​2017
14.
go back to reference Teyhouee, A.: Cough detection using hidden markov models. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11549 LNCS, pp. 266–276 (2019) Teyhouee, A.: Cough detection using hidden markov models. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11549 LNCS, pp. 266–276 (2019)
21.
go back to reference Grama, L., Rusu, C.: Choosing an accurate number of mel frequency cepstral coefficients for audio classification purpose. In: International Symposium on Image and Signal Processing and Analysis, ISPA, Ispa, pp. 225–230 (2017). https://doi.org/10.1109/ISPA.2017 Grama, L., Rusu, C.: Choosing an accurate number of mel frequency cepstral coefficients for audio classification purpose. In: International Symposium on Image and Signal Processing and Analysis, ISPA, Ispa, pp. 225–230 (2017). https://​doi.​org/​10.​1109/​ISPA.​2017
24.
go back to reference Jeebun, S.: Optimal number of states in hidden Markov models and its application to the detection of human movement. Univ. Mauritius Res. J. 21, 438–469 (2015) Jeebun, S.: Optimal number of states in hidden Markov models and its application to the detection of human movement. Univ. Mauritius Res. J. 21, 438–469 (2015)
Metadata
Title
Cough Sound Identification: An Approach Based on Ensemble Learning
Authors
Christian Salamea-Palacios
Javier Guaña-Moya
Tarquino Sanchez
Xavier Calderón
David Naranjo
Copyright Year
2022
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-16-9268-0_22

Premium Partner