Skip to main content
Top

2023 | OriginalPaper | Chapter

A Deep Meta-model for Environmental Sound Recognition

Author : K. S. Arun

Published in: ICDSMLA 2021

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Nowadays, sound serves as a crucial factor in all facets of human life. Staring from automating personal security systems to critical surveillance systems, sound is an indispensable component. The practical implementation of the present day automatic sound recognition systems in real-life settings is inadmissible due to their poor detection accuracy. However, deep learning-based systems overcome the incompetence of the traditional machine learning-based models, and it can be used to develop automatic sound classification systems. This work proposes a deep meta-model for categorizing environmental sounds on the basis of the spectrogram images generated from these sounds. In the proposed approach, spectrogram images of environmental sounds are used to train five different deep learning models, and the predictions from these base models are then stacked using the proposed deep meta-model. Experimental results on two benchmark datasets such as ESC-50 and UrbanSound 8K demonstrate the fact that the proposed deep meta-model is a promising alternative to the conventional approaches for environmental sound recognition.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Ahmed M, Robin TI, Shafin AA et al (2020) Automatic environmental sound recognition (AESR) using convolutional neural network. Int J Mod Educ Comput Sci 12(5) Ahmed M, Robin TI, Shafin AA et al (2020) Automatic environmental sound recognition (AESR) using convolutional neural network. Int J Mod Educ Comput Sci 12(5)
2.
go back to reference Arun KS, Govindan VK (2015) Optimizing visual dictionaries for effective image retrieval. Int J Multim Inf Retr 4(3):165–185CrossRef Arun KS, Govindan VK (2015) Optimizing visual dictionaries for effective image retrieval. Int J Multim Inf Retr 4(3):165–185CrossRef
3.
go back to reference Arun KS, Govindan VK, Kumar SDM (2017) On integrating re-ranking and rank list fusion techniques for image retrieval. Int J Data Sci Anal 4(1):53–81CrossRef Arun KS, Govindan VK, Kumar SDM (2017) On integrating re-ranking and rank list fusion techniques for image retrieval. Int J Data Sci Anal 4(1):53–81CrossRef
4.
go back to reference Arun KS, Sarath KS (2010) Evaluation of the role of low level and high level features in content based medical image retrieval. In: International conference on advances in information and communication technologies, Springer, pp 319–325 Arun KS, Sarath KS (2010) Evaluation of the role of low level and high level features in content based medical image retrieval. In: International conference on advances in information and communication technologies, Springer, pp 319–325
5.
go back to reference Demir F, Abdullah DA, Sengur A (2020) A new deep CNN model for environmental sound classification. IEEE Access 8:66529–66537CrossRef Demir F, Abdullah DA, Sengur A (2020) A new deep CNN model for environmental sound classification. IEEE Access 8:66529–66537CrossRef
6.
go back to reference Demir F, Turkoglu M, Aslan M, Sengur A (2020) A new pyramidal concatenated CNN approach for environmental sound classification. Appl Acoust 170:107520CrossRef Demir F, Turkoglu M, Aslan M, Sengur A (2020) A new pyramidal concatenated CNN approach for environmental sound classification. Appl Acoust 170:107520CrossRef
7.
go back to reference Dietterich TG (2000) Ensemble methods in machine learning. In: International workshop on multiple classifier systems. Springer, pp 1–15 Dietterich TG (2000) Ensemble methods in machine learning. In: International workshop on multiple classifier systems. Springer, pp 1–15
8.
go back to reference Guzhov A, Raue F, Hees J, Dengel A (2021) ESResNet: environmental sound classification based on visual domain models. In: 2020 25th International conference on pattern recognition (ICPR). IEEE, pp 4933–4940 Guzhov A, Raue F, Hees J, Dengel A (2021) ESResNet: environmental sound classification based on visual domain models. In: 2020 25th International conference on pattern recognition (ICPR). IEEE, pp 4933–4940
9.
go back to reference Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141 Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
11.
go back to reference Liu C, Hong F, Feng H, Zhai Y, Chen Y (2021) Environmental sound classification based on stacked concatenated DNN using aggregated features. J Signal Process Syst 1–13 Liu C, Hong F, Feng H, Zhai Y, Chen Y (2021) Environmental sound classification based on stacked concatenated DNN using aggregated features. J Signal Process Syst 1–13
12.
go back to reference Ragab MG, Abdulkadir SJ, Aziz N, Alhussian H, Bala A, Alqushaibi A (2021) An ensemble one dimensional convolutional neural network with Bayesian optimization for environmental sound classification. Appl Sci 11(10):4660CrossRef Ragab MG, Abdulkadir SJ, Aziz N, Alhussian H, Bala A, Alqushaibi A (2021) An ensemble one dimensional convolutional neural network with Bayesian optimization for environmental sound classification. Appl Sci 11(10):4660CrossRef
13.
go back to reference Skariah SM, Arun KS (2021) A deep learning based approach for automated diabetic retinopathy detection and grading. In: 2021 4th Biennial international conference on Nascent Technologies in engineering (ICNTE). IEEE (2021) Skariah SM, Arun KS (2021) A deep learning based approach for automated diabetic retinopathy detection and grading. In: 2021 4th Biennial international conference on Nascent Technologies in engineering (ICNTE). IEEE (2021)
14.
go back to reference Stastny J, Munk M, Juranek L (2018) Automatic bird species recognition based on birds vocalization. EURASIP J Audio Speech Music Process 2018(1):1–7CrossRef Stastny J, Munk M, Juranek L (2018) Automatic bird species recognition based on birds vocalization. EURASIP J Audio Speech Music Process 2018(1):1–7CrossRef
15.
go back to reference Tang B, Li Y, Li X, Xu L, Yan Y, Yang Q (2019) Deep CNN framework for environmental sound classification using weighting filters. In: 2019 IEEE international conference on mechatronics and automation (ICMA), IEEE, pp 2297–2302 Tang B, Li Y, Li X, Xu L, Yan Y, Yang Q (2019) Deep CNN framework for environmental sound classification using weighting filters. In: 2019 IEEE international conference on mechatronics and automation (ICMA), IEEE, pp 2297–2302
16.
go back to reference Woo S, Park J, Lee JY, Kweon IS (2018) CBAM: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19 Woo S, Park J, Lee JY, Kweon IS (2018) CBAM: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
17.
go back to reference Xie S, Girshick R, Dollár P, Tu Z, He K (2017) Aggregated residual transformations for deep neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1492–1500 Xie S, Girshick R, Dollár P, Tu Z, He K (2017) Aggregated residual transformations for deep neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1492–1500
19.
go back to reference Zhang Z, Xu S, Zhang S, Qiao T, Cao S (2021) Attention based convolutional recurrent neural network for environmental sound classification. Neurocomputing 453:896–903CrossRef Zhang Z, Xu S, Zhang S, Qiao T, Cao S (2021) Attention based convolutional recurrent neural network for environmental sound classification. Neurocomputing 453:896–903CrossRef
Metadata
Title
A Deep Meta-model for Environmental Sound Recognition
Author
K. S. Arun
Copyright Year
2023
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-19-5936-3_19

Premium Partner