Skip to main content
Erschienen in: New Generation Computing 4/2017

04.08.2017 | Special Feature

Automatic Classification of Impact Sounds with Rejection of Unknown Samples

verfasst von: Joaquim Ferreira da Silva, Sofia Cavaco, Gabriel Pereira Lopes

Erschienen in: New Generation Computing | Ausgabe 4/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The discrimination of very similar sounds is a hard task both for artificial systems and humans. For the former, the main problem lies on finding appropriate features to discriminate each class of sounds, which is an especially hard task when the sounds are very similar, such as impacts on rods of different metals. This paper presents a method to automatically select the features to be used in the classification. Given an initial large set of features, the method measures their discriminative power and builds a reduced set of new features which discriminates the sound classes very accurately. This feature selection method is part of the learning phase of a supervised classification approach also proposed here. In addition, this approach contains a module that rejects unknown sounds also very accurately. This is also an important innovation since most audio classifiers assume all test sounds belong to one of the known classes.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
Even though Fourier analysis gives information about both the magnitude and phase spectrum of the sound, here we consider only the magnitudes.
 
2
In the remaining text, given a set \(\mathscr {X}\), we define \(\Vert \mathscr {X}\Vert\) as its size.
 
3
In the remaining text (except for Sect. 5) every reasoning made for \(DP^{\prime }\) is valid for DP, unless otherwise specified or obviously not applicable.
 
Literatur
1.
Zurück zum Zitat Gencoglu, O., Virtanen, T., Huttunen, H.: Recognition of acoustic events using deep neural networks. In: Proc. of the 22nd European signal processing conference (EUSIPCO), IEEE, Lisbon, pp. 506–510 (2014) Gencoglu, O., Virtanen, T., Huttunen, H.: Recognition of acoustic events using deep neural networks. In: Proc. of the 22nd European signal processing conference (EUSIPCO), IEEE, Lisbon, pp. 506–510 (2014)
2.
Zurück zum Zitat McLoughlin, I., Zhang, H., Xie, Z., Song, Y., Xiao, W.: Robust sound event classification using deep neural networks. IEEE/ACM Trans. Audio Speech Lang. Process. 23(3), 540–552 (2015)CrossRef McLoughlin, I., Zhang, H., Xie, Z., Song, Y., Xiao, W.: Robust sound event classification using deep neural networks. IEEE/ACM Trans. Audio Speech Lang. Process. 23(3), 540–552 (2015)CrossRef
3.
Zurück zum Zitat Khunarsal, P., Lursinsap, C., Raicharoen, T.: Very short time environmental sound classification based on spectrogram pattern matching. Inf. Sci. 243, 57–74 (2013) Khunarsal, P., Lursinsap, C., Raicharoen, T.: Very short time environmental sound classification based on spectrogram pattern matching. Inf. Sci. 243, 57–74 (2013)
4.
Zurück zum Zitat Breebaart, J., McKinney, M.: Features for audio classification. Proc. Philips symposium on intelligent algorithms, Eindhoven (2002) Breebaart, J., McKinney, M.: Features for audio classification. Proc. Philips symposium on intelligent algorithms, Eindhoven (2002)
5.
Zurück zum Zitat Slaney, M.: Auditory toolbox. Technical report #45. Apple Computer, Inc., Cupertino (1994) Slaney, M.: Auditory toolbox. Technical report #45. Apple Computer, Inc., Cupertino (1994)
6.
Zurück zum Zitat Dennis, J., Tran, H.D., Chng, E.S.: Overlapping sound event recognition using local spectrogram features and the generalised hough transform. Pattern Recognit. Lett. 34(9), 1085–1093 (2013) Dennis, J., Tran, H.D., Chng, E.S.: Overlapping sound event recognition using local spectrogram features and the generalised hough transform. Pattern Recognit. Lett. 34(9), 1085–1093 (2013)
7.
Zurück zum Zitat Chu, S., Narayanan S., Kuo, C.-C.J.: Environmental sound recognition using MP-based features. Proc. IEEE ICASSP, pp 1–4 (2008) Chu, S., Narayanan S., Kuo, C.-C.J.: Environmental sound recognition using MP-based features. Proc. IEEE ICASSP, pp 1–4 (2008)
8.
Zurück zum Zitat Cavaco, S., Rodeia, J.: Classification of similar impact sounds. In: Elmoataz A et al (eds) Image and signal processing. Lecture notes in computer science, vol 6134. Springer, Berlin, pp 307–314 (2010) Cavaco, S., Rodeia, J.: Classification of similar impact sounds. In: Elmoataz A et al (eds) Image and signal processing. Lecture notes in computer science, vol 6134. Springer, Berlin, pp 307–314 (2010)
9.
Zurück zum Zitat Kraft, F., Schaaf, T., Waibel, A., Malkin, R.: Temporal ICA for classification of acoustic events in a kitchen environment. In: Proc. international conference on speech and language processing—INTERSPEECH, Lisbon, Portugal, pp. 2689–2692 (2005) Kraft, F., Schaaf, T., Waibel, A., Malkin, R.: Temporal ICA for classification of acoustic events in a kitchen environment. In: Proc. international conference on speech and language processing—INTERSPEECH, Lisbon, Portugal, pp. 2689–2692 (2005)
10.
Zurück zum Zitat Eronen, A.: Musical instrument recognition using ICA-based transform of features and discriminatively trained HMMs. Signal Process. Appl. 2, 133–136 (2003) Eronen, A.: Musical instrument recognition using ICA-based transform of features and discriminatively trained HMMs. Signal Process. Appl. 2, 133–136 (2003)
11.
Zurück zum Zitat Dufaux, A., Besacier, L., Ansorge, M., Pellandini, F.: Automatic sound detection and recognition for noisy environment. In: Proc. of the European signal processing conference (EUSIPCO) (2000) Dufaux, A., Besacier, L., Ansorge, M., Pellandini, F.: Automatic sound detection and recognition for noisy environment. In: Proc. of the European signal processing conference (EUSIPCO) (2000)
12.
Zurück zum Zitat Wu, H., Siegel, M., Khosla, P.: Vehicle sound signature recognition by frequency vector principal component analysis. IEEE Trans. Instrum. Meas. 48(5), 1005–1009 (1999)CrossRef Wu, H., Siegel, M., Khosla, P.: Vehicle sound signature recognition by frequency vector principal component analysis. IEEE Trans. Instrum. Meas. 48(5), 1005–1009 (1999)CrossRef
13.
Zurück zum Zitat Ntalampiras, S., Potamitis, I., Fakotakis, N.: Automatic recognition of urban environmental sounds events. New directions in intelligent interactive multimedia, pp 147–153 (2008) Ntalampiras, S., Potamitis, I., Fakotakis, N.: Automatic recognition of urban environmental sounds events. New directions in intelligent interactive multimedia, pp 147–153 (2008)
14.
Zurück zum Zitat Johnson, R.A., Wichern, D.W.: Applied multivariate statistical analysis, 2nd edn. Prentice-Hall, Upper Saddle River (1988) Johnson, R.A., Wichern, D.W.: Applied multivariate statistical analysis, 2nd edn. Prentice-Hall, Upper Saddle River (1988)
Metadaten
Titel
Automatic Classification of Impact Sounds with Rejection of Unknown Samples
verfasst von
Joaquim Ferreira da Silva
Sofia Cavaco
Gabriel Pereira Lopes
Publikationsdatum
04.08.2017
Verlag
Ohmsha
Erschienen in
New Generation Computing / Ausgabe 4/2017
Print ISSN: 0288-3635
Elektronische ISSN: 1882-7055
DOI
https://doi.org/10.1007/s00354-017-0025-z

Weitere Artikel der Ausgabe 4/2017

New Generation Computing 4/2017 Zur Ausgabe