nach oben

Erschienen in:

2016 | OriginalPaper | Buchkapitel

Bio-Inspired Filters for Audio Analysis

verfasst von : Nicola Strisciuglio, Mario Vento, Nicolai Petkov

Erschienen in: Brain-Inspired Computing

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Nowadays, much is known about the functions of the components of the human auditory system. Computational models of these components are widely accepted and recently inspired the work of researchers in pattern recognition and signal processing. In this work we present a novel filter, which we call COPE (Combination of Peaks of Energy), that is inspired by the way the sound waves are converted into neuronal firing activity on the auditory nerve. A COPE filter creates a model of the pattern of the neural activity generated by a sound of interest and is able to detect the same pattern and modified versions of it. We apply the proposed method on the task of event detection for surveillance of roads. For the experiments, we use a publicly available data set, namely the MIVIA road events data set. The results that we achieve (recognition rate equal to \(94\%\) and false positive rate lower than \(4\%\)) and the comparison with existing methods demonstrate the effectiveness of the proposed bio-inspired filters for audio analysis.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Visual Processing in Cortical Architecture from Neuroscience to Neuromorphic Computing

Nächstes Kapitel Sophisticated LVQ Classification Models - Beyond Accuracy Optimization

Azzopardi, G., Petkov, N.: A CORF computational model of a simple cell that relies on LGN input outperforms the Gabor function model. Biol. Cybern. 106(3), 177–189 (2012)CrossRef

Azzopardi, G., Petkov, N.: Trainable COSFIRE filters for keypoint detection and pattern recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35, 490–503 (2013)CrossRef

Azzopardi, G., Strisciuglio, N., Vento, M., Petkov, N.: Trainable COSFIRE filters for vessel delineation with application to retinal images. Med. Image Anal. 19(1), 46–57 (2015)CrossRef

Blauert, J.: The Technology of Binaural Listening. Modern Acoustics and Signal Processing (2013)

Cano, P., Batlle, E., Kalker, T., Haitsma, J.: A review of audio fingerprinting. J. VLSI Sig. Process. Syst. Sig. Image Video Technol. 41(3), 271–284 (2005)CrossRef

Carletti, V., Foggia, P., Percannella, G., Saggese, A., Strisciuglio, N., Vento, M.: Audio surveillance using a bag of aural words classifier. In: IEEE AVSS, pp. 81–86, August 2013

Chin, M., Burred, J.: Audio event detection based on layered symbolic sequence representations. In: IEEE ICASSP, pp. 1953–1956 (2012)

Clavel, C., Ehrette, T., Richard, G.: Events detection for an audio-based surveillance system. In: ICME, pp. 1306–1309 (2005)

Conte, D., Foggia, P., Percannella, G., Saggese, A., Vento, M.: An ensemble of rejecting classifiers for anomaly detection of audio events. In: IEEE AVSS, pp. 76–81, September 2012

10.

Crocco, M., Cristani, M., Trucco, A., Murino, V.: Audio surveillance: a systematic review. CoRR abs/1409.7787 (2014)

11.

Daugman, J.G.: Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. J. Opt. Soc. Am. A 2(7), 1160–1169 (1985)CrossRef

12.

Foggia, P., Petkov, N., Saggese, A., Strisciuglio, N., Vento, M.: Audio surveillance of roads: a system for detecting anomalous sounds. IEEE Trans. Intell. Transp. Syst. PP(99), 1–10 (2015)

13.

Foggia, P., Saggese, A., Strisciuglio, N., Vento, M.: Cascade classifiers trained on gammatonegrams for reliably detecting audio events. In: IEEE AVSS, pp. 50–55, August 2014

14.

Foggia, P., Petkov, N., Saggese, A., Strisciuglio, N., Vento, M.: Reliable detection of audio events in highly noisy environments. Pattern Recogn. Lett. 65, 22–28 (2015)CrossRef

15.

Geman, S., Geman, D.: Stochastic relaxation, gibbs distributions, and the bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. PAMI–6(6), 721–741 (1984)CrossRefMATH

16.

Jeffress, L.A.: A place theory of sound localization. J. Comp. Physiol. Psychol. 41(1), 35–39 (1948)CrossRef

17.

Lecomte, S., Lengelle, R., Richard, C., Capman, F., Ravera, B.: Abnormal events detection using unsupervised one-class svm - application to audio surveillance and evaluation. In: IEEE AVSS, pp. 124–129, 30 2011-September 2 2011

18.

Lopez-Poveda, E.A., Eustaquio-Martín, A.: A biophysical model of the inner hair cell: The contribution of potassium currents to peripheral auditory compression. J. Assoc. Res. Otolaryngol. 7(3), 218–235 (2006). http://dx.doi.org/10.1007/s10162-006-0037-8

19.

Meddis, R.: Auditory-nerve first-spike latency and auditory absolute threshold: a computer model. J. Acoust. Soc. Am. 119(1), 406–417 (2006)CrossRef

20.

Ntalampiras, S., Potamitis, I., Fakotakis, N.: An adaptive framework for acoustic monitoring of potential hazards. EURASIP J. Audio Speech Music Process. 2009, 13:1–13:15 (2009)

21.

Ogle, J.P., Ellis, D.P.W.: Fingerprinting to identify repeated sound events in long-duration personal audio recordings. In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2007, ICASSP 2007, vol. 1, pp. I-233–I-236, April 2007

22.

Palmer, A., Russell, I.: Phase-locking in the cochlear nerve of the guinea-pig and its relation to the receptor potential of inner hair-cells. Hear. Res. 24(1), 1–15 (1986)CrossRef

23.

Patterson, R.D., Moore, B.C.J.: Auditory filters and excitation patterns as representations of frequency resolution. Frequency selectivity in hearing, pp. 123–177 (1986)

24.

Patterson, R.D., Robinson, K., Holdsworth, J., Mckeown, D., Zhang, C., Allerhand, M.: Complex Sounds and auditory images. In: Cazals, Y., Demany, L., Honer, K. (eds.) Auditory Physiology and Perception, Pergamon, Pergamon, Oxford, pp. 429–443 (1992)

25.

Phan, H., Hertel, L., Maass, M., Mazur, R., Mertins, A.: Audio phrases for audio event recognition. In: 23nd European Signal Processing Conference, EUSIPCO 2015 (2015)

26.

Pour, A.F., Asgari, M., Hasanabadi, M.R.: Gammatonegram based speaker identification. In: 2014 4th International eConference on Computer and Knowledge Engineering (ICCKE), pp. 52–55, October 2014

27.

Poveda, E.A.L., Meddis, R.: A human nonlinear cochlear filterbank. J. Acoust. Soc. Am. 110(6), 3107–18 (2001)CrossRef

28.

Rabaoui, A., Davy, M., Rossignol, S., Ellouze, N.: Using one-class svms and wavelets for audio surveillance. IEEE Trans. Inf. Forensics Security 3(4), 763–775 (2008)CrossRef

29.

Strisciuglio, N., Azzopardi, G., Vento, M., Petkov, N.: Multiscale blood vessel delineation using B-COSFIRE filters. In: Azzopardi, G., Petkov, N. (eds.) CAIP 2015. LNCS, vol. 9257, pp. 300–312. Springer, Heidelberg (2015). doi:10.1007/978-3-319-23117-4_26 CrossRef

30.

Strisciuglio, N., Azzopardi, G., Vento, M., Petkov, N.: Supervised vessel delineation in retinal fundus images with the automatic selection of B-COSFIRE filters. Mach. Vis. Appl., 1–13 (2016). doi:10.1007/s00138-016-0781-7

31.

Sturm, B.L.: A survey of evaluation in music genre recognition. In: Nürnberger, A., Stober, S., Larsen, B., Detyniecki, M. (eds.) AMR 2012. LNCS, vol. 8382, pp. 29–66. Springer, Heidelberg (2014). doi:10.1007/978-3-319-12093-5_2

32.

Vacher, M., Istrate, D., Besacier, L., Serignat, J.F., Castelli, E.: Sound detection and classification for medical telesurvey. In: ACTA Press (eds.) Proceedings of the 2nd ICBME, Innsbruck, Austria, pp. 395–398, February 2004

33.

Valenzise, G., Gerosa, L., Tagliasacchi, M., Antonacci, F., Sarti, A.: Scream and gunshot detection and localization for audio-surveillance systems. In: IEEE AVSS, pp. 21–26 (2007)

34.

Wang, A.L.-C., Th Floor Block F.: An industrial-strength audio search algorithm. In: Proceedings of the 4th International Conference on Music Information Retrieval (2003)

Titel: Bio-Inspired Filters for Audio Analysis
verfasst von: Nicola Strisciuglio
Mario Vento
Nicolai Petkov
Verlag: Springer International Publishing
Buch: Brain-Inspired Computing
Print ISBN: 978-3-319-50861-0

Electronic ISBN: 978-3-319-50862-7

Copyright-Jahr: 2016
DOI: https://doi.org/10.1007/978-3-319-50862-7_8

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Nachhaltigkeitsaward Key Visual/© Cometis AG/Global ESG Monitor | Daniel Rupp | Generiert mit KI, Search Icon, Banner Hanser, Arbeitszeit/© granata68 / Fotolia, E-Autos im Fuhrpark: Lohnt sich das noch?/© Petair / stock.adobe.com, Kryptowährungen/© gopixa / Getty Images / iStock, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Sustainibility Finance/© Robert Kneschke / stock.adobe.com / Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.