Skip to main content
Top
Published in: Wireless Personal Communications 2/2020

24-04-2020

Abnormal Acoustic Event Detection Based on Orthogonal Matching Pursuit in Security Surveillance System

Authors: Pengnan Zhang, Juan Wei, Zhe Liu, Fangli Ning

Published in: Wireless Personal Communications | Issue 2/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper presents a feature extraction approach for surveillance system aimed at achieving the automatic detection and recognition of public security events. The proposed approach first generates a Gabor dictionary based on the human auditory critical frequency bands, and then uses the orthogonal matching pursuit (OMP) algorithm to sparse abnormal audio signal. We select the optimal several important atoms from the Gabor dictionary and extract the scale, frequency, and translation parameters of the atoms to form the OMP feature. The performance of OMP feature is compared with traditional acoustic features and their joint features, using support vector machine (SVM) and random forest (RF) classifiers. Experiments have been performed to evaluate the effectiveness of the OMP feature for supplementing traditional acoustic features. The results show the superior performance classifier for abnormal acoustic event detection (AAED) is RF. Furthermore, the introduction of the combined features addresses the problems of low recognition accuracy and poor robustness for the surveillance system in practical applications.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Shi, B., Sun, M., Kao, C. C., et al. (2019). Semi-supervised acoustic event detection based on tri-training. In Electrical engineering and systems science audio an speech processing (eess.AS), 29 April. Shi, B., Sun, M., Kao, C. C., et al. (2019). Semi-supervised acoustic event detection based on tri-training. In Electrical engineering and systems science audio an speech processing (eess.AS), 29 April.
2.
go back to reference Passler, S., & Fischer, W. J. (2014). Food intake monitoring: Automated chew event detection in chewing sounds. IEEE Journal of Biomedical and Health Informatics, 18(1), 278–289.CrossRef Passler, S., & Fischer, W. J. (2014). Food intake monitoring: Automated chew event detection in chewing sounds. IEEE Journal of Biomedical and Health Informatics, 18(1), 278–289.CrossRef
3.
go back to reference Wang, J. C., Lin, C. H., Chen, B. W., et al. (2014). Gabor-based nonuniform scale-frequency map for environmental sound classification in home automation. IEEE Transactions on Automation Science & Engineering, 11(2), 607–613.CrossRef Wang, J. C., Lin, C. H., Chen, B. W., et al. (2014). Gabor-based nonuniform scale-frequency map for environmental sound classification in home automation. IEEE Transactions on Automation Science & Engineering, 11(2), 607–613.CrossRef
4.
go back to reference Sehili, M. E. A., Lecouteux, B., Vacher, M., et al. (2012). Sound environment analysis in smart home. Lecture Notes in Computer Science, 7683(7683), 208–223.CrossRef Sehili, M. E. A., Lecouteux, B., Vacher, M., et al. (2012). Sound environment analysis in smart home. Lecture Notes in Computer Science, 7683(7683), 208–223.CrossRef
5.
go back to reference Lopatka, K., Kotus, J., & Czyzewski, A. (2016). Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations. Multimedia Tools and Applications, 75(17), 10407–10439.CrossRef Lopatka, K., Kotus, J., & Czyzewski, A. (2016). Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations. Multimedia Tools and Applications, 75(17), 10407–10439.CrossRef
6.
go back to reference Astapov, S., Berdnikova, J., Ehala, J., et al. (2018). Gunshot acoustic event identification and shooter localization in a WSN of asynchronous multichannel acoustic ground sensors. Multidimensional Systems and Signal Processing, 29, 563–595.MathSciNetCrossRef Astapov, S., Berdnikova, J., Ehala, J., et al. (2018). Gunshot acoustic event identification and shooter localization in a WSN of asynchronous multichannel acoustic ground sensors. Multidimensional Systems and Signal Processing, 29, 563–595.MathSciNetCrossRef
7.
go back to reference Dufaux, A., Besacier, L., Ansorge, M., et al. (2000) Automatic sound detection and recognition for noisy environment. In Proceedings of 2000 10th European signal processing conference (pp. 1–4), Tampere, September 2000. Dufaux, A., Besacier, L., Ansorge, M., et al. (2000) Automatic sound detection and recognition for noisy environment. In Proceedings of 2000 10th European signal processing conference (pp. 1–4), Tampere, September 2000.
8.
go back to reference Khunarsa, P., Lursinsap, C., & Raicharoen, T. (2010). Impulsive environment sound detection by neural classification of spectrogram and mel-frequency coefficient images. Advances in Neural Network Research and Applications, 67, 337–346.CrossRef Khunarsa, P., Lursinsap, C., & Raicharoen, T. (2010). Impulsive environment sound detection by neural classification of spectrogram and mel-frequency coefficient images. Advances in Neural Network Research and Applications, 67, 337–346.CrossRef
9.
go back to reference Okuyucu, Ç., Sert, M., & Yazici, A. (2013) Audio feature and classifier analysis for efficient recognition of environmental sounds. In Proceedings 2013 IEEE international symposium on multimedia, Anaheim, CA, December 2013 Okuyucu, Ç., Sert, M., & Yazici, A. (2013) Audio feature and classifier analysis for efficient recognition of environmental sounds. In Proceedings 2013 IEEE international symposium on multimedia, Anaheim, CA, December 2013
10.
go back to reference Xia, X., Togneri, R., Sohel, F., et al. (2018). Random forest classification based acoustic event detection utilizing contextual-information and bottleneck features. Pattern Recognition, 81, 1–13.CrossRef Xia, X., Togneri, R., Sohel, F., et al. (2018). Random forest classification based acoustic event detection utilizing contextual-information and bottleneck features. Pattern Recognition, 81, 1–13.CrossRef
11.
go back to reference Cohen R , Lavner Y . (2012). Infant cry analysis and detection. In Proceedings of 27th conference on electrical and electronics engineers (pp. 1–5), Eilat, Israel, November 2012. Cohen R , Lavner Y . (2012). Infant cry analysis and detection. In Proceedings of 27th conference on electrical and electronics engineers (pp. 1–5), Eilat, Israel, November 2012.
12.
go back to reference Nguyen, Q., & Choi, J. S. (2017). Matching pursuit based robust acoustic event classification for surveillance systems. Computers and Electrical Engineering, 57, 43–54.CrossRef Nguyen, Q., & Choi, J. S. (2017). Matching pursuit based robust acoustic event classification for surveillance systems. Computers and Electrical Engineering, 57, 43–54.CrossRef
13.
go back to reference Wolpert, D. (1996). The lack of a priori distinctions between learning algorithms. Neural Computation, 8(7), 1341–1390.CrossRef Wolpert, D. (1996). The lack of a priori distinctions between learning algorithms. Neural Computation, 8(7), 1341–1390.CrossRef
14.
go back to reference Fernandez-Delgado, M., Cernadas, E., Barro, S., et al. (2014). Do we need hundreds of classifiers to solve real world classification problems. Journal of Machine Learning Research, 15, 3133–3181.MathSciNetMATH Fernandez-Delgado, M., Cernadas, E., Barro, S., et al. (2014). Do we need hundreds of classifiers to solve real world classification problems. Journal of Machine Learning Research, 15, 3133–3181.MathSciNetMATH
15.
go back to reference Hrabina, M., & Sigmund, M. (2017). Comparison of feature performance in gunshot detection depending on noise degradation. In Proceedings 27th international conference on RADIOELEKTRONIKA (pp. 1–4), Brno, Czech Republic, April 2017. Hrabina, M., & Sigmund, M. (2017). Comparison of feature performance in gunshot detection depending on noise degradation. In Proceedings 27th international conference on RADIOELEKTRONIKA (pp. 1–4), Brno, Czech Republic, April 2017.
16.
go back to reference Chu, S., Narayanan, S., & Kuo, C. C. J. (2009). Environmental sound recognition with time-frequency audio features. IEEE Transactions on Audio, Speech and Language Processing, 17(6), 1142–1158.CrossRef Chu, S., Narayanan, S., & Kuo, C. C. J. (2009). Environmental sound recognition with time-frequency audio features. IEEE Transactions on Audio, Speech and Language Processing, 17(6), 1142–1158.CrossRef
17.
go back to reference Mallat, S. G., & Zhang, Z. (1993). Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing, 41(12), 3397–3415.CrossRef Mallat, S. G., & Zhang, Z. (1993). Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing, 41(12), 3397–3415.CrossRef
18.
go back to reference Li, D., Tam, J., & Toub, D. (2013). Auditory scene classification using machine learning techniques. In Proceedings of the IEEE AASP challenge on detection and classification of acoustic scenes and events, New Paltz, NY, October 2013. Li, D., Tam, J., & Toub, D. (2013). Auditory scene classification using machine learning techniques. In Proceedings of the IEEE AASP challenge on detection and classification of acoustic scenes and events, New Paltz, NY, October 2013.
19.
go back to reference Petrovsky, A., Azarov, E., & Petrovsky, A. (2011). Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding. Signal Processing, 91(6), 1489–1504.CrossRef Petrovsky, A., Azarov, E., & Petrovsky, A. (2011). Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding. Signal Processing, 91(6), 1489–1504.CrossRef
20.
go back to reference Sugden, P., & Canagarajah, N. (2003). Underdetermined noisy blind separation using dual matching pursuits. Electronics Letters, 39(1), 158–160.CrossRef Sugden, P., & Canagarajah, N. (2003). Underdetermined noisy blind separation using dual matching pursuits. Electronics Letters, 39(1), 158–160.CrossRef
21.
go back to reference Wei, P. I. (2010). Generalized demodulation method based on multi-scale chirplet and sparse signal decomposition and its application to gear fault diagnosis. Journal of Mechanical Engineering, 46(15), 63–68. Wei, P. I. (2010). Generalized demodulation method based on multi-scale chirplet and sparse signal decomposition and its application to gear fault diagnosis. Journal of Mechanical Engineering, 46(15), 63–68.
22.
go back to reference Schroder, J., Goetze, S., & Anemuller, J. (2015). Spectro-temporal gabor filterbank features for acoustic event detection. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(12), 2198–2208.CrossRef Schroder, J., Goetze, S., & Anemuller, J. (2015). Spectro-temporal gabor filterbank features for acoustic event detection. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(12), 2198–2208.CrossRef
23.
go back to reference Soussen, C., Rémi, G., Jérôme, I., et al. (2013). Joint K-step analysis of orthogonal matching pursuit and orthogonal least squares. IEEE Transactions on Information Theory, 59(5), 3158–3174.MathSciNetCrossRef Soussen, C., Rémi, G., Jérôme, I., et al. (2013). Joint K-step analysis of orthogonal matching pursuit and orthogonal least squares. IEEE Transactions on Information Theory, 59(5), 3158–3174.MathSciNetCrossRef
24.
go back to reference Pace, M. A., Nicolás, U., Masson, F., et al. (2017). Critical frequencies for the annoyance judgments of noises with tonal component, to develop an equal-annoyance contour. Acoustical Society of America Journal, 141(5), 3883.CrossRef Pace, M. A., Nicolás, U., Masson, F., et al. (2017). Critical frequencies for the annoyance judgments of noises with tonal component, to develop an equal-annoyance contour. Acoustical Society of America Journal, 141(5), 3883.CrossRef
25.
go back to reference Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands (Frequenzgruppen). The Journal of the Acoustical Society of America, 33(2), 248.CrossRef Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands (Frequenzgruppen). The Journal of the Acoustical Society of America, 33(2), 248.CrossRef
26.
27.
go back to reference Phan, H., Marco, M., Mazur, R., et al. (2015). Random regression forests for acoustic event detection and classification. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(1), 20–31.CrossRef Phan, H., Marco, M., Mazur, R., et al. (2015). Random regression forests for acoustic event detection and classification. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(1), 20–31.CrossRef
Metadata
Title
Abnormal Acoustic Event Detection Based on Orthogonal Matching Pursuit in Security Surveillance System
Authors
Pengnan Zhang
Juan Wei
Zhe Liu
Fangli Ning
Publication date
24-04-2020
Publisher
Springer US
Published in
Wireless Personal Communications / Issue 2/2020
Print ISSN: 0929-6212
Electronic ISSN: 1572-834X
DOI
https://doi.org/10.1007/s11277-020-07405-z

Other articles of this Issue 2/2020

Wireless Personal Communications 2/2020 Go to the issue