Skip to main content
Erschienen in: Wireless Personal Communications 2/2020

24.04.2020

Abnormal Acoustic Event Detection Based on Orthogonal Matching Pursuit in Security Surveillance System

verfasst von: Pengnan Zhang, Juan Wei, Zhe Liu, Fangli Ning

Erschienen in: Wireless Personal Communications | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper presents a feature extraction approach for surveillance system aimed at achieving the automatic detection and recognition of public security events. The proposed approach first generates a Gabor dictionary based on the human auditory critical frequency bands, and then uses the orthogonal matching pursuit (OMP) algorithm to sparse abnormal audio signal. We select the optimal several important atoms from the Gabor dictionary and extract the scale, frequency, and translation parameters of the atoms to form the OMP feature. The performance of OMP feature is compared with traditional acoustic features and their joint features, using support vector machine (SVM) and random forest (RF) classifiers. Experiments have been performed to evaluate the effectiveness of the OMP feature for supplementing traditional acoustic features. The results show the superior performance classifier for abnormal acoustic event detection (AAED) is RF. Furthermore, the introduction of the combined features addresses the problems of low recognition accuracy and poor robustness for the surveillance system in practical applications.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Shi, B., Sun, M., Kao, C. C., et al. (2019). Semi-supervised acoustic event detection based on tri-training. In Electrical engineering and systems science audio an speech processing (eess.AS), 29 April. Shi, B., Sun, M., Kao, C. C., et al. (2019). Semi-supervised acoustic event detection based on tri-training. In Electrical engineering and systems science audio an speech processing (eess.AS), 29 April.
2.
Zurück zum Zitat Passler, S., & Fischer, W. J. (2014). Food intake monitoring: Automated chew event detection in chewing sounds. IEEE Journal of Biomedical and Health Informatics, 18(1), 278–289.CrossRef Passler, S., & Fischer, W. J. (2014). Food intake monitoring: Automated chew event detection in chewing sounds. IEEE Journal of Biomedical and Health Informatics, 18(1), 278–289.CrossRef
3.
Zurück zum Zitat Wang, J. C., Lin, C. H., Chen, B. W., et al. (2014). Gabor-based nonuniform scale-frequency map for environmental sound classification in home automation. IEEE Transactions on Automation Science & Engineering, 11(2), 607–613.CrossRef Wang, J. C., Lin, C. H., Chen, B. W., et al. (2014). Gabor-based nonuniform scale-frequency map for environmental sound classification in home automation. IEEE Transactions on Automation Science & Engineering, 11(2), 607–613.CrossRef
4.
Zurück zum Zitat Sehili, M. E. A., Lecouteux, B., Vacher, M., et al. (2012). Sound environment analysis in smart home. Lecture Notes in Computer Science, 7683(7683), 208–223.CrossRef Sehili, M. E. A., Lecouteux, B., Vacher, M., et al. (2012). Sound environment analysis in smart home. Lecture Notes in Computer Science, 7683(7683), 208–223.CrossRef
5.
Zurück zum Zitat Lopatka, K., Kotus, J., & Czyzewski, A. (2016). Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations. Multimedia Tools and Applications, 75(17), 10407–10439.CrossRef Lopatka, K., Kotus, J., & Czyzewski, A. (2016). Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations. Multimedia Tools and Applications, 75(17), 10407–10439.CrossRef
6.
Zurück zum Zitat Astapov, S., Berdnikova, J., Ehala, J., et al. (2018). Gunshot acoustic event identification and shooter localization in a WSN of asynchronous multichannel acoustic ground sensors. Multidimensional Systems and Signal Processing, 29, 563–595.MathSciNetCrossRef Astapov, S., Berdnikova, J., Ehala, J., et al. (2018). Gunshot acoustic event identification and shooter localization in a WSN of asynchronous multichannel acoustic ground sensors. Multidimensional Systems and Signal Processing, 29, 563–595.MathSciNetCrossRef
7.
Zurück zum Zitat Dufaux, A., Besacier, L., Ansorge, M., et al. (2000) Automatic sound detection and recognition for noisy environment. In Proceedings of 2000 10th European signal processing conference (pp. 1–4), Tampere, September 2000. Dufaux, A., Besacier, L., Ansorge, M., et al. (2000) Automatic sound detection and recognition for noisy environment. In Proceedings of 2000 10th European signal processing conference (pp. 1–4), Tampere, September 2000.
8.
Zurück zum Zitat Khunarsa, P., Lursinsap, C., & Raicharoen, T. (2010). Impulsive environment sound detection by neural classification of spectrogram and mel-frequency coefficient images. Advances in Neural Network Research and Applications, 67, 337–346.CrossRef Khunarsa, P., Lursinsap, C., & Raicharoen, T. (2010). Impulsive environment sound detection by neural classification of spectrogram and mel-frequency coefficient images. Advances in Neural Network Research and Applications, 67, 337–346.CrossRef
9.
Zurück zum Zitat Okuyucu, Ç., Sert, M., & Yazici, A. (2013) Audio feature and classifier analysis for efficient recognition of environmental sounds. In Proceedings 2013 IEEE international symposium on multimedia, Anaheim, CA, December 2013 Okuyucu, Ç., Sert, M., & Yazici, A. (2013) Audio feature and classifier analysis for efficient recognition of environmental sounds. In Proceedings 2013 IEEE international symposium on multimedia, Anaheim, CA, December 2013
10.
Zurück zum Zitat Xia, X., Togneri, R., Sohel, F., et al. (2018). Random forest classification based acoustic event detection utilizing contextual-information and bottleneck features. Pattern Recognition, 81, 1–13.CrossRef Xia, X., Togneri, R., Sohel, F., et al. (2018). Random forest classification based acoustic event detection utilizing contextual-information and bottleneck features. Pattern Recognition, 81, 1–13.CrossRef
11.
Zurück zum Zitat Cohen R , Lavner Y . (2012). Infant cry analysis and detection. In Proceedings of 27th conference on electrical and electronics engineers (pp. 1–5), Eilat, Israel, November 2012. Cohen R , Lavner Y . (2012). Infant cry analysis and detection. In Proceedings of 27th conference on electrical and electronics engineers (pp. 1–5), Eilat, Israel, November 2012.
12.
Zurück zum Zitat Nguyen, Q., & Choi, J. S. (2017). Matching pursuit based robust acoustic event classification for surveillance systems. Computers and Electrical Engineering, 57, 43–54.CrossRef Nguyen, Q., & Choi, J. S. (2017). Matching pursuit based robust acoustic event classification for surveillance systems. Computers and Electrical Engineering, 57, 43–54.CrossRef
13.
Zurück zum Zitat Wolpert, D. (1996). The lack of a priori distinctions between learning algorithms. Neural Computation, 8(7), 1341–1390.CrossRef Wolpert, D. (1996). The lack of a priori distinctions between learning algorithms. Neural Computation, 8(7), 1341–1390.CrossRef
14.
Zurück zum Zitat Fernandez-Delgado, M., Cernadas, E., Barro, S., et al. (2014). Do we need hundreds of classifiers to solve real world classification problems. Journal of Machine Learning Research, 15, 3133–3181.MathSciNetMATH Fernandez-Delgado, M., Cernadas, E., Barro, S., et al. (2014). Do we need hundreds of classifiers to solve real world classification problems. Journal of Machine Learning Research, 15, 3133–3181.MathSciNetMATH
15.
Zurück zum Zitat Hrabina, M., & Sigmund, M. (2017). Comparison of feature performance in gunshot detection depending on noise degradation. In Proceedings 27th international conference on RADIOELEKTRONIKA (pp. 1–4), Brno, Czech Republic, April 2017. Hrabina, M., & Sigmund, M. (2017). Comparison of feature performance in gunshot detection depending on noise degradation. In Proceedings 27th international conference on RADIOELEKTRONIKA (pp. 1–4), Brno, Czech Republic, April 2017.
16.
Zurück zum Zitat Chu, S., Narayanan, S., & Kuo, C. C. J. (2009). Environmental sound recognition with time-frequency audio features. IEEE Transactions on Audio, Speech and Language Processing, 17(6), 1142–1158.CrossRef Chu, S., Narayanan, S., & Kuo, C. C. J. (2009). Environmental sound recognition with time-frequency audio features. IEEE Transactions on Audio, Speech and Language Processing, 17(6), 1142–1158.CrossRef
17.
Zurück zum Zitat Mallat, S. G., & Zhang, Z. (1993). Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing, 41(12), 3397–3415.CrossRef Mallat, S. G., & Zhang, Z. (1993). Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing, 41(12), 3397–3415.CrossRef
18.
Zurück zum Zitat Li, D., Tam, J., & Toub, D. (2013). Auditory scene classification using machine learning techniques. In Proceedings of the IEEE AASP challenge on detection and classification of acoustic scenes and events, New Paltz, NY, October 2013. Li, D., Tam, J., & Toub, D. (2013). Auditory scene classification using machine learning techniques. In Proceedings of the IEEE AASP challenge on detection and classification of acoustic scenes and events, New Paltz, NY, October 2013.
19.
Zurück zum Zitat Petrovsky, A., Azarov, E., & Petrovsky, A. (2011). Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding. Signal Processing, 91(6), 1489–1504.CrossRef Petrovsky, A., Azarov, E., & Petrovsky, A. (2011). Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding. Signal Processing, 91(6), 1489–1504.CrossRef
20.
Zurück zum Zitat Sugden, P., & Canagarajah, N. (2003). Underdetermined noisy blind separation using dual matching pursuits. Electronics Letters, 39(1), 158–160.CrossRef Sugden, P., & Canagarajah, N. (2003). Underdetermined noisy blind separation using dual matching pursuits. Electronics Letters, 39(1), 158–160.CrossRef
21.
Zurück zum Zitat Wei, P. I. (2010). Generalized demodulation method based on multi-scale chirplet and sparse signal decomposition and its application to gear fault diagnosis. Journal of Mechanical Engineering, 46(15), 63–68. Wei, P. I. (2010). Generalized demodulation method based on multi-scale chirplet and sparse signal decomposition and its application to gear fault diagnosis. Journal of Mechanical Engineering, 46(15), 63–68.
22.
Zurück zum Zitat Schroder, J., Goetze, S., & Anemuller, J. (2015). Spectro-temporal gabor filterbank features for acoustic event detection. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(12), 2198–2208.CrossRef Schroder, J., Goetze, S., & Anemuller, J. (2015). Spectro-temporal gabor filterbank features for acoustic event detection. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(12), 2198–2208.CrossRef
23.
Zurück zum Zitat Soussen, C., Rémi, G., Jérôme, I., et al. (2013). Joint K-step analysis of orthogonal matching pursuit and orthogonal least squares. IEEE Transactions on Information Theory, 59(5), 3158–3174.MathSciNetCrossRef Soussen, C., Rémi, G., Jérôme, I., et al. (2013). Joint K-step analysis of orthogonal matching pursuit and orthogonal least squares. IEEE Transactions on Information Theory, 59(5), 3158–3174.MathSciNetCrossRef
24.
Zurück zum Zitat Pace, M. A., Nicolás, U., Masson, F., et al. (2017). Critical frequencies for the annoyance judgments of noises with tonal component, to develop an equal-annoyance contour. Acoustical Society of America Journal, 141(5), 3883.CrossRef Pace, M. A., Nicolás, U., Masson, F., et al. (2017). Critical frequencies for the annoyance judgments of noises with tonal component, to develop an equal-annoyance contour. Acoustical Society of America Journal, 141(5), 3883.CrossRef
25.
Zurück zum Zitat Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands (Frequenzgruppen). The Journal of the Acoustical Society of America, 33(2), 248.CrossRef Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands (Frequenzgruppen). The Journal of the Acoustical Society of America, 33(2), 248.CrossRef
26.
27.
Zurück zum Zitat Phan, H., Marco, M., Mazur, R., et al. (2015). Random regression forests for acoustic event detection and classification. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(1), 20–31.CrossRef Phan, H., Marco, M., Mazur, R., et al. (2015). Random regression forests for acoustic event detection and classification. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(1), 20–31.CrossRef
Metadaten
Titel
Abnormal Acoustic Event Detection Based on Orthogonal Matching Pursuit in Security Surveillance System
verfasst von
Pengnan Zhang
Juan Wei
Zhe Liu
Fangli Ning
Publikationsdatum
24.04.2020
Verlag
Springer US
Erschienen in
Wireless Personal Communications / Ausgabe 2/2020
Print ISSN: 0929-6212
Elektronische ISSN: 1572-834X
DOI
https://doi.org/10.1007/s11277-020-07405-z

Weitere Artikel der Ausgabe 2/2020

Wireless Personal Communications 2/2020 Zur Ausgabe

Neuer Inhalt