Skip to main content
Top
Published in: International Journal of Speech Technology 3/2016

27-07-2016

Robust acoustic bird recognition for habitat monitoring with wireless sensor networks

Authors: Amira Boulmaiz, Djemil Messadeg, Noureddine Doghmane, Abdelmalik Taleb-Ahmed

Published in: International Journal of Speech Technology | Issue 3/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The key solution to study birds in their natural habitat is the continuous survey using wireless sensors networks (WSN). The final objective of this study is to conceive a system for monitoring threatened bird species using audio sensor nodes. The principal feature for their recognition is their sound. The main limitations encountered with this process are environmental noise and energy consumption in sensor nodes. Over the years, a variety of birdsong classification methods has been introduced, but very few have focused to find an adequate one for WSN. In this paper, a tonal region detector (TRD) using sigmoid function is proposed. This approach for noise power estimation offers flexibility, since the slope and the mean of the sigmoid function can be adapted autonomously for a better trade-off between noise overvaluation and undervaluation. Once the tonal regions in the noisy bird sound are detected, the features gammatone teager energy cepstral coefficients (GTECC) post-processed by quantile-based cepstral normalization were extracted from the above signals for classification using deep neural network classifier. Experimental results for the identification of 36 bird species from Tonga lake (northeast of Algeria) demonstrate that the proposed TRD–GTECC feature is highly effective and performs satisfactorily compared to popular front-ends considered in this study. Moreover, recognition performance, noise immunity and energy consumption are considerably improved after tonal region detection, indicating that it is a very suitable approach for the acoustic bird recognition in complex environments with wireless sensor nodes.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Aissaoui, R., Tahar, A., Saheb, M., Guergueb, L., & Houhamdi, M. (2011). Diurnal behaviour of Ferruginous Duck Aythya nyroca wintering at the El-Kala wetlands (Northeast Algeria). Bulletin de l’Institut Scientifique, Rabat, section Sciences de la Vie, 33(2), 67–75. Aissaoui, R., Tahar, A., Saheb, M., Guergueb, L., & Houhamdi, M. (2011). Diurnal behaviour of Ferruginous Duck Aythya nyroca wintering at the El-Kala wetlands (Northeast Algeria). Bulletin de l’Institut Scientifique, Rabat, section Sciences de la Vie, 33(2), 67–75.
go back to reference Bardeli, R., Wolff, D., Kurth, F., Koch, M., Tauchert, K. H., & Frommolt, K. H. (2010). Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring. Pattern Recognition Letters, 31(12), 1524–1534.CrossRef Bardeli, R., Wolff, D., Kurth, F., Koch, M., Tauchert, K. H., & Frommolt, K. H. (2010). Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring. Pattern Recognition Letters, 31(12), 1524–1534.CrossRef
go back to reference Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. Acoustics, Speech and Signal Processing, IEEE Transactions on, 27(2), 113–120.CrossRef Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. Acoustics, Speech and Signal Processing, IEEE Transactions on, 27(2), 113–120.CrossRef
go back to reference Bořil, H., & Hansen, J. H. (2010). Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments. Audio, Speech, and Language Processing, IEEE Transactions on, 18(6), 1379–1393.CrossRef Bořil, H., & Hansen, J. H. (2010). Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments. Audio, Speech, and Language Processing, IEEE Transactions on, 18(6), 1379–1393.CrossRef
go back to reference Bořil, H., & Hansen, J. H. (2011). UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on (pp. 4472–4475). IEEE. Bořil, H., & Hansen, J. H. (2011). UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on (pp. 4472–4475). IEEE.
go back to reference Brumm, H. (2004). The impact of environmental noise on song amplitude in a territorial bird. Journal of Animal Ecology, 73(3), 434–440.CrossRef Brumm, H. (2004). The impact of environmental noise on song amplitude in a territorial bird. Journal of Animal Ecology, 73(3), 434–440.CrossRef
go back to reference Chettibi, F., Khelifa, R., Aberkane, M., Bouslama, Z., & Houhamdi, M. (2013). Diurnal activity budget and breeding ecology of the White-headed Duck Oxyura leucocephala at Lake Tonga (North-east Algeria). Zoology and Ecology, 23(3), 183–190.CrossRef Chettibi, F., Khelifa, R., Aberkane, M., Bouslama, Z., & Houhamdi, M. (2013). Diurnal activity budget and breeding ecology of the White-headed Duck Oxyura leucocephala at Lake Tonga (North-east Algeria). Zoology and Ecology, 23(3), 183–190.CrossRef
go back to reference Chu, W., & Blumstein, D. T. (2011). Noise robust bird song detection using syllable pattern-based hidden Markov models. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on (pp. 345–348). IEEE. Chu, W., & Blumstein, D. T. (2011). Noise robust bird song detection using syllable pattern-based hidden Markov models. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on (pp. 345–348). IEEE.
go back to reference Cireşan, D., Meier, U., Masci, J., & Schmidhuber, J. (2012). Multi-column deep neural network for traffic sign classification. Neural Networks, 32, 333–338.CrossRef Cireşan, D., Meier, U., Masci, J., & Schmidhuber, J. (2012). Multi-column deep neural network for traffic sign classification. Neural Networks, 32, 333–338.CrossRef
go back to reference Cohen, I. (2003). Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging. Speech and Audio Processing, IEEE Transactions on, 11(5), 466–475.CrossRef Cohen, I. (2003). Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging. Speech and Audio Processing, IEEE Transactions on, 11(5), 466–475.CrossRef
go back to reference Dahl, G. E., Yu, D., Deng, L., & Acero, A. (2012). Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 20(1), 30–42.CrossRef Dahl, G. E., Yu, D., Deng, L., & Acero, A. (2012). Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 20(1), 30–42.CrossRef
go back to reference Davis, S. B., & Mermelstein, P. (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. Acoustics, Speech and Signal Processing, IEEE Transactions on, 28(4), 357–366.CrossRef Davis, S. B., & Mermelstein, P. (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. Acoustics, Speech and Signal Processing, IEEE Transactions on, 28(4), 357–366.CrossRef
go back to reference De Oliveira, A. G., Ventura, T. M., Ganchev, T. D., de Figueiredo, J. M., Jahn, O., Marques, M. I., et al. (2015). Bird acoustic activity detection based on morphological filtering of the spectrogram. Applied Acoustics, 98, 34–42.CrossRef De Oliveira, A. G., Ventura, T. M., Ganchev, T. D., de Figueiredo, J. M., Jahn, O., Marques, M. I., et al. (2015). Bird acoustic activity detection based on morphological filtering of the spectrogram. Applied Acoustics, 98, 34–42.CrossRef
go back to reference Deng, L., Hinton, G., & Kingsbury, B. (2013). New types of deep neural network learning for speech recognition and related applications: An overview. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on (pp. 8599–8603). IEEE. Deng, L., Hinton, G., & Kingsbury, B. (2013). New types of deep neural network learning for speech recognition and related applications: An overview. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on (pp. 8599–8603). IEEE.
go back to reference Deng, L., Yu, D., & Platt, J. (2012). Scalable stacking and learning for building deep architectures. In 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 2133–2136). IEEE. Deng, L., Yu, D., & Platt, J. (2012). Scalable stacking and learning for building deep architectures. In 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 2133–2136). IEEE.
go back to reference Dharanipragada, S., & Padmanabhan, M. (2000). A nonlinear unsupervised adaptation technique for speech recognition. In INTERSPEECH (pp. 556–559). Dharanipragada, S., & Padmanabhan, M. (2000). A nonlinear unsupervised adaptation technique for speech recognition. In INTERSPEECH (pp. 556–559).
go back to reference Gerkmann, T., & Hendriks, R. C. (2012). Improved MMSE-based noise PSD tracking using temporal cepstrum smoothing. In 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 105–108). IEEE. Gerkmann, T., & Hendriks, R. C. (2012). Improved MMSE-based noise PSD tracking using temporal cepstrum smoothing. In 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 105–108). IEEE.
go back to reference Ghitza, O. (1994). Auditory models and human performance in tasks related to speech coding and speech recognition. Speech and Audio Processing, IEEE Transactions on, 2(1), 115–132.CrossRef Ghitza, O. (1994). Auditory models and human performance in tasks related to speech coding and speech recognition. Speech and Audio Processing, IEEE Transactions on, 2(1), 115–132.CrossRef
go back to reference Glasberg, B. R., & Moore, B. C. (1990). Derivation of auditory filter shapes from notched-noise data. Hearing Research, 47(1), 103–138.CrossRef Glasberg, B. R., & Moore, B. C. (1990). Derivation of auditory filter shapes from notched-noise data. Hearing Research, 47(1), 103–138.CrossRef
go back to reference Gros-Desormeaux, H., Vidot, N., & Hunel, P. (2010). Wildlife assessment using wireless sensor networks. Rijeka: INTECH Open Access Publisher.CrossRef Gros-Desormeaux, H., Vidot, N., & Hunel, P. (2010). Wildlife assessment using wireless sensor networks. Rijeka: INTECH Open Access Publisher.CrossRef
go back to reference Hilger, F., & Ney, H. (2006). Quantile based histogram equalization for noise robust large vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 14(3), 845–854.CrossRef Hilger, F., & Ney, H. (2006). Quantile based histogram equalization for noise robust large vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 14(3), 845–854.CrossRef
go back to reference Hill, J., & Culler, D. (2002). A wireless embedded sensor architecture for system-level optimization. UC Berkeley Technical Report. Hill, J., & Culler, D. (2002). A wireless embedded sensor architecture for system-level optimization. UC Berkeley Technical Report.
go back to reference Hinton, G., Deng, L., Yu, D., Dahl, G. E., Mohamed, A. R., Jaitly, N., et al. (2012). Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. Signal Processing Magazine IEEE, 29(6), 82–97.CrossRef Hinton, G., Deng, L., Yu, D., Dahl, G. E., Mohamed, A. R., Jaitly, N., et al. (2012). Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. Signal Processing Magazine IEEE, 29(6), 82–97.CrossRef
go back to reference Irino, T., & Patterson, R. D. (1997). A time-domain, level-dependent auditory filter: The gammachirp. The Journal of the Acoustical Society of America, 101(1), 412–419.CrossRef Irino, T., & Patterson, R. D. (1997). A time-domain, level-dependent auditory filter: The gammachirp. The Journal of the Acoustical Society of America, 101(1), 412–419.CrossRef
go back to reference Jančovič, P., & Köküer, M. (2011). Automatic detection and recognition of tonal bird sounds in noisy environments. EURASIP Journal on Advances in Signal Processing, 2011(1), 982936.CrossRef Jančovič, P., & Köküer, M. (2011). Automatic detection and recognition of tonal bird sounds in noisy environments. EURASIP Journal on Advances in Signal Processing, 2011(1), 982936.CrossRef
go back to reference Kaiser, J. F. (1990). On a simple algorithm to calculate the energy’of a signal. In Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on (pp. 381–384). Kaiser, J. F. (1990). On a simple algorithm to calculate the energy’of a signal. In Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on (pp. 381–384).
go back to reference Kim, C., & Stern, R. M. (2008). Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis. In INTERSPEECH (pp. 2598–2601). Kim, C., & Stern, R. M. (2008). Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis. In INTERSPEECH (pp. 2598–2601).
go back to reference Kim, C., & Stern, R. M. (2012). Power-normalized cepstral coefficients (PNCC) for robust speech recognition. In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on (pp. 4101–4104). IEEE. Kim, C., & Stern, R. M. (2012). Power-normalized cepstral coefficients (PNCC) for robust speech recognition. In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on (pp. 4101–4104). IEEE.
go back to reference Kortelainen, J., & Noponen, K. (2005). Neural Networks, Intelligent Systems. Reading: Addison-Wesley Publishing Co. Kortelainen, J., & Noponen, K. (2005). Neural Networks, Intelligent Systems. Reading: Addison-Wesley Publishing Co.
go back to reference Laibowitz, M., Gips, J., Aylward, R., Pentland, A., & Paradiso, J. A. (2006). A sensor network for social dynamics. In Proceedings of the 5th international conference on Information processing in sensor networks (pp. 483–491). ACM. Laibowitz, M., Gips, J., Aylward, R., Pentland, A., & Paradiso, J. A. (2006). A sensor network for social dynamics. In Proceedings of the 5th international conference on Information processing in sensor networks (pp. 483–491). ACM.
go back to reference Mainwaring, A., Culler, D., Polastre, J., Szewczyk, R., & Anderson, J. (2002). Wireless sensor networks for habitat monitoring. In Proceedings of the 1st ACM international workshop on Wireless sensor networks and applications (pp. 88–97). ACM. Mainwaring, A., Culler, D., Polastre, J., Szewczyk, R., & Anderson, J. (2002). Wireless sensor networks for habitat monitoring. In Proceedings of the 1st ACM international workshop on Wireless sensor networks and applications (pp. 88–97). ACM.
go back to reference Maragos, P., Kaiser, J. F., & Quatieri, T. F. (1993). On amplitude and frequency demodulation using energy operators. IEEE Transactions on signal processing, 41(4), 1532–1550.CrossRefMATH Maragos, P., Kaiser, J. F., & Quatieri, T. F. (1993). On amplitude and frequency demodulation using energy operators. IEEE Transactions on signal processing, 41(4), 1532–1550.CrossRefMATH
go back to reference McIlraith, A. L., & Card, H. C. (1997). Bird song identification using artificial neural networks and statistical analysis. In Electrical and Computer Engineering, 1997. Engineering Innovation: Voyage of Discovery. IEEE 1997 Canadian Conference on (Vol. 1, pp. 63–66). IEEE. McIlraith, A. L., & Card, H. C. (1997). Bird song identification using artificial neural networks and statistical analysis. In Electrical and Computer Engineering, 1997. Engineering Innovation: Voyage of Discovery. IEEE 1997 Canadian Conference on (Vol. 1, pp. 63–66). IEEE.
go back to reference Olguín, D. O., & Pentland, A. S. (2008). Social sensors for automatic data collection. In AMCIS 2008 Proceedings, p. 171. Olguín, D. O., & Pentland, A. S. (2008). Social sensors for automatic data collection. In AMCIS 2008 Proceedings, p. 171.
go back to reference Patil, H., & Parhi, K. K. (2010). Novel variable length Teager energy based features for person recognition from their hum. In Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on (pp. 4526–4529). IEEE. Patil, H., & Parhi, K. K. (2010). Novel variable length Teager energy based features for person recognition from their hum. In Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on (pp. 4526–4529). IEEE.
go back to reference Patterson, R. D., Robinson, K., Holdsworth, J., McKeown, D., Zhang, C., & Allerhand, M. (1992). Complex sounds and auditory images. Auditory Physiology and Perception, 83, 429–446.CrossRef Patterson, R. D., Robinson, K., Holdsworth, J., McKeown, D., Zhang, C., & Allerhand, M. (1992). Complex sounds and auditory images. Auditory Physiology and Perception, 83, 429–446.CrossRef
go back to reference Patti, A., & Williamson, G. (2013). Methods for classification of nocturnal migratory bird vocalizations using Pseudo Wigner-Ville Transform. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on (pp. 758–762). IEEE. Patti, A., & Williamson, G. (2013). Methods for classification of nocturnal migratory bird vocalizations using Pseudo Wigner-Ville Transform. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on (pp. 758–762). IEEE.
go back to reference Potamitis, I. (2015). Unsupervised dictionary extraction of bird vocalisations and new tools on assessing and visualising bird activity. Ecological Informatics, 26, 6–17.CrossRef Potamitis, I. (2015). Unsupervised dictionary extraction of bird vocalisations and new tools on assessing and visualising bird activity. Ecological Informatics, 26, 6–17.CrossRef
go back to reference Prasad, N. V., & Umesh, S. (2013). Improved cepstral mean and variance normalization using Bayesian framework. In Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on (pp. 156–161). IEEE. Prasad, N. V., & Umesh, S. (2013). Improved cepstral mean and variance normalization using Bayesian framework. In Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on (pp. 156–161). IEEE.
go back to reference Ptacek, L., Machlica, L., Linhart, P., Jaska, P., & Muller, L. (2015). Automatic recognition of bird individuals on an open set using as-is recordings. Bioacoustics, 25(1), 1–19. Ptacek, L., Machlica, L., Linhart, P., Jaska, P., & Muller, L. (2015). Automatic recognition of bird individuals on an open set using as-is recordings. Bioacoustics, 25(1), 1–19.
go back to reference Rangachari, S., & Loizou, P. C. (2006). A noise-estimation algorithm for highly non-stationary environments. Speech Communication, 48(2), 220–231.CrossRef Rangachari, S., & Loizou, P. C. (2006). A noise-estimation algorithm for highly non-stationary environments. Speech Communication, 48(2), 220–231.CrossRef
go back to reference Sadjadi, S. O., Bořil, H., & Hansen, J. H. (2012). A comparison of front-end compensation strategies for robust LVCSR under room reverberation and increased vocal effort. In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on (pp. 4701–4704). IEEE. Sadjadi, S. O., Bořil, H., & Hansen, J. H. (2012). A comparison of front-end compensation strategies for robust LVCSR under room reverberation and increased vocal effort. In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on (pp. 4701–4704). IEEE.
go back to reference Sainath, T. N., Mohamed, A. R., Kingsbury, B., & Ramabhadran, B. (2013). Deep convolutional neural networks for LVCSR. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (pp. 8614–8618). IEEE. Sainath, T. N., Mohamed, A. R., Kingsbury, B., & Ramabhadran, B. (2013). Deep convolutional neural networks for LVCSR. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (pp. 8614–8618). IEEE.
go back to reference Siniscalchi, S. M., Yu, D., Deng, L., & Lee, C. H. (2013). Exploiting deep neural networks for detection-based speech recognition. Neurocomputing, 106, 148–157.CrossRef Siniscalchi, S. M., Yu, D., Deng, L., & Lee, C. H. (2013). Exploiting deep neural networks for detection-based speech recognition. Neurocomputing, 106, 148–157.CrossRef
go back to reference Slaney, M. (1998). Auditory toolbox. Interval Research Corporation, Technical Report (Vol. 10). Slaney, M. (1998). Auditory toolbox. Interval Research Corporation, Technical Report (Vol. 10).
go back to reference Stattner, E. (2012). Contributions à l’étude des réseaux sociaux: propagation, fouille, collecte de données (Doctoral dissertation, Université des Antilles-Guyane). Stattner, E. (2012). Contributions à l’étude des réseaux sociaux: propagation, fouille, collecte de données (Doctoral dissertation, Université des Antilles-Guyane).
go back to reference Stattner, E., Hunel, P., Vidot, N., & Collard, M. (2011). Acoustic scheme to count bird songs with wireless sensor networks. In World of Wireless, Mobile and Multimedia Networks (WoWMoM), 2011 IEEE International Symposium on a(pp. 1–3). IEEE. Stattner, E., Hunel, P., Vidot, N., & Collard, M. (2011). Acoustic scheme to count bird songs with wireless sensor networks. In World of Wireless, Mobile and Multimedia Networks (WoWMoM), 2011 IEEE International Symposium on a(pp. 1–3). IEEE.
go back to reference Stowell, D., & Plumbley, M. D. (2014). Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning. PeerJ, 2, e488.CrossRef Stowell, D., & Plumbley, M. D. (2014). Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning. PeerJ, 2, e488.CrossRef
go back to reference Szewczyk, R., Osterweil, E., Polastre, J., Hamilton, M., Mainwaring, A., & Estrin, D. (2004). Habitat monitoring with sensor networks. Communications of the ACM, 47(6), 34–40.CrossRef Szewczyk, R., Osterweil, E., Polastre, J., Hamilton, M., Mainwaring, A., & Estrin, D. (2004). Habitat monitoring with sensor networks. Communications of the ACM, 47(6), 34–40.CrossRef
go back to reference Trifa, V., Girod, L., Collier, T. C., Blumstein, D., & Taylor, C. E. (2007). Automated wildlife monitoring using self-configuring sensor networks deployed in natural habitats. Center for Embedded Network Sensing. Trifa, V., Girod, L., Collier, T. C., Blumstein, D., & Taylor, C. E. (2007). Automated wildlife monitoring using self-configuring sensor networks deployed in natural habitats. Center for Embedded Network Sensing.
go back to reference Ventura, T. M., de Oliveira, A. G., Ganchev, T. D., de Figueiredo, J. M., Jahn, O., Marques, M. I., et al. (2015). Audio parameterization with robust frame selection for improved bird identification. Expert Systems with Applications, 42(22), 8463–8471.CrossRef Ventura, T. M., de Oliveira, A. G., Ganchev, T. D., de Figueiredo, J. M., Jahn, O., Marques, M. I., et al. (2015). Audio parameterization with robust frame selection for improved bird identification. Expert Systems with Applications, 42(22), 8463–8471.CrossRef
go back to reference Wang, H., Estrin, D., & Girod, L. (2003). Preprocessing in a Tiered Sensor Network for Habitat Monitoring, EURASIP. Journal on Applied Signal Processing, 4, 392–401.CrossRef Wang, H., Estrin, D., & Girod, L. (2003). Preprocessing in a Tiered Sensor Network for Habitat Monitoring, EURASIP. Journal on Applied Signal Processing, 4, 392–401.CrossRef
go back to reference Weninger, F., & Schuller, B. (2011). Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on (pp. 337–340). IEEE. Weninger, F., & Schuller, B. (2011). Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on (pp. 337–340). IEEE.
go back to reference Yapanel, U. H., & Hansen, J. H. (2008). A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition. Speech Communication, 50(2), 142–152.CrossRef Yapanel, U. H., & Hansen, J. H. (2008). A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition. Speech Communication, 50(2), 142–152.CrossRef
go back to reference Yong, P. C., Nordholm, S., & Dam, H. H. (2012). Noise estimation based on soft decisions and conditional smoothing for speech enhancement. In Acoustic Signal Enhancement; Proceedings of IWAENC 2012; International Workshop on (pp. 1–4). VDE. Yong, P. C., Nordholm, S., & Dam, H. H. (2012). Noise estimation based on soft decisions and conditional smoothing for speech enhancement. In Acoustic Signal Enhancement; Proceedings of IWAENC 2012; International Workshop on (pp. 1–4). VDE.
go back to reference Yoshizawa, S., Hayasaka, N., Wada, N., & Miyanaga, Y. (2004). Cepstral gain normalization for noise robust speech recognition. In Acoustics, Speech, and Signal Processing, 2004 (ICASSP’04). IEEE International Conference on (Vol. 1, pp. I–209). IEEE. Yoshizawa, S., Hayasaka, N., Wada, N., & Miyanaga, Y. (2004). Cepstral gain normalization for noise robust speech recognition. In Acoustics, Speech, and Signal Processing, 2004 (ICASSP’04). IEEE International Conference on (Vol. 1, pp. I–209). IEEE.
go back to reference Yu, R. (2009). A low-complexity noise estimation algorithm based on smoothing of noise power estimation and estimation bias correction. In Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on (pp. 4421–4424). IEEE. Yu, R. (2009). A low-complexity noise estimation algorithm based on smoothing of noise power estimation and estimation bias correction. In Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on (pp. 4421–4424). IEEE.
go back to reference Yu, D., Deng, L., Seide, F. T. B., & Li, G. (2016). U.S. Patent No. 9,235,799. Washington, DC: U.S. Patent and Trademark Office. Yu, D., Deng, L., Seide, F. T. B., & Li, G. (2016). U.S. Patent No. 9,235,799. Washington, DC: U.S. Patent and Trademark Office.
go back to reference Zhang, X., & Li, Y. (2015). Adaptive energy detection for bird sound detection in complex environments. Neurocomputing, 155, 108–116.CrossRef Zhang, X., & Li, Y. (2015). Adaptive energy detection for bird sound detection in complex environments. Neurocomputing, 155, 108–116.CrossRef
Metadata
Title
Robust acoustic bird recognition for habitat monitoring with wireless sensor networks
Authors
Amira Boulmaiz
Djemil Messadeg
Noureddine Doghmane
Abdelmalik Taleb-Ahmed
Publication date
27-07-2016
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 3/2016
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-016-9354-4

Other articles of this Issue 3/2016

International Journal of Speech Technology 3/2016 Go to the issue