Skip to main content
Top
Published in: International Journal of Speech Technology 4/2016

30-09-2016

Bird classification based on their sound patterns

Authors: M. A. Raghuram, Nikhil R. Chavan, Ravikiran Belur, Shashidhar G. Koolagudi

Published in: International Journal of Speech Technology | Issue 4/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper we focus on automatic bird classification based on their sound patterns. This is useful in the field of ornithology for studying bird species and their behavior based on their sound. The proposed methodology may be used to conduct survey of birds. The proposed methods may be used to automatically classify birds using different audio processing and machine learning techniques on the basis of their chirping patterns. An effort has been made in this work to map characteristics of birds such as size, habitat, species and types of call, on to their sounds. This study is also part of a broader project that includes development of software and hardware systems to monitor the bird species that appear in different geographical locations which helps ornithologists to monitor environmental conditions with respect to specific bird species.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Acevedo, A., Corrada-Bravo, C., Corrada-Bravo, H., Villanueva-Rivera, L., & Aide, T. (2009). Automated classification of bird and amphibian calls using machine learning: A comparison of methods. Ecological Informatics, 4, 206–214.CrossRef Acevedo, A., Corrada-Bravo, C., Corrada-Bravo, H., Villanueva-Rivera, L., & Aide, T. (2009). Automated classification of bird and amphibian calls using machine learning: A comparison of methods. Ecological Informatics, 4, 206–214.CrossRef
go back to reference Bardeli, R., Wolff, D., Kurth, F., Koch, M., Tauchert, K., & Frommolt, K. (2010). Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring. Pattern Recognition Letters, 31, 1524–1534.CrossRef Bardeli, R., Wolff, D., Kurth, F., Koch, M., Tauchert, K., & Frommolt, K. (2010). Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring. Pattern Recognition Letters, 31, 1524–1534.CrossRef
go back to reference Beckers, G. J. (2011). Bird speech perception and vocal production: A comparison with humans. Human Biology, 83(2), 191–212.CrossRef Beckers, G. J. (2011). Bird speech perception and vocal production: A comparison with humans. Human Biology, 83(2), 191–212.CrossRef
go back to reference Bermúdez-Cuamatzin, E., Ríos-Chelén, A. A., Gil, D., & Garcia, C. M. (2010). Experimental evidence for real-time song frequency shift in response to urban noise in a passerine bird. Biology Letters, 3, 368–370. Bermúdez-Cuamatzin, E., Ríos-Chelén, A. A., Gil, D., & Garcia, C. M. (2010). Experimental evidence for real-time song frequency shift in response to urban noise in a passerine bird. Biology Letters, 3, 368–370.
go back to reference Bolhuis, J. J., Okanoya, K., & Scharff, C. (2010). Twitter evolution: Converging mechanisms in birdsong and human speech. Nature Reviews Neuroscience, 11(11), 747–759.CrossRef Bolhuis, J. J., Okanoya, K., & Scharff, C. (2010). Twitter evolution: Converging mechanisms in birdsong and human speech. Nature Reviews Neuroscience, 11(11), 747–759.CrossRef
go back to reference Brandes, T. S. (2008). Automated sound recording and analysis techniques for bird surveys and conservation. Bird Conservation International, 18(S1), S163–S173. Brandes, T. S. (2008). Automated sound recording and analysis techniques for bird surveys and conservation. Bird Conservation International, 18(S1), S163–S173.
go back to reference Briggs, F., Lakshminarayanan, B., Neal, L., Fern, X. Z., Raich, R., Hadley, S., et al. (2012). Classification of multiple bird species. Journal of Acoustic Society of America, 131, 4640–4650.CrossRef Briggs, F., Lakshminarayanan, B., Neal, L., Fern, X. Z., Raich, R., Hadley, S., et al. (2012). Classification of multiple bird species. Journal of Acoustic Society of America, 131, 4640–4650.CrossRef
go back to reference Chen, Z., & Maher, R. C. (2006). Semi-automatic classification of bird vocalizations using spectral peak tracks. The Journal of the Acoustical Society of America, 120, 2974–2984.CrossRef Chen, Z., & Maher, R. C. (2006). Semi-automatic classification of bird vocalizations using spectral peak tracks. The Journal of the Acoustical Society of America, 120, 2974–2984.CrossRef
go back to reference Clark, G. A. (1979). Body weights of birds: A review. The Condor, 81(2), 193–202.CrossRef Clark, G. A. (1979). Body weights of birds: A review. The Condor, 81(2), 193–202.CrossRef
go back to reference Davis, S. B., & Mermelstein, P. (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, In Proceedings of the IEEE Conference on Acoustics, Speech and Signal Processing (Vol. 28, pp. 357–366). Davis, S. B., & Mermelstein, P. (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, In Proceedings of the IEEE Conference on Acoustics, Speech and Signal Processing (Vol. 28, pp. 357–366).
go back to reference Doupe, A. J., & Kuhl, P. K. (1999). Birdsong and human speech: Common themes and mechanisms. Annual Review of Neuroscience, 22(1), 567–631.CrossRef Doupe, A. J., & Kuhl, P. K. (1999). Birdsong and human speech: Common themes and mechanisms. Annual Review of Neuroscience, 22(1), 567–631.CrossRef
go back to reference Dowling, J., Luther, D., & Marra, P. (2012). Comparative effects of urban development and anthropogenic noise on bird songs. Behavioral Ecology, 23(1), 201–209.CrossRef Dowling, J., Luther, D., & Marra, P. (2012). Comparative effects of urban development and anthropogenic noise on bird songs. Behavioral Ecology, 23(1), 201–209.CrossRef
go back to reference Fagerlund, S. (2007). Bird species recognition using support vector machines. Journal on Advances in Signal Processing, 7, 64–71.MATH Fagerlund, S. (2007). Bird species recognition using support vector machines. Journal on Advances in Signal Processing, 7, 64–71.MATH
go back to reference Hall, M. L., Kingma, S. A., & Peters, A. (2013). Male songbird indicates body size with low-pitched advertising songs. PLoS One, 8(2), e56717.CrossRef Hall, M. L., Kingma, S. A., & Peters, A. (2013). Male songbird indicates body size with low-pitched advertising songs. PLoS One, 8(2), e56717.CrossRef
go back to reference Juang, C., & Chen, T. (2007). Birdsong recognition using prediction-based recurrent neural fuzzy networks. Neurocomputing, 71, 121–130.CrossRef Juang, C., & Chen, T. (2007). Birdsong recognition using prediction-based recurrent neural fuzzy networks. Neurocomputing, 71, 121–130.CrossRef
go back to reference Kight, C. R., & Swaddle, J. P. (2011). How and why environmental noise impacts animals: An integrative, mechanistic review. Ecology Letters, 14(10), 1052–1061.CrossRef Kight, C. R., & Swaddle, J. P. (2011). How and why environmental noise impacts animals: An integrative, mechanistic review. Ecology Letters, 14(10), 1052–1061.CrossRef
go back to reference Kwan, C., Mei, G., Zhao, X., Ren, Z., Xu, R., Stanford, V., Rochet, C., Aube, J., & Ho, K. (2004). Bird classification algorithms: Theory and experimental results, In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP’04) (vol. 5, pp. 289–292), Montreal, Canada. Kwan, C., Mei, G., Zhao, X., Ren, Z., Xu, R., Stanford, V., Rochet, C., Aube, J., & Ho, K. (2004). Bird classification algorithms: Theory and experimental results, In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP’04) (vol. 5, pp. 289–292), Montreal, Canada.
go back to reference Laiolo, P. (2010). The emerging significance of bioacoustics in animal species conservation. Biological Conservation, 143(7), 1635–1645.CrossRef Laiolo, P. (2010). The emerging significance of bioacoustics in animal species conservation. Biological Conservation, 143(7), 1635–1645.CrossRef
go back to reference Lartillot, O., & Toiviainen, P. (2007). A matlab toolbox for musical feature extraction from audio, In International Conference on Digital Audio Effects (pp. 237–244). Lartillot, O., & Toiviainen, P. (2007). A matlab toolbox for musical feature extraction from audio, In International Conference on Digital Audio Effects (pp. 237–244).
go back to reference Lartillot, O., Eerola, T., Toiviainen, P., & Fornari, J. (2008). Multi-feature modeling of pulse clarity: Design, validation and optimization., In ISMIR (pp. 521–526), Citeseer. Lartillot, O., Eerola, T., Toiviainen, P., & Fornari, J. (2008). Multi-feature modeling of pulse clarity: Design, validation and optimization., In ISMIR (pp. 521–526), Citeseer.
go back to reference Lathi, B. P. (2004). Signal processing and linear systems. Oxford: Oxford University Press. Lathi, B. P. (2004). Signal processing and linear systems. Oxford: Oxford University Press.
go back to reference Lee, C.-H., Han, C.-C., & Chuang, C.-C. (2008). Automatic classification of bird species from their sounds using two-dimensional cepstral coefficients. IEEE Transactions on Audio, Speech, and Language Processing, 16(8), 1541–1550.CrossRef Lee, C.-H., Han, C.-C., & Chuang, C.-C. (2008). Automatic classification of bird species from their sounds using two-dimensional cepstral coefficients. IEEE Transactions on Audio, Speech, and Language Processing, 16(8), 1541–1550.CrossRef
go back to reference Linhart, P., & Fuchs, R. (2015). Song pitch indicates body size and correlates with males’ response to playback in a songbird. Animal Behaviour, 103, 91–98.CrossRef Linhart, P., & Fuchs, R. (2015). Song pitch indicates body size and correlates with males’ response to playback in a songbird. Animal Behaviour, 103, 91–98.CrossRef
go back to reference Lopes, M. T., Gioppo, L. L., Higushi, T. T., Kaestner, C. A. A., Silla, Jr., C. N., & Koerich, A. L. (2011). Automatic bird species identification for large number of species, In IEEE International Symposium on Multimedia. Lopes, M. T., Gioppo, L. L., Higushi, T. T., Kaestner, C. A. A., Silla, Jr., C. N., & Koerich, A. L. (2011). Automatic bird species identification for large number of species, In IEEE International Symposium on Multimedia.
go back to reference Lopes, M. T., Koerich, A. L., Kaestner, C. A. A., Silla, Jr., C. N. (2011). Feature set comparison for automatic bird species identification, In IEEE International Conference on Systems, Man, and Cybernetics, Anchorage, Alaska. Lopes, M. T., Koerich, A. L., Kaestner, C. A. A., Silla, Jr., C. N. (2011). Feature set comparison for automatic bird species identification, In IEEE International Conference on Systems, Man, and Cybernetics, Anchorage, Alaska.
go back to reference Luther, D., & Baptista, L. (2010). Urban noise and the cultural evolution of bird songs. Proceedings of the Royal Society of London B: Biological Sciences, 277(1680), 469–473.CrossRef Luther, D., & Baptista, L. (2010). Urban noise and the cultural evolution of bird songs. Proceedings of the Royal Society of London B: Biological Sciences, 277(1680), 469–473.CrossRef
go back to reference Mellinger, D., & Bradbury, J. W. (2007). Acoustic measurement of marine mammal sounds in noisy environments, In Proceedings of the International Conference on Underwater Acoustical Measurements: Technologies and Results. Mellinger, D., & Bradbury, J. W. (2007). Acoustic measurement of marine mammal sounds in noisy environments, In Proceedings of the International Conference on Underwater Acoustical Measurements: Technologies and Results.
go back to reference Mitchell, T. M. (1997). Machine learning. Maidenhead: McGraw-Hill.MATH Mitchell, T. M. (1997). Machine learning. Maidenhead: McGraw-Hill.MATH
go back to reference Rickwood, P., & Taylor, A. (2008). Methods for automatically analyzing humpback song units. Journal of the Acoustical Society of America, 123, 1763–1772.CrossRef Rickwood, P., & Taylor, A. (2008). Methods for automatically analyzing humpback song units. Journal of the Acoustical Society of America, 123, 1763–1772.CrossRef
go back to reference Silla, C. N., & Kaestner, C. A. (2013). Hierarchical classification of bird species using their audio recorded songs (pp. 1895–1900). Washington, DC: IEEE Computer Society. Silla, C. N., & Kaestner, C. A. (2013). Hierarchical classification of bird species using their audio recorded songs (pp. 1895–1900). Washington, DC: IEEE Computer Society.
go back to reference Slabbekoorn, H., & Peet, M. (2003). Ecology: Birds sing at a higher pitch in urban noise. Nature, 424(6946), 267–267.CrossRef Slabbekoorn, H., & Peet, M. (2003). Ecology: Birds sing at a higher pitch in urban noise. Nature, 424(6946), 267–267.CrossRef
go back to reference Somervuo, P., Harma, A., & Fagerlund, S. (2006). Parametric representations of bird sounds for automatic species recognition. IEEE Transactions on Audio, Speech and Language Processing, 14, 2252–2263.CrossRef Somervuo, P., Harma, A., & Fagerlund, S. (2006). Parametric representations of bird sounds for automatic species recognition. IEEE Transactions on Audio, Speech and Language Processing, 14, 2252–2263.CrossRef
go back to reference Sun, R., Marye, Y. W., & Zhao, H. (2013). Wavelet transform digital sound processing to identify wild bird species, In Proceedings of the 2013 International Conference on Wavelet Analysis and Pattern Recognition. Sun, R., Marye, Y. W., & Zhao, H. (2013). Wavelet transform digital sound processing to identify wild bird species, In Proceedings of the 2013 International Conference on Wavelet Analysis and Pattern Recognition.
go back to reference Tsai, W.-H., Xu, Y.-Y., & Lin, W.-C. (2013). Bird species identification based on timbre and pitch features, In IEEE International Conference on Multimedia and Expo (pp. 1–6). Tsai, W.-H., Xu, Y.-Y., & Lin, W.-C. (2013). Bird species identification based on timbre and pitch features, In IEEE International Conference on Multimedia and Expo (pp. 1–6).
go back to reference Vilches, E., Escobar, I., Vallejo, E., & Taylor, C. (2006). Data mining applied to acoustic bird species recognition, In Proceedings of the 18th IEEE International Conference on Pattern Recognition (ICPR’06). Vilches, E., Escobar, I., Vallejo, E., & Taylor, C. (2006). Data mining applied to acoustic bird species recognition, In Proceedings of the 18th IEEE International Conference on Pattern Recognition (ICPR’06).
go back to reference Witten, I. H., & Frank, E. (2005). Data mining: Practical machine learning tools and techniques. San Francisco: Morgan Kaufmann Publishers.MATH Witten, I. H., & Frank, E. (2005). Data mining: Practical machine learning tools and techniques. San Francisco: Morgan Kaufmann Publishers.MATH
Metadata
Title
Bird classification based on their sound patterns
Authors
M. A. Raghuram
Nikhil R. Chavan
Ravikiran Belur
Shashidhar G. Koolagudi
Publication date
30-09-2016
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 4/2016
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-016-9372-2

Other articles of this Issue 4/2016

International Journal of Speech Technology 4/2016 Go to the issue