Skip to main content
Erschienen in: International Journal of Speech Technology 3/2017

08.07.2017

Effect of bandwidth modifications on the quality of speech imitated by Alexandrine and Indian Ringneck parrots

verfasst von: Randhir Singh, Ajay Kumar, Parveen Kumar Lehana

Erschienen in: International Journal of Speech Technology | Ausgabe 3/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Alexandrine and Indian Ringneck parrots are known for imitating the voice of other animals. The objective of this paper is to estimate the spectral limits of the imitated sounds produced by parrots and quantify the quality. The investigations showed that 500–3000 Hz spectral band is adequate for retaining the important perceptual information in the phrases uttered by human speakers and imitated by parrots. Investigations confirmed that the Indian Ringneck parrots are capable of following the formant structure and pitch contour of the phrases uttered by the human subjects. The dynamic range of the pitch of Indian Ringneck parrots was observed as higher than that of the human subjects. A rise of about 1000 Hz in the formant F1 of the parrots was observed, indicating the tongue height small and beak opening, relatively large, as compared to that of human subjects. The quality of some of the synthesized and processed phrases was found slightly better as compared to that of the original phrases because of the inherent enhancement capability of the Harmonic plus noise model (HNM). The average Mean opinion score (MOS) score of the Indian Ringnech parrots for the original, synthesized, and processed phrases was observed as 2.65, 2.59, and 2.77, respectively. The investigations may be beneficial for studying the behavior of endangered birds, defense related activities, safeguarding the crashes with aero planes, and safeguard of the birds from wind power generator etc.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Ali, S. (1943). The book of Indian birds. Bombay: The Bombay Natural History Society. Ali, S. (1943). The book of Indian birds. Bombay: The Bombay Natural History Society.
Zurück zum Zitat Beckers, G. J. L. (2011). Bird speech perception and vocal production: A comparison with humans. Human Biology; an International Record of Research, 83, 191–212. Beckers, G. J. L. (2011). Bird speech perception and vocal production: A comparison with humans. Human Biology; an International Record of Research, 83, 191–212.
Zurück zum Zitat Beckers, G. J. L., Nelson, B. S., & Suthers, R. A. (2004). Vocal-tract filtering by lingual articulation in a parrot. Current Biology, 14, 1592–1597.CrossRef Beckers, G. J. L., Nelson, B. S., & Suthers, R. A. (2004). Vocal-tract filtering by lingual articulation in a parrot. Current Biology, 14, 1592–1597.CrossRef
Zurück zum Zitat Berouti, M., Schwartz, R., & Makhoul, J. (1979). Enhancement of speech corrupted by acoustic noise. In Acoustics, speech, and signal processing, IEEE international conference on ICASSP’ (vol. 79(4), pp. 208–211). Berouti, M., Schwartz, R., & Makhoul, J. (1979). Enhancement of speech corrupted by acoustic noise. In Acoustics, speech, and signal processing, IEEE international conference on ICASSP’ (vol. 79(4), pp. 208–211).
Zurück zum Zitat Boll, S. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions Acoustics, Speech and Signal Processing, 27(2), 113–120.CrossRef Boll, S. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions Acoustics, Speech and Signal Processing, 27(2), 113–120.CrossRef
Zurück zum Zitat Cai, J., Ee, D., Pham, B., Roe, P., & Zhang, J. (2007). Sensor network for the monitoring of ecosystem: Bird species recognition. In Proceedings of the 3rd IEEE international conference on intelligent sensors, sensor networks and information (ISSNIP’07) (pp. 293–298). Melbourne: ISSNIP.CrossRef Cai, J., Ee, D., Pham, B., Roe, P., & Zhang, J. (2007). Sensor network for the monitoring of ecosystem: Bird species recognition. In Proceedings of the 3rd IEEE international conference on intelligent sensors, sensor networks and information (ISSNIP’07) (pp. 293–298). Melbourne: ISSNIP.CrossRef
Zurück zum Zitat Catchpole, C. K., & Slater, P. J. B. (2008). Bird song: Biological themes and variations. Cambridge: Cambridge Press University.CrossRef Catchpole, C. K., & Slater, P. J. B. (2008). Bird song: Biological themes and variations. Cambridge: Cambridge Press University.CrossRef
Zurück zum Zitat Chou, C.-H., & Liu, P.-H. (2009). Bird species recognition by wavelet transformation of a section of birdsong. In Proceedings IEEE symposia and workshops on ubiquitous, autonomic and trusted computing. (pp. 189–193). Brisbane: IEEE.CrossRef Chou, C.-H., & Liu, P.-H. (2009). Bird species recognition by wavelet transformation of a section of birdsong. In Proceedings IEEE symposia and workshops on ubiquitous, autonomic and trusted computing. (pp. 189–193). Brisbane: IEEE.CrossRef
Zurück zum Zitat Chu, W., & Alwan, A. (2012). FBEM: A filter bank EM algorithm for the joint optimization of features and acoustic model parameters in bird call classification. In Proceedings IEEE international conference acoustics, speech, signal processing (ICASSP) (pp. 1993–1996). Kyoto: ICASSP. Chu, W., & Alwan, A. (2012). FBEM: A filter bank EM algorithm for the joint optimization of features and acoustic model parameters in bird call classification. In Proceedings IEEE international conference acoustics, speech, signal processing (ICASSP) (pp. 1993–1996). Kyoto: ICASSP.
Zurück zum Zitat Chu, W., & Blumstein, D. T. (2011). Noise robust bird song detection using syllable pattern-based Hidden Markov models. In Proceedings international conference on acoustics, speech, signal processing (pp. 345–348). Prague: ICASSP. Chu, W., & Blumstein, D. T. (2011). Noise robust bird song detection using syllable pattern-based Hidden Markov models. In Proceedings international conference on acoustics, speech, signal processing (pp. 345–348). Prague: ICASSP.
Zurück zum Zitat Doupe, A. J., & Kuhl, P. K. (1999). Birdsong and human speech: Common themes and mechanisms. Annual Review of Neuroscience, 22, 567–631.CrossRef Doupe, A. J., & Kuhl, P. K. (1999). Birdsong and human speech: Common themes and mechanisms. Annual Review of Neuroscience, 22, 567–631.CrossRef
Zurück zum Zitat Fagerlund, S. (2007). Bird species recognition using support vector machines. EUSASIP Journal on Advances in Signal Processing, 2007, 1–8.MATH Fagerlund, S. (2007). Bird species recognition using support vector machines. EUSASIP Journal on Advances in Signal Processing, 2007, 1–8.MATH
Zurück zum Zitat Forshaw, J. M. (2006). Parrots of the world: An identification Guide. Princeton: Princeton University Press. Forshaw, J. M. (2006). Parrots of the world: An identification Guide. Princeton: Princeton University Press.
Zurück zum Zitat Ganchev, T., Lazaridis, A., Mporas, I., & Fakotakis, N. (2008). Performance evaluation for voice conversion systems. Berlin: Springer.CrossRef Ganchev, T., Lazaridis, A., Mporas, I., & Fakotakis, N. (2008). Performance evaluation for voice conversion systems. Berlin: Springer.CrossRef
Zurück zum Zitat Härmä, A., & Somervuo, P. (2004). Classification of the harmonic structure in bird vocalization. In Proceedings IEEE international conference acoustics, speech, and signal processing (ICASSP ‘04) (pp. 701–704). Montreal, QC: ICASSP. Härmä, A., & Somervuo, P. (2004). Classification of the harmonic structure in bird vocalization. In Proceedings IEEE international conference acoustics, speech, and signal processing (ICASSP ‘04) (pp. 701–704). Montreal, QC: ICASSP.
Zurück zum Zitat Homberger, D. G. (1986). The lingual apparatus of the African grey parrot, Psittacus erithacus Linne (Aves: Psittacidae): Description and theoretical mechanical analysis. Ornithology Monographs, 39, 1–233. Homberger, D. G. (1986). The lingual apparatus of the African grey parrot, Psittacus erithacus Linne (Aves: Psittacidae): Description and theoretical mechanical analysis. Ornithology Monographs, 39, 1–233.
Zurück zum Zitat ITU-T. (1996). Methods for subjective determination of transmission quality. Tech. Rep. ITU-T Recommendation P.800, ITU. ITU-T. (1996). Methods for subjective determination of transmission quality. Tech. Rep. ITU-T Recommendation P.800, ITU.
Zurück zum Zitat ITU-T Rec. P. 862. (2001). Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrow band telephone networks and speech codecs ITU-T Rec. P. 862. (2001). Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrow band telephone networks and speech codecs
Zurück zum Zitat Kamath, S. D., & Loizou, P. C. (2002). A multi-band spectral subtraction method for enhancing speech corrupted by colored noise. In Proceedings IEEE international conference acoustics, speech, and signal processing (pp. 4160–4164). Orlando: ICASSP. Kamath, S. D., & Loizou, P. C. (2002). A multi-band spectral subtraction method for enhancing speech corrupted by colored noise. In Proceedings IEEE international conference acoustics, speech, and signal processing (pp. 4160–4164). Orlando: ICASSP.
Zurück zum Zitat King, A. S. (1989). Functional anatomy of the syrinx, in Form and Function in Birds. ed. by A.S. King, J. McLelland. London: Academic Press. King, A. S. (1989). Functional anatomy of the syrinx, in Form and Function in Birds. ed. by A.S. King, J. McLelland. London: Academic Press.
Zurück zum Zitat Ladefoged, P. A. (2001). A course in phonetics (4th edn.). Fort Worth: Harcourt College Publishers. Ladefoged, P. A. (2001). A course in phonetics (4th edn.). Fort Worth: Harcourt College Publishers.
Zurück zum Zitat Laroche, J., Stylianou, Y., & Moulines, E. (1993a). HNM: A simple, efficient harmonic plus noise model for speech. In Proceeding IEEE workshop applications signal processing to audio and acoustics (pp. 169–172).New Paltz, NY: WASPAACrossRef Laroche, J., Stylianou, Y., & Moulines, E. (1993a). HNM: A simple, efficient harmonic plus noise model for speech. In Proceeding IEEE workshop applications signal processing to audio and acoustics (pp. 169–172).New Paltz, NY: WASPAACrossRef
Zurück zum Zitat Laroche, J., Stylianou, Y., & Moulines, E. (1993b). HNS: Speech modification based on a harmonic + noise model. In Proceedings international conference on acoustics, speech, and signal processing (pp. 550–553). Minneapolis, MN: ICASSP Laroche, J., Stylianou, Y., & Moulines, E. (1993b). HNS: Speech modification based on a harmonic + noise model. In Proceedings international conference on acoustics, speech, and signal processing (pp. 550–553). Minneapolis, MN: ICASSP
Zurück zum Zitat Larsen, O. N., & Goller, F. (2002). Direct observation of syringeal muscle function in songbirds and a parrot. The Journal of Experimental Biology, 205, 25–35. Larsen, O. N., & Goller, F. (2002). Direct observation of syringeal muscle function in songbirds and a parrot. The Journal of Experimental Biology, 205, 25–35.
Zurück zum Zitat Lehana, P. K. (2013). Spectral mapping using multivariate polynomial modelling for voice conversion. Ph.D. thesis, Electrical Engineering, IIT Bombay. Lehana, P. K. (2013). Spectral mapping using multivariate polynomial modelling for voice conversion. Ph.D. thesis, Electrical Engineering, IIT Bombay.
Zurück zum Zitat Lehana, P. K., Gupta, R. K., & Kumari, S. (2004). Enhancement of esophagus speech using harmonic plus noise modal. In Proceedings TENCON 2004. 2004 IEEE region 10 conference (pp. 669–672). Lehana, P. K., Gupta, R. K., & Kumari, S. (2004). Enhancement of esophagus speech using harmonic plus noise modal. In Proceedings TENCON 2004. 2004 IEEE region 10 conference (pp. 669–672).
Zurück zum Zitat Leiliany, N. M., Maria, L. S., Marilice, M. F. G., Angélica, L. F. R., Adrine, C. S., & Ivete, F. R. (2014). Gestural communication in a new world parrot. Behavioural Processes, 105, 46–48.CrossRef Leiliany, N. M., Maria, L. S., Marilice, M. F. G., Angélica, L. F. R., Adrine, C. S., & Ivete, F. R. (2014). Gestural communication in a new world parrot. Behavioural Processes, 105, 46–48.CrossRef
Zurück zum Zitat McAulay, R. J., & Quatieri, T. F. (1986). Speech analysis/synthesis based on a sinusoidal representation. IEEE Transactions on Acoust, Speech, Signal Processing, 34(4), 744–754.CrossRef McAulay, R. J., & Quatieri, T. F. (1986). Speech analysis/synthesis based on a sinusoidal representation. IEEE Transactions on Acoust, Speech, Signal Processing, 34(4), 744–754.CrossRef
Zurück zum Zitat McIlraith, A. L., & Card, H. C. (1997). Birdsong recognition using backpropagation and multivariate statistics. IEEE Transactions on Signal Processing, 45(11), 2740–2748.CrossRef McIlraith, A. L., & Card, H. C. (1997). Birdsong recognition using backpropagation and multivariate statistics. IEEE Transactions on Signal Processing, 45(11), 2740–2748.CrossRef
Zurück zum Zitat Nelson, B. S., Beckers, G. J. L., & Suthers, R. A. (2005). Vocal tract filtering and sound radiation in a songbird. The Journal of Experimental Biology, 208, 297–308.CrossRef Nelson, B. S., Beckers, G. J. L., & Suthers, R. A. (2005). Vocal tract filtering and sound radiation in a songbird. The Journal of Experimental Biology, 208, 297–308.CrossRef
Zurück zum Zitat Nottebohm, F. (1976). Phonation in the orange-winged Amazon parrot, Amazona amazonica. Journal of Comparative Physiology B: Biochemical, Systemic, and Environmental Physiology, 108, 157–170.CrossRef Nottebohm, F. (1976). Phonation in the orange-winged Amazon parrot, Amazona amazonica. Journal of Comparative Physiology B: Biochemical, Systemic, and Environmental Physiology, 108, 157–170.CrossRef
Zurück zum Zitat Nowicki, S. (1987). Vocal tract resonances in oscine bird sound production: Evidence from birdsongs in a helium atmosphere. Nature, 325, 53–55.CrossRef Nowicki, S. (1987). Vocal tract resonances in oscine bird sound production: Evidence from birdsongs in a helium atmosphere. Nature, 325, 53–55.CrossRef
Zurück zum Zitat Ohms, V. R., Beckers, G. J. L., ten Cate, C., & Suthers, R. A. (2012). Vocal tract articulation revisited: The case of the monk parakeet. The Journal of Experimental Biology, 215, 85–92.CrossRef Ohms, V. R., Beckers, G. J. L., ten Cate, C., & Suthers, R. A. (2012). Vocal tract articulation revisited: The case of the monk parakeet. The Journal of Experimental Biology, 215, 85–92.CrossRef
Zurück zum Zitat Ohms, V. R., Escudero, P., Lammers, K., & ten Cate, C. (2011). Zebra finches and Dutch adults exhibit the same cue weighting bias in vowel perception. Animal Cognition, 15, 155–161.CrossRef Ohms, V. R., Escudero, P., Lammers, K., & ten Cate, C. (2011). Zebra finches and Dutch adults exhibit the same cue weighting bias in vowel perception. Animal Cognition, 15, 155–161.CrossRef
Zurück zum Zitat Pantazis, Y., & Stylianou, Y. (2008). Improving the modeling of the noise part in the harmonic plus noise model of speech. In Proceedings IEEE international conference acoustics, speech, and signal processing (ICASSP) (pp. 4609–4612). Las Vegas: ICASSP. Pantazis, Y., & Stylianou, Y. (2008). Improving the modeling of the noise part in the harmonic plus noise model of speech. In Proceedings IEEE international conference acoustics, speech, and signal processing (ICASSP) (pp. 4609–4612). Las Vegas: ICASSP.
Zurück zum Zitat Patterson, D. K., & Pepperberg, I. M. (1994). A comparative study of human and parrot phonation: Acoustic and articulatory correlates of vowels. The Journal of the Acoustical Society of America, 96(2), 634–648.CrossRef Patterson, D. K., & Pepperberg, I. M. (1994). A comparative study of human and parrot phonation: Acoustic and articulatory correlates of vowels. The Journal of the Acoustical Society of America, 96(2), 634–648.CrossRef
Zurück zum Zitat Pepperberg, I. M. (1994). Vocal learning in Grey parrots (Psittacus erithacus): Effect of social interaction reference and context. The Auk, 111, 300–313.CrossRef Pepperberg, I. M. (1994). Vocal learning in Grey parrots (Psittacus erithacus): Effect of social interaction reference and context. The Auk, 111, 300–313.CrossRef
Zurück zum Zitat Shuang, Z., Meng, F., & Qin, Y. (2008). Voice conversion by combining frequency warping with unit selection. In Proceedings IEEE international conference acoustics, speech, and signal processing (pp. 4661–4664). Las Vegas: ICASSP Shuang, Z., Meng, F., & Qin, Y. (2008). Voice conversion by combining frequency warping with unit selection. In Proceedings IEEE international conference acoustics, speech, and signal processing (pp. 4661–4664). Las Vegas: ICASSP
Zurück zum Zitat Stylianou, Y. (2001). Applying the harmonic plus noise model in concatenative speech synthesis. IEEE Transactions on Speech and Audio Processing, 9(1), 21–29.CrossRef Stylianou, Y. (2001). Applying the harmonic plus noise model in concatenative speech synthesis. IEEE Transactions on Speech and Audio Processing, 9(1), 21–29.CrossRef
Zurück zum Zitat Suthers, R. A., & Zollinger, S. A. (2004). Producing song: The vocal apparatus. Annals of the New York Academy of Sciences, 1016, 109–129.CrossRef Suthers, R. A., & Zollinger, S. A. (2004). Producing song: The vocal apparatus. Annals of the New York Academy of Sciences, 1016, 109–129.CrossRef
Zurück zum Zitat Warren, D. K., Patterson, D. K., & Pepperberg, I. M. (1996). Mechanisms of American English vowel production in a grey parrot (Psittacus erithacus). The Auk, 113, 41–58.CrossRef Warren, D. K., Patterson, D. K., & Pepperberg, I. M. (1996). Mechanisms of American English vowel production in a grey parrot (Psittacus erithacus). The Auk, 113, 41–58.CrossRef
Metadaten
Titel
Effect of bandwidth modifications on the quality of speech imitated by Alexandrine and Indian Ringneck parrots
verfasst von
Randhir Singh
Ajay Kumar
Parveen Kumar Lehana
Publikationsdatum
08.07.2017
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 3/2017
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-017-9437-x

Weitere Artikel der Ausgabe 3/2017

International Journal of Speech Technology 3/2017 Zur Ausgabe

Neuer Inhalt