Published in: International Journal of Speech Technology 1/2013

01-03-2013

Effect of aging on speech features and phoneme recognition: a study on Bengali voicing vowels

Authors: Biswajit Das, Sandipan Mandal, Pabitra Mitra, Anupam Basu

Abstract

The article studies age-related variations in the speech characteristics of two age groups of Bengali speakers. The study considers 60 speakers in each of the two age groups, 60–80 years and 20–40 years. We consider different voice source features such as fundamental frequency, formant frequencies, jitter, shimmer, and harmonics-to-noise ratio. A cepstral-domain feature, Mel-frequency cepstral coefficients (MFCCs), of different voiced Bengali vowels is also analyzed for the younger and older adult groups. MFCC features and hidden Markov model parameters of the voiced vowels are used to measure phoneme dissimilarity between the two age groups. In our study, age-related changes in elderly speech affected automatic speech recognition performance, which points to the need for acoustic models specific to elderly speakers.
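
As an illustration of two of the features named above, the following minimal sketch computes local jitter and local shimmer from per-cycle measurements. It assumes the widely used "local" definitions (mean absolute difference between consecutive cycles, divided by the mean value); the abstract does not state which definitions the authors used, and the function names and sample values here are hypothetical.

```python
from statistics import mean


def local_perturbation(values):
    """Mean absolute difference between consecutive values, divided by the mean.

    Assumption: this is the common "local" perturbation measure, not
    necessarily the exact formula used in the study.
    """
    if len(values) < 2:
        raise ValueError("need at least two glottal cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return mean(diffs) / mean(values)


def local_jitter(periods_s):
    """Cycle-to-cycle variation of the pitch period (dimensionless ratio)."""
    return local_perturbation(periods_s)


def local_shimmer(peak_amplitudes):
    """Cycle-to-cycle variation of the peak amplitude (dimensionless ratio)."""
    return local_perturbation(peak_amplitudes)


# Hypothetical measurements for four consecutive glottal cycles.
periods = [0.0100, 0.0102, 0.0098, 0.0100]  # seconds
amps = [1.0, 0.9, 1.1, 1.0]                 # arbitrary linear units

jitter = local_jitter(periods)
shimmer = local_shimmer(amps)
mean_f0 = 1.0 / mean(periods)  # mean fundamental frequency in Hz

print(f"jitter={jitter:.4f} shimmer={shimmer:.4f} F0={mean_f0:.1f} Hz")
```

Both measures are ratios, so they are often reported as percentages. In practice the period and amplitude sequences would come from a pitch-marking tool rather than being typed in by hand.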

Metadata
Title
Effect of aging on speech features and phoneme recognition: a study on Bengali voicing vowels
Authors
Biswajit Das
Sandipan Mandal
Pabitra Mitra
Anupam Basu
Publication date
01-03-2013
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 1/2013
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-012-9147-3
