Skip to main content
Top

2015 | OriginalPaper | Chapter

4. Parametric Excitation Source Features for Language Identification

Authors : K. Sreenivasa Rao, Dipanjan Nandi

Published in: Language Identification Using Excitation Source Features

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This chapter describes the proposed methods to extract parametric features at sub-segmental, segmental and supra-segmental levels to capture the language-specific excitation source information. In this work, glottal pulse, spectral and epoch parameters are used for representing sub-segmental, segmental and supra-segmental information present in excitation source signal. Further, these individual features are combined at score level to enhance the accuracy of LID systems by exploiting the non-overlapping information present among the features.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference L.R. Rabiner, R.W. Schafer, Digital Processing of Speech Signals (Prentice-Hall, Englewood Cliffs, 1978) L.R. Rabiner, R.W. Schafer, Digital Processing of Speech Signals (Prentice-Hall, Englewood Cliffs, 1978)
2.
go back to reference J. Makhoul, Linear prediction: a tutorial review. Proc. IEEE 63(4), 561–580 (1975)CrossRef J. Makhoul, Linear prediction: a tutorial review. Proc. IEEE 63(4), 561–580 (1975)CrossRef
3.
go back to reference D.G. Childers, A.K. Krishnamurthy, A critical review of electroglottography. Crit. Rev. Biomed. Eng. 12(2), 131–161 (1985) D.G. Childers, A.K. Krishnamurthy, A critical review of electroglottography. Crit. Rev. Biomed. Eng. 12(2), 131–161 (1985)
4.
go back to reference M.D. Plumpe, T.F. Quatieri, D.A. Reynolds, Modeling of the glottal flow derivative waveform with application to speaker identification. IEEE Trans. Audio Speech Lang. Process. 7(5), 569–586 (1999)CrossRef M.D. Plumpe, T.F. Quatieri, D.A. Reynolds, Modeling of the glottal flow derivative waveform with application to speaker identification. IEEE Trans. Audio Speech Lang. Process. 7(5), 569–586 (1999)CrossRef
5.
go back to reference R. Veldhuish, A computationally efficient alternative for the Liljencrants-Fant model and its perceptual evaluation. J. Acoust. Soc. Am. 103(1), 566–571 (1998)CrossRef R. Veldhuish, A computationally efficient alternative for the Liljencrants-Fant model and its perceptual evaluation. J. Acoust. Soc. Am. 103(1), 566–571 (1998)CrossRef
6.
go back to reference T.V. Ananthapadmanabha, G. Fant, Calculation of true glottal flow and its components. Speech Commun. 1, 167–184 (1982)CrossRef T.V. Ananthapadmanabha, G. Fant, Calculation of true glottal flow and its components. Speech Commun. 1, 167–184 (1982)CrossRef
7.
go back to reference Y. Qi, N. Bi, A simplified approximation of the four-parameter LF model of voice source. J. Acoust. Soc. Am. 96(2), 1182–1185 (1994)CrossRef Y. Qi, N. Bi, A simplified approximation of the four-parameter LF model of voice source. J. Acoust. Soc. Am. 96(2), 1182–1185 (1994)CrossRef
8.
go back to reference K.S.R. Murty, B. Yegnanarayana, Epoch extraction from speech signals. IEEE Trans. Audio Speech Lang. Process. 16(8), 1602–1613 (2008)CrossRef K.S.R. Murty, B. Yegnanarayana, Epoch extraction from speech signals. IEEE Trans. Audio Speech Lang. Process. 16(8), 1602–1613 (2008)CrossRef
9.
go back to reference P. Naylor, A. Kounoudes, J. Gudnason, M. Brookes, Estimation of glottal closure instants in voiced speech using the DYPSA algorithm. IEEE Trans. Audio Speech Lang. Process. 15(1), 34–43 (2007)CrossRef P. Naylor, A. Kounoudes, J. Gudnason, M. Brookes, Estimation of glottal closure instants in voiced speech using the DYPSA algorithm. IEEE Trans. Audio Speech Lang. Process. 15(1), 34–43 (2007)CrossRef
10.
go back to reference S. Hayakawa, K. Takeda, F. Itakura, Speaker identification using harmonic structure of LP-residual spectrum, Biometric Personal Authentification, vol. 1206, Lecture notes (Springer, Berlin, 1997) S. Hayakawa, K. Takeda, F. Itakura, Speaker identification using harmonic structure of LP-residual spectrum, Biometric Personal Authentification, vol. 1206, Lecture notes (Springer, Berlin, 1997)
11.
go back to reference A.H. Gray, J.D. Markel, A spectral-flatness measure for studying the autocorrelation method of linear prediction of speech analysis. IEEE Trans. Audio Speech Lang. Process. ASSP-22(3), 207–217 (1974) A.H. Gray, J.D. Markel, A spectral-flatness measure for studying the autocorrelation method of linear prediction of speech analysis. IEEE Trans. Audio Speech Lang. Process. ASSP-22(3), 207–217 (1974)
12.
go back to reference J.J. Wolf, Efficient acoustic parameters for speaker recognition. J. Acoust. Soc. Am. 51(2), 2044–2055 (1972)CrossRef J.J. Wolf, Efficient acoustic parameters for speaker recognition. J. Acoust. Soc. Am. 51(2), 2044–2055 (1972)CrossRef
13.
go back to reference B.S. Atal, Automatic speaker recognition based on pitch contours. J. Acoust. Soc. Am. 52(6), 1687–1697 (1972)CrossRef B.S. Atal, Automatic speaker recognition based on pitch contours. J. Acoust. Soc. Am. 52(6), 1687–1697 (1972)CrossRef
14.
go back to reference B. Yegnenarayana, K.S.R. Murthy, Event based instantaneous fundamental frequency estimation from speech signals. IEEE Trans. Audio Speech Lang. Process. 17(4), 614–624 (2009)CrossRef B. Yegnenarayana, K.S.R. Murthy, Event based instantaneous fundamental frequency estimation from speech signals. IEEE Trans. Audio Speech Lang. Process. 17(4), 614–624 (2009)CrossRef
15.
go back to reference K.S.R. Murthy, B. Yegnanarayana, Epoch extraction from speech signal. IEEE Trans. Audio Speech Lang. Process. 16(8), 1602–1613 (2008)CrossRef K.S.R. Murthy, B. Yegnanarayana, Epoch extraction from speech signal. IEEE Trans. Audio Speech Lang. Process. 16(8), 1602–1613 (2008)CrossRef
16.
go back to reference K.S.R. Murthy, B. Yegnanarayana, Characterization of glottal activity from speech signal. IEEE Signal Process. Lett. 16(6), 469–472 (2009)CrossRef K.S.R. Murthy, B. Yegnanarayana, Characterization of glottal activity from speech signal. IEEE Signal Process. Lett. 16(6), 469–472 (2009)CrossRef
17.
go back to reference G. Seshadria, B. Yegnanarayana, Perceived loudness of speech based on the characteristics of glottal excitation source. J. Acoust. Soc. Am. 126(4), 2061–2071 (2009)CrossRef G. Seshadria, B. Yegnanarayana, Perceived loudness of speech based on the characteristics of glottal excitation source. J. Acoust. Soc. Am. 126(4), 2061–2071 (2009)CrossRef
18.
go back to reference D.A. Reynolds, R.C. Rose, Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Audio Speech Lang. Process. 3(1), 72–83 (1995)CrossRef D.A. Reynolds, R.C. Rose, Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Audio Speech Lang. Process. 3(1), 72–83 (1995)CrossRef
19.
go back to reference V.R. Reddy, S. Maity, K.S. Rao, Identification of Indian languages using multi-level spectral and prosodic features. Int. J. Speech Technol. (Springer) 16(4), 489–511 (2013)CrossRef V.R. Reddy, S. Maity, K.S. Rao, Identification of Indian languages using multi-level spectral and prosodic features. Int. J. Speech Technol. (Springer) 16(4), 489–511 (2013)CrossRef
20.
go back to reference Y.K. Muthusamy, R.A. Cole, B.T. Oshika, The OGI multilanguage telephone speech corpus, in Spoken Language Processing, pp. 895–898 (1992) Y.K. Muthusamy, R.A. Cole, B.T. Oshika, The OGI multilanguage telephone speech corpus, in Spoken Language Processing, pp. 895–898 (1992)
Metadata
Title
Parametric Excitation Source Features for Language Identification
Authors
K. Sreenivasa Rao
Dipanjan Nandi
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-17725-0_4