nach oben

Erschienen in:

2015 | OriginalPaper | Buchkapitel

2. Language Identification—A Brief Review

verfasst von : K. Sreenivasa Rao, Dipanjan Nandi

Erschienen in: Language Identification Using Excitation Source Features

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

This chapter provides compendious reviews about both the explicit and implicit LID systems present in the literature. Existing works related to language identification in Indian context are briefly discussed. The related works about the excitation source features are also presented here. Various speech features and models proposed in the context of language identification are briefly reviewed in this chapter. The motivation for the present work from the existing literature is briefly discussed.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Introduction

Nächstes Kapitel Implicit Excitation Source Features for Language Identification

R. Leonard, G. Doddington, Automatic language identification. Technical Report RADC-TR-74-200 (Air Force Rome Air Development Center, Technical Report) August 1974

R. Leonard, Language Recognition Test and Evaluation. Technical Report RADCTR-80-83 (Air Force Rome Air Development Center, Technical Report). March 1980

A.S. House, E.P. Neuberg, Toward automatic identification of the languages of an utterance. J. Acoust. Soc. Am. 62(3), 708–713 (1977)CrossRef

K.P. Li, T.J. Edwards, Statistical models for automatic language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 884–887, April 1980

L.F. Lamel, J.L. Gauvain, Cross lingual experiments with phone recognition. in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 507–510, April 1993

L.F. Lamel, J.L. Gauvain, Language identification using phonebased acoustic likelihoods, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, , pp. I/293–I/296, April 1994

Y. Muthusamy, R. Cole, M. Gopalakrishnan, A segment-based approach to automatic language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 353–356, April 1991

K.M. Berkling, T. Arai, E. Bernard, Analysis of phoneme based features for language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. I/289–I/292, April 1994

R.C.F. Tucker, M. Carey, E. Parris, Automatic language identification using sub-word models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. I/301–I/30, April 1994

10.

M.A. Zissman, E. Singer, Automatic language identification of telephone speech messages using phoneme recognition and N-gram modeling, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 pp. I/305–I/308, (1994)

11.

S. Kadambe, J. Hieronymus, Language identification with phonological and lexical models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. 3507–351, May 1995

12.

Y. Yan, E. Barnard, An approach to automatic language identification based on language-dependent phone recognition, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. 3511–3514, May 1995

13.

J. Navratil, W. Zuhlke, Phonetic-context mapping in language identification. Eur. Speech Commun. Assoc. (EUROSPEECH) 1, 71–74 (1997)

14.

T.J. Hazen, V.W. Zue, Segment-based automatic language identification. J. Acoust. Soc. Am. 101, 2323–2331 (1997)CrossRef

15.

K. Kirchhoff, S. Parandekar, Multi-stream statistical n-gram modeling with application to automatic language identification, in European Speech Communication Association (EUROSPEECH), pp. 803–806, (2001)

16.

T. Gleason, M. Zissman, Composite background models and score standardization for language identification systems, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 529–532 (2001)

17.

V. Ramasubramanian, A.K.V.S. Jayram, T.V. Sreenivas, Language identification using parallel sub-word recognition - an ergodic HMM equivalence, European Speech Communication Association (EUROSPEECH) (Geneva, Switzerland), September 2003

18.

J. Gauvain, A. Messaoudi, H. Schwenk, Language recognition using phone latices, in International Speech Communication Association (INTERSPEECH), pp. 25–28 (2004)

19.

W. Shen, W. Campbell, T. Gleason, D. Reynolds, E. Singer, Experiments with lattice-based PPRLM language identification, in Speaker and Language Recognition Workshop, pp. 1–6 (2006)

20.

H. Li, B. Ma, C.H. Lee, A vector space modeling approach to spoken language identification. IEEE Trans. Audio Speech Lang. Process. 15(1), 271–284 (2007)CrossRef

21.

K.C. Sim, H. Li, On acoustic diversification front-end for spoken language identification. IEEE Trans. Audio Speech Lang. Process. 16(5), 1029–1037 (2008)CrossRef

22.

R. Tong, B. Ma, H. Li, E.S. Chng, A target-oriented phonotactic front-end for spoken language recognition. IEEE Trans. Audio Speech Lang. Process. 17(7), 1335–1347 (2009)CrossRef

23.

G.R. Botha, E. Barnard, Factors that affect the accuracy of text-based language identification. Comput. Speech Lang. 26(5), 307–320 (2012)CrossRef

24.

N. Barroso, K. Lopez de Ipina, C. Hernandez, A. Ezeiza, M. Grana, Semantic speech recognition in the Basque context Part II: language identification for under-resourced languages. Int. J. Speech Technol. 15(1), 41–47 (2012)CrossRef

25.

S.M. Siniscalchi, J. Reed, T. Svendsen, C.-H. Lee, Universal attribute characterization of spoken languages for automatic spoken language recognition. Comput. Speech Lang. 27(1), 209–227 (2013)CrossRef

26.

J.T. Foil, Language identification using noisy speech, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 861–864, (1986)

27.

F. Goodman, A. Martin, R. Wohlford, Improved automatic language identification in noisy speech, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 528–531, May 1989

28.

M. Sugiyama, Automatic language recognition using acoustic features, in IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 813–816, May 1991

29.

D. Morgan, L. Riek, W. Mistretta, C. Scofield, P. Grouin, F. Hull, Experiments in language identification with neural networks. Int. Joint Conf. Neural Netw. 2, 320–325 (1992)

30.

M. Zissman, Automatic language identification using gaussian mixture and hidden markov models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 399–402, April 1993

31.

D.A. Reynolds, R.C. Rose, Robust text -independent speaker identification using gaussian mixture speaker models. IEEE Trans. Audio Speech Lang. Process. 3(1), 72–83 (1995)CrossRef

32.

S. Itahashi, J. Zhou, K. Tanaka, Spoken language discrimination using speech fundamental frequency, in International Conference on Spoken Language Processing (ICSLP), pp. 1899–1902, (1994)

33.

I. Shuichi, D. Liang, Language identification based on speech fundamental frequency, in European Speech Communication Association (EUROSPEECH), pp. 1359–1362 (1995)

34.

K.P. Li, Automatic language identification using syllabic spectral features, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. I/297–I/300, April 1994

35.

F. Pellegrino, R. Andre-Obrecht, An unsupervised approach to language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 833–836, Mar 1999

36.

J.L. Rouas, J. Farinas, F. Pellegrino, R. Andr-Obrecht, Rhythmic unit extraction and modelling for automatic language identification. Speech Commun. 47, 436–456 (2005)CrossRef

37.

J.L. Rouas, Automatic prosodic variations modeling for language and dialect discrimination. IEEE Trans. Audio Speech Lang. Process. 15(6), 1904–1911 (2007)CrossRef

38.

A. Sangwan, M. Mehrabani, J. Hansen, Automatic language analysis and identification based on speech production knowledge, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5006–5009, March 2010

39.

D. Martinez, L. Burget, L. Ferrer, N. Scheffer, i-vector based prosodic system for language identification, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4861–4864, March 2012

40.

J. Balleda, H.A. Murthy, T. Nagarajan, Language Identification from Short Segments of Speech, in International Conference on Spoken Language Processing (ICSLP), pp. 1033–1036, October 2000

41.

T. Nagarajan, Implicit system for spoken language identification, Ph.D. dissertation, Indian Institute of Technology Madras, India (2004)

42.

A.K.V.S. Jayaram, V. Ramasubramanian, T.V. Sreenivas, Language identification using parallel sub-word recognition, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 32–35, April 2003

43.

L. Mary, B. Yegnanarayana, Autoassociative neural network models for language identification. in Internatioanl Conference on Intelligent Sensing and Information Processing, pp. 317–320 (2004)

44.

L. Mary, Multilevel implicit features for language and speaker recognition, Ph.D. dissertation, Indian Institute of Technology Madras, India (2006)

45.

K.S. Rao, S. Maity, V.R. Reddy, Pitch synchronous and glottal closure based speech analysis for language recognition. Int. J. Speech Technol. (Springer) 16(4), 413–430 (2013)CrossRef

46.

V.R. Reddy, S. Maity, K.S. Rao, Identification of indian languages using multi-level spectral and prosodic features. Int. J. Speech Technol. (Springer) 16(4), 489–511 (2013)CrossRef

47.

S. Jothilakshmi, V. Ramalingam, S. Palanivel, A hierarchical language identification system for Indian languages. Digital Signal Process. (Elsevier) 22(3), 544–553 (2012)CrossRefMathSciNet

48.

B. Bhaskar, D. Nandi, K.S. Rao, Analysis of language identification performance based on gender and hierarchial grouping approaches, in International Conference on Natural Language Processing, December 2013

49.

B. Yegnanarayana, T.K. Raja, Perfoemance of linear prediction analysis on speech with additive noise, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (1977)

50.

B. Yegnanarayana, S.R.M. Prasanna, J. Zachariah, C. Gupta, Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system. IEEE Trans. Audio Speech Lang. Process. 13(4), 575–582 (2005)CrossRef

51.

C.S. Gupta, S.R.M. Prasanna, B. Yegnanarayana, Autoassociative neural network models for online speaker verification using source features from vowels, in IEEE International Joint Conference Neural Networks, May 2002

52.

D. Pati, S.R.M. Prasanna, Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information. Int. J. Speech Technol. (Springer) 14(1), 49–63 (2011)CrossRef

53.

D. Pati, D. Nandi, K. Sreenivasa Rao, Robustness of excitation source information for language independent speaker recognition, in 16th International Oriental COCOSDA Conference, Gurgoan, November 2013

54.

A. Bajpai, B. Yegnanarayana, Exploring features for audio clip classification using LP residual and AANN models, in International Conference on Intelligent Sensing and Information Processing, pp. 305–310, January 2004

55.

K.S. Rao, S.G. Koolagudi, Characterization and recognition of emotions from speech using excitation source information. Int. J. Speech Technol. (Springer) 16, 181–201 (2013)CrossRef

56.

K.S. Rao, B. Yegnanarayana, Duration modification using glottal closure instants and vowel onset points. Speech Commun. 51(12), 1263–1269 (2009)CrossRef

57.

K.S. Rao, B. Yegnanarayana, Prosody modification using instants of significant excitation. IEEE Trans. Audio Speech Lang. Process. 14(3), 972–980 (2006)CrossRef

58.

K.S. Rao, S.R.M. Prasanna, B. Yegnanarayana, Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. IEEE Signal Process. Lett. 14(10), 762–765 (2007)CrossRef

59.

K.S. Rao, Unconstrained pitch contour modification using instants of significant excitation. Circuits Syst. Signal Process. (Springer) 31(6), 2133–2152 (2012)CrossRef

60.

K.S. Rao, Voice conversion by mapping the speaker-specific features using pitch synchronous approach. Comput. Speech Lang. 24(3), 474–494 (2010)CrossRef

61.

R. Hussain Laskar, K. Banerjee, F. Ahmed Talukdar, K. Sreenivasa Rao, A pitch synchronous approach to design voice conversion system using source-filter correlation. Int. J. Speech Technol. (Springer) 15(3), 419–431 (2012)CrossRef

Titel: Language Identification—A Brief Review
verfasst von: K. Sreenivasa Rao
Dipanjan Nandi
Verlag: Springer International Publishing
Buch: Language Identification Using Excitation Source Features
Print ISBN: 978-3-319-17724-3

Electronic ISBN: 978-3-319-17725-0

Copyright-Jahr: 2015
DOI: https://doi.org/10.1007/978-3-319-17725-0_2

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Die Gewinner und Laudatoren des Sustainability Award in Automotive 2024/© Uli Regenscheit | ATZlive, Search Icon, Banner Hanser, Sebastian Glenschek/© Hermes International, Dinko Eror/© Red Hat GmbH, Suresh Vittal/© Alteryx, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, ATZ-Webinar: Prototypenfreie Entwicklung durch Offline- und Driver-in-the-Loop-HiL-Tests /© (c) VI-grade, chassis.tech plus 2023/© [M] ATZlive / TÜV SÜD PRODUCT SERVICE GMBH, adäsion-Webinar-Matinee/© krystiannawrocki_ Getty Images

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.