Skip to main content

2015 | OriginalPaper | Buchkapitel

2. Language Identification—A Brief Review

verfasst von : K. Sreenivasa Rao, Dipanjan Nandi

Erschienen in: Language Identification Using Excitation Source Features

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This chapter provides compendious reviews about both the explicit and implicit LID systems present in the literature. Existing works related to language identification in Indian context are briefly discussed. The related works about the excitation source features are also presented here. Various speech features and models proposed in the context of language identification are briefly reviewed in this chapter. The motivation for the present work from the existing literature is briefly discussed.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat R. Leonard, G. Doddington, Automatic language identification. Technical Report RADC-TR-74-200 (Air Force Rome Air Development Center, Technical Report) August 1974 R. Leonard, G. Doddington, Automatic language identification. Technical Report RADC-TR-74-200 (Air Force Rome Air Development Center, Technical Report) August 1974
2.
Zurück zum Zitat R. Leonard, Language Recognition Test and Evaluation. Technical Report RADCTR-80-83 (Air Force Rome Air Development Center, Technical Report). March 1980 R. Leonard, Language Recognition Test and Evaluation. Technical Report RADCTR-80-83 (Air Force Rome Air Development Center, Technical Report). March 1980
3.
Zurück zum Zitat A.S. House, E.P. Neuberg, Toward automatic identification of the languages of an utterance. J. Acoust. Soc. Am. 62(3), 708–713 (1977)CrossRef A.S. House, E.P. Neuberg, Toward automatic identification of the languages of an utterance. J. Acoust. Soc. Am. 62(3), 708–713 (1977)CrossRef
4.
Zurück zum Zitat K.P. Li, T.J. Edwards, Statistical models for automatic language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 884–887, April 1980 K.P. Li, T.J. Edwards, Statistical models for automatic language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 884–887, April 1980
5.
Zurück zum Zitat L.F. Lamel, J.L. Gauvain, Cross lingual experiments with phone recognition. in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 507–510, April 1993 L.F. Lamel, J.L. Gauvain, Cross lingual experiments with phone recognition. in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 507–510, April 1993
6.
Zurück zum Zitat L.F. Lamel, J.L. Gauvain, Language identification using phonebased acoustic likelihoods, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, , pp. I/293–I/296, April 1994 L.F. Lamel, J.L. Gauvain, Language identification using phonebased acoustic likelihoods, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, , pp. I/293–I/296, April 1994
7.
Zurück zum Zitat Y. Muthusamy, R. Cole, M. Gopalakrishnan, A segment-based approach to automatic language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 353–356, April 1991 Y. Muthusamy, R. Cole, M. Gopalakrishnan, A segment-based approach to automatic language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 353–356, April 1991
8.
Zurück zum Zitat K.M. Berkling, T. Arai, E. Bernard, Analysis of phoneme based features for language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. I/289–I/292, April 1994 K.M. Berkling, T. Arai, E. Bernard, Analysis of phoneme based features for language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. I/289–I/292, April 1994
9.
Zurück zum Zitat R.C.F. Tucker, M. Carey, E. Parris, Automatic language identification using sub-word models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. I/301–I/30, April 1994 R.C.F. Tucker, M. Carey, E. Parris, Automatic language identification using sub-word models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. I/301–I/30, April 1994
10.
Zurück zum Zitat M.A. Zissman, E. Singer, Automatic language identification of telephone speech messages using phoneme recognition and N-gram modeling, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 pp. I/305–I/308, (1994) M.A. Zissman, E. Singer, Automatic language identification of telephone speech messages using phoneme recognition and N-gram modeling, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 pp. I/305–I/308, (1994)
11.
Zurück zum Zitat S. Kadambe, J. Hieronymus, Language identification with phonological and lexical models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. 3507–351, May 1995 S. Kadambe, J. Hieronymus, Language identification with phonological and lexical models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. 3507–351, May 1995
12.
Zurück zum Zitat Y. Yan, E. Barnard, An approach to automatic language identification based on language-dependent phone recognition, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. 3511–3514, May 1995 Y. Yan, E. Barnard, An approach to automatic language identification based on language-dependent phone recognition, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. 3511–3514, May 1995
13.
Zurück zum Zitat J. Navratil, W. Zuhlke, Phonetic-context mapping in language identification. Eur. Speech Commun. Assoc. (EUROSPEECH) 1, 71–74 (1997) J. Navratil, W. Zuhlke, Phonetic-context mapping in language identification. Eur. Speech Commun. Assoc. (EUROSPEECH) 1, 71–74 (1997)
14.
Zurück zum Zitat T.J. Hazen, V.W. Zue, Segment-based automatic language identification. J. Acoust. Soc. Am. 101, 2323–2331 (1997)CrossRef T.J. Hazen, V.W. Zue, Segment-based automatic language identification. J. Acoust. Soc. Am. 101, 2323–2331 (1997)CrossRef
15.
Zurück zum Zitat K. Kirchhoff, S. Parandekar, Multi-stream statistical n-gram modeling with application to automatic language identification, in European Speech Communication Association (EUROSPEECH), pp. 803–806, (2001) K. Kirchhoff, S. Parandekar, Multi-stream statistical n-gram modeling with application to automatic language identification, in European Speech Communication Association (EUROSPEECH), pp. 803–806, (2001)
16.
Zurück zum Zitat T. Gleason, M. Zissman, Composite background models and score standardization for language identification systems, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 529–532 (2001) T. Gleason, M. Zissman, Composite background models and score standardization for language identification systems, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 529–532 (2001)
17.
Zurück zum Zitat V. Ramasubramanian, A.K.V.S. Jayram, T.V. Sreenivas, Language identification using parallel sub-word recognition - an ergodic HMM equivalence, European Speech Communication Association (EUROSPEECH) (Geneva, Switzerland), September 2003 V. Ramasubramanian, A.K.V.S. Jayram, T.V. Sreenivas, Language identification using parallel sub-word recognition - an ergodic HMM equivalence, European Speech Communication Association (EUROSPEECH) (Geneva, Switzerland), September 2003
18.
Zurück zum Zitat J. Gauvain, A. Messaoudi, H. Schwenk, Language recognition using phone latices, in International Speech Communication Association (INTERSPEECH), pp. 25–28 (2004) J. Gauvain, A. Messaoudi, H. Schwenk, Language recognition using phone latices, in International Speech Communication Association (INTERSPEECH), pp. 25–28 (2004)
19.
Zurück zum Zitat W. Shen, W. Campbell, T. Gleason, D. Reynolds, E. Singer, Experiments with lattice-based PPRLM language identification, in Speaker and Language Recognition Workshop, pp. 1–6 (2006) W. Shen, W. Campbell, T. Gleason, D. Reynolds, E. Singer, Experiments with lattice-based PPRLM language identification, in Speaker and Language Recognition Workshop, pp. 1–6 (2006)
20.
Zurück zum Zitat H. Li, B. Ma, C.H. Lee, A vector space modeling approach to spoken language identification. IEEE Trans. Audio Speech Lang. Process. 15(1), 271–284 (2007)CrossRef H. Li, B. Ma, C.H. Lee, A vector space modeling approach to spoken language identification. IEEE Trans. Audio Speech Lang. Process. 15(1), 271–284 (2007)CrossRef
21.
Zurück zum Zitat K.C. Sim, H. Li, On acoustic diversification front-end for spoken language identification. IEEE Trans. Audio Speech Lang. Process. 16(5), 1029–1037 (2008)CrossRef K.C. Sim, H. Li, On acoustic diversification front-end for spoken language identification. IEEE Trans. Audio Speech Lang. Process. 16(5), 1029–1037 (2008)CrossRef
22.
Zurück zum Zitat R. Tong, B. Ma, H. Li, E.S. Chng, A target-oriented phonotactic front-end for spoken language recognition. IEEE Trans. Audio Speech Lang. Process. 17(7), 1335–1347 (2009)CrossRef R. Tong, B. Ma, H. Li, E.S. Chng, A target-oriented phonotactic front-end for spoken language recognition. IEEE Trans. Audio Speech Lang. Process. 17(7), 1335–1347 (2009)CrossRef
23.
Zurück zum Zitat G.R. Botha, E. Barnard, Factors that affect the accuracy of text-based language identification. Comput. Speech Lang. 26(5), 307–320 (2012)CrossRef G.R. Botha, E. Barnard, Factors that affect the accuracy of text-based language identification. Comput. Speech Lang. 26(5), 307–320 (2012)CrossRef
24.
Zurück zum Zitat N. Barroso, K. Lopez de Ipina, C. Hernandez, A. Ezeiza, M. Grana, Semantic speech recognition in the Basque context Part II: language identification for under-resourced languages. Int. J. Speech Technol. 15(1), 41–47 (2012)CrossRef N. Barroso, K. Lopez de Ipina, C. Hernandez, A. Ezeiza, M. Grana, Semantic speech recognition in the Basque context Part II: language identification for under-resourced languages. Int. J. Speech Technol. 15(1), 41–47 (2012)CrossRef
25.
Zurück zum Zitat S.M. Siniscalchi, J. Reed, T. Svendsen, C.-H. Lee, Universal attribute characterization of spoken languages for automatic spoken language recognition. Comput. Speech Lang. 27(1), 209–227 (2013)CrossRef S.M. Siniscalchi, J. Reed, T. Svendsen, C.-H. Lee, Universal attribute characterization of spoken languages for automatic spoken language recognition. Comput. Speech Lang. 27(1), 209–227 (2013)CrossRef
26.
Zurück zum Zitat J.T. Foil, Language identification using noisy speech, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 861–864, (1986) J.T. Foil, Language identification using noisy speech, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 861–864, (1986)
27.
Zurück zum Zitat F. Goodman, A. Martin, R. Wohlford, Improved automatic language identification in noisy speech, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 528–531, May 1989 F. Goodman, A. Martin, R. Wohlford, Improved automatic language identification in noisy speech, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 528–531, May 1989
28.
Zurück zum Zitat M. Sugiyama, Automatic language recognition using acoustic features, in IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 813–816, May 1991 M. Sugiyama, Automatic language recognition using acoustic features, in IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 813–816, May 1991
29.
Zurück zum Zitat D. Morgan, L. Riek, W. Mistretta, C. Scofield, P. Grouin, F. Hull, Experiments in language identification with neural networks. Int. Joint Conf. Neural Netw. 2, 320–325 (1992) D. Morgan, L. Riek, W. Mistretta, C. Scofield, P. Grouin, F. Hull, Experiments in language identification with neural networks. Int. Joint Conf. Neural Netw. 2, 320–325 (1992)
30.
Zurück zum Zitat M. Zissman, Automatic language identification using gaussian mixture and hidden markov models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 399–402, April 1993 M. Zissman, Automatic language identification using gaussian mixture and hidden markov models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 399–402, April 1993
31.
Zurück zum Zitat D.A. Reynolds, R.C. Rose, Robust text -independent speaker identification using gaussian mixture speaker models. IEEE Trans. Audio Speech Lang. Process. 3(1), 72–83 (1995)CrossRef D.A. Reynolds, R.C. Rose, Robust text -independent speaker identification using gaussian mixture speaker models. IEEE Trans. Audio Speech Lang. Process. 3(1), 72–83 (1995)CrossRef
32.
Zurück zum Zitat S. Itahashi, J. Zhou, K. Tanaka, Spoken language discrimination using speech fundamental frequency, in International Conference on Spoken Language Processing (ICSLP), pp. 1899–1902, (1994) S. Itahashi, J. Zhou, K. Tanaka, Spoken language discrimination using speech fundamental frequency, in International Conference on Spoken Language Processing (ICSLP), pp. 1899–1902, (1994)
33.
Zurück zum Zitat I. Shuichi, D. Liang, Language identification based on speech fundamental frequency, in European Speech Communication Association (EUROSPEECH), pp. 1359–1362 (1995) I. Shuichi, D. Liang, Language identification based on speech fundamental frequency, in European Speech Communication Association (EUROSPEECH), pp. 1359–1362 (1995)
34.
Zurück zum Zitat K.P. Li, Automatic language identification using syllabic spectral features, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. I/297–I/300, April 1994 K.P. Li, Automatic language identification using syllabic spectral features, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. I/297–I/300, April 1994
35.
Zurück zum Zitat F. Pellegrino, R. Andre-Obrecht, An unsupervised approach to language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 833–836, Mar 1999 F. Pellegrino, R. Andre-Obrecht, An unsupervised approach to language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 833–836, Mar 1999
36.
Zurück zum Zitat J.L. Rouas, J. Farinas, F. Pellegrino, R. Andr-Obrecht, Rhythmic unit extraction and modelling for automatic language identification. Speech Commun. 47, 436–456 (2005)CrossRef J.L. Rouas, J. Farinas, F. Pellegrino, R. Andr-Obrecht, Rhythmic unit extraction and modelling for automatic language identification. Speech Commun. 47, 436–456 (2005)CrossRef
37.
Zurück zum Zitat J.L. Rouas, Automatic prosodic variations modeling for language and dialect discrimination. IEEE Trans. Audio Speech Lang. Process. 15(6), 1904–1911 (2007)CrossRef J.L. Rouas, Automatic prosodic variations modeling for language and dialect discrimination. IEEE Trans. Audio Speech Lang. Process. 15(6), 1904–1911 (2007)CrossRef
38.
Zurück zum Zitat A. Sangwan, M. Mehrabani, J. Hansen, Automatic language analysis and identification based on speech production knowledge, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5006–5009, March 2010 A. Sangwan, M. Mehrabani, J. Hansen, Automatic language analysis and identification based on speech production knowledge, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5006–5009, March 2010
39.
Zurück zum Zitat D. Martinez, L. Burget, L. Ferrer, N. Scheffer, i-vector based prosodic system for language identification, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4861–4864, March 2012 D. Martinez, L. Burget, L. Ferrer, N. Scheffer, i-vector based prosodic system for language identification, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4861–4864, March 2012
40.
Zurück zum Zitat J. Balleda, H.A. Murthy, T. Nagarajan, Language Identification from Short Segments of Speech, in International Conference on Spoken Language Processing (ICSLP), pp. 1033–1036, October 2000 J. Balleda, H.A. Murthy, T. Nagarajan, Language Identification from Short Segments of Speech, in International Conference on Spoken Language Processing (ICSLP), pp. 1033–1036, October 2000
41.
Zurück zum Zitat T. Nagarajan, Implicit system for spoken language identification, Ph.D. dissertation, Indian Institute of Technology Madras, India (2004) T. Nagarajan, Implicit system for spoken language identification, Ph.D. dissertation, Indian Institute of Technology Madras, India (2004)
42.
Zurück zum Zitat A.K.V.S. Jayaram, V. Ramasubramanian, T.V. Sreenivas, Language identification using parallel sub-word recognition, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 32–35, April 2003 A.K.V.S. Jayaram, V. Ramasubramanian, T.V. Sreenivas, Language identification using parallel sub-word recognition, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 32–35, April 2003
43.
Zurück zum Zitat L. Mary, B. Yegnanarayana, Autoassociative neural network models for language identification. in Internatioanl Conference on Intelligent Sensing and Information Processing, pp. 317–320 (2004) L. Mary, B. Yegnanarayana, Autoassociative neural network models for language identification. in Internatioanl Conference on Intelligent Sensing and Information Processing, pp. 317–320 (2004)
44.
Zurück zum Zitat L. Mary, Multilevel implicit features for language and speaker recognition, Ph.D. dissertation, Indian Institute of Technology Madras, India (2006) L. Mary, Multilevel implicit features for language and speaker recognition, Ph.D. dissertation, Indian Institute of Technology Madras, India (2006)
45.
Zurück zum Zitat K.S. Rao, S. Maity, V.R. Reddy, Pitch synchronous and glottal closure based speech analysis for language recognition. Int. J. Speech Technol. (Springer) 16(4), 413–430 (2013)CrossRef K.S. Rao, S. Maity, V.R. Reddy, Pitch synchronous and glottal closure based speech analysis for language recognition. Int. J. Speech Technol. (Springer) 16(4), 413–430 (2013)CrossRef
46.
Zurück zum Zitat V.R. Reddy, S. Maity, K.S. Rao, Identification of indian languages using multi-level spectral and prosodic features. Int. J. Speech Technol. (Springer) 16(4), 489–511 (2013)CrossRef V.R. Reddy, S. Maity, K.S. Rao, Identification of indian languages using multi-level spectral and prosodic features. Int. J. Speech Technol. (Springer) 16(4), 489–511 (2013)CrossRef
47.
Zurück zum Zitat S. Jothilakshmi, V. Ramalingam, S. Palanivel, A hierarchical language identification system for Indian languages. Digital Signal Process. (Elsevier) 22(3), 544–553 (2012)CrossRefMathSciNet S. Jothilakshmi, V. Ramalingam, S. Palanivel, A hierarchical language identification system for Indian languages. Digital Signal Process. (Elsevier) 22(3), 544–553 (2012)CrossRefMathSciNet
48.
Zurück zum Zitat B. Bhaskar, D. Nandi, K.S. Rao, Analysis of language identification performance based on gender and hierarchial grouping approaches, in International Conference on Natural Language Processing, December 2013 B. Bhaskar, D. Nandi, K.S. Rao, Analysis of language identification performance based on gender and hierarchial grouping approaches, in International Conference on Natural Language Processing, December 2013
49.
Zurück zum Zitat B. Yegnanarayana, T.K. Raja, Perfoemance of linear prediction analysis on speech with additive noise, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (1977) B. Yegnanarayana, T.K. Raja, Perfoemance of linear prediction analysis on speech with additive noise, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (1977)
50.
Zurück zum Zitat B. Yegnanarayana, S.R.M. Prasanna, J. Zachariah, C. Gupta, Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system. IEEE Trans. Audio Speech Lang. Process. 13(4), 575–582 (2005)CrossRef B. Yegnanarayana, S.R.M. Prasanna, J. Zachariah, C. Gupta, Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system. IEEE Trans. Audio Speech Lang. Process. 13(4), 575–582 (2005)CrossRef
51.
Zurück zum Zitat C.S. Gupta, S.R.M. Prasanna, B. Yegnanarayana, Autoassociative neural network models for online speaker verification using source features from vowels, in IEEE International Joint Conference Neural Networks, May 2002 C.S. Gupta, S.R.M. Prasanna, B. Yegnanarayana, Autoassociative neural network models for online speaker verification using source features from vowels, in IEEE International Joint Conference Neural Networks, May 2002
52.
Zurück zum Zitat D. Pati, S.R.M. Prasanna, Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information. Int. J. Speech Technol. (Springer) 14(1), 49–63 (2011)CrossRef D. Pati, S.R.M. Prasanna, Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information. Int. J. Speech Technol. (Springer) 14(1), 49–63 (2011)CrossRef
53.
Zurück zum Zitat D. Pati, D. Nandi, K. Sreenivasa Rao, Robustness of excitation source information for language independent speaker recognition, in 16th International Oriental COCOSDA Conference, Gurgoan, November 2013 D. Pati, D. Nandi, K. Sreenivasa Rao, Robustness of excitation source information for language independent speaker recognition, in 16th International Oriental COCOSDA Conference, Gurgoan, November 2013
54.
Zurück zum Zitat A. Bajpai, B. Yegnanarayana, Exploring features for audio clip classification using LP residual and AANN models, in International Conference on Intelligent Sensing and Information Processing, pp. 305–310, January 2004 A. Bajpai, B. Yegnanarayana, Exploring features for audio clip classification using LP residual and AANN models, in International Conference on Intelligent Sensing and Information Processing, pp. 305–310, January 2004
55.
Zurück zum Zitat K.S. Rao, S.G. Koolagudi, Characterization and recognition of emotions from speech using excitation source information. Int. J. Speech Technol. (Springer) 16, 181–201 (2013)CrossRef K.S. Rao, S.G. Koolagudi, Characterization and recognition of emotions from speech using excitation source information. Int. J. Speech Technol. (Springer) 16, 181–201 (2013)CrossRef
56.
Zurück zum Zitat K.S. Rao, B. Yegnanarayana, Duration modification using glottal closure instants and vowel onset points. Speech Commun. 51(12), 1263–1269 (2009)CrossRef K.S. Rao, B. Yegnanarayana, Duration modification using glottal closure instants and vowel onset points. Speech Commun. 51(12), 1263–1269 (2009)CrossRef
57.
Zurück zum Zitat K.S. Rao, B. Yegnanarayana, Prosody modification using instants of significant excitation. IEEE Trans. Audio Speech Lang. Process. 14(3), 972–980 (2006)CrossRef K.S. Rao, B. Yegnanarayana, Prosody modification using instants of significant excitation. IEEE Trans. Audio Speech Lang. Process. 14(3), 972–980 (2006)CrossRef
58.
Zurück zum Zitat K.S. Rao, S.R.M. Prasanna, B. Yegnanarayana, Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. IEEE Signal Process. Lett. 14(10), 762–765 (2007)CrossRef K.S. Rao, S.R.M. Prasanna, B. Yegnanarayana, Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. IEEE Signal Process. Lett. 14(10), 762–765 (2007)CrossRef
59.
Zurück zum Zitat K.S. Rao, Unconstrained pitch contour modification using instants of significant excitation. Circuits Syst. Signal Process. (Springer) 31(6), 2133–2152 (2012)CrossRef K.S. Rao, Unconstrained pitch contour modification using instants of significant excitation. Circuits Syst. Signal Process. (Springer) 31(6), 2133–2152 (2012)CrossRef
60.
Zurück zum Zitat K.S. Rao, Voice conversion by mapping the speaker-specific features using pitch synchronous approach. Comput. Speech Lang. 24(3), 474–494 (2010)CrossRef K.S. Rao, Voice conversion by mapping the speaker-specific features using pitch synchronous approach. Comput. Speech Lang. 24(3), 474–494 (2010)CrossRef
61.
Zurück zum Zitat R. Hussain Laskar, K. Banerjee, F. Ahmed Talukdar, K. Sreenivasa Rao, A pitch synchronous approach to design voice conversion system using source-filter correlation. Int. J. Speech Technol. (Springer) 15(3), 419–431 (2012)CrossRef R. Hussain Laskar, K. Banerjee, F. Ahmed Talukdar, K. Sreenivasa Rao, A pitch synchronous approach to design voice conversion system using source-filter correlation. Int. J. Speech Technol. (Springer) 15(3), 419–431 (2012)CrossRef
Metadaten
Titel
Language Identification—A Brief Review
verfasst von
K. Sreenivasa Rao
Dipanjan Nandi
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-17725-0_2

Neuer Inhalt