Skip to main content
Top
Published in: International Journal of Speech Technology 2/2020

10-02-2020 | Manuscript

Identification of regional dialects of Telugu language using text independent speech processing models

Authors: S. Shivaprasad, M. Sadanandam

Published in: International Journal of Speech Technology | Issue 2/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Telugu language is one of the important languages in the world. The language that is spoken by most of the people in a region is called as dialect. In the recent days, speech recognition system is present in almost all electronic devices. In this, dialects of particular language perform a vital role. The accurate dialects identification technique helps in not only enhancing its features but also expected to provide in modern services in health and telemedicine for older and homebound peoples. Like any other language, even Telugu language has diversified itself into different dialects viz., Telangana, Kostha Andhra, and Rayalaseema. Combination of all the dialects is the language TELUGU and it is a perfect blend of elegance in Sanskrit, sweetness in Tamil along with the essence of Kannada language. The formation of dialects can be of different reasons. For speech processing research, till today there is no standard speech database created for Telugu dialects. In this paper we developed a speech database that can be utilized for the recognition of Telugu dialects and we had applied two modeling techniques that are, Hidden Markov Model (HMM) and Gaussian mixture model (GMM) in order to recognize the dialects of Telugu language by using speech independant utterances. We imposed Mel-Frequency Cepstral Coefficient for extracting the spectral features from the obtained speech data and observed that GMM provides better accurate results than HMM.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Al-Walaie, M. A., & Khan, M. B. (2017). Arabic dialects classification using text mining techniques. In International conference on computer and applications (ICCA). Al-Walaie, M. A., & Khan, M. B. (2017). Arabic dialects classification using text mining techniques. In International conference on computer and applications (ICCA).
go back to reference Bailey, C. N. (1968). Is there a midland dialect? Washington, D.C.: ERIC Clearinghouse. Bailey, C. N. (1968). Is there a midland dialect? Washington, D.C.: ERIC Clearinghouse.
go back to reference Balleda, J., Murthy, H. A., & Nagarajan, T. (2000). Language identification from short segments of speech. In Proceedings of the INTERSPEECH (pp. 1033–1036). Balleda, J., Murthy, H. A., & Nagarajan, T. (2000). Language identification from short segments of speech. In Proceedings of the INTERSPEECH (pp. 1033–1036).
go back to reference Chen, M., Wang, L., & Xu, C.-Z. (2017). A novel approach of system design for dialect speech interaction with NAO robot. In 18th international conference on advanced robotics (ICAR). Chen, M., Wang, L., & Xu, C.-Z. (2017). A novel approach of system design for dialect speech interaction with NAO robot. In 18th international conference on advanced robotics (ICAR).
go back to reference Chittaragi, N. B., & Koolagudi, S. G. (2017). Acoustic features based word level dialect classification using SVM and ensemble methods. In IC3, Noida, 10–12 August 2017. Chittaragi, N. B., & Koolagudi, S. G. (2017). Acoustic features based word level dialect classification using SVM and ensemble methods. In IC3, Noida, 10–12 August 2017.
go back to reference Grierson, G. A. (1886). Linguistic survey of India (LSI). In Seventh international oriental congress. Grierson, G. A. (1886). Linguistic survey of India (LSI). In Seventh international oriental congress.
go back to reference Ibrahim, J., & Lestari, D. P. (2017). Classification and clustering to identify spoken dialects in Indonesian. In International conference on data and software engineering (ICoDSE). Ibrahim, J., & Lestari, D. P. (2017). Classification and clustering to identify spoken dialects in Indonesian. In International conference on data and software engineering (ICoDSE).
go back to reference Ismail, T., & Deka, G.K. (2017). Identification of Kamrupi dialect and similar languages. In 4th International conference on signal processing and integrated networks, SPIN. Ismail, T., & Deka, G.K. (2017). Identification of Kamrupi dialect and similar languages. In 4th International conference on signal processing and integrated networks, SPIN.
go back to reference Jothilakshmi, S., Ramalingam, V., & Palanivil, S. (2012). A hierarchical language identification system for Indian languages. Digital Signal Process,22(3), 544–553.MathSciNetCrossRef Jothilakshmi, S., Ramalingam, V., & Palanivil, S. (2012). A hierarchical language identification system for Indian languages. Digital Signal Process,22(3), 544–553.MathSciNetCrossRef
go back to reference Khan, S., Ali, H., & Ullah, K. (2017). Pashto language dialect recognition using mel frequency cepstral coefficient and support vector machines. In International conference on innovations in electrical engineering and computational technologies (ICIEECT). Khan, S., Ali, H., & Ullah, K. (2017). Pashto language dialect recognition using mel frequency cepstral coefficient and support vector machines. In International conference on innovations in electrical engineering and computational technologies (ICIEECT).
go back to reference Mahnoosh, M., & Hansen, J. H. L. (2015). Automatic analysis of dialect/language sets. International Journal of Speech Technology,18(3), 277–286.CrossRef Mahnoosh, M., & Hansen, J. H. L. (2015). Automatic analysis of dialect/language sets. International Journal of Speech Technology,18(3), 277–286.CrossRef
go back to reference Manwani, N., Mitra, S. K., & Joshi, M. V. (2007). Spoken language identification for Indian languages using split and merge EMAlgorithm. In A. Ghosh, R. K. De, & S. K. Pal (Eds.), Pattern recognition and machine intelligence. Editions. PReMI 2007. Lecture notes in computer science (Vol. 4815, pp. 463–468). Manwani, N., Mitra, S. K., & Joshi, M. V. (2007). Spoken language identification for Indian languages using split and merge EMAlgorithm. In A. Ghosh, R. K. De, & S. K. Pal (Eds.), Pattern recognition and machine intelligence. Editions. PReMI 2007. Lecture notes in computer science (Vol. 4815, pp. 463–468).
go back to reference Mengistu, A. D., & Melesew, D. (2017). Text independent Amharic language dialect recognition: A hybrid approach of VQ and GMM. International Journal of Signal Processing, Image Processing and Pattern Recognition, 10(1), 215–222.CrossRef Mengistu, A. D., & Melesew, D. (2017). Text independent Amharic language dialect recognition: A hybrid approach of VQ and GMM. International Journal of Signal Processing, Image Processing and Pattern Recognition, 10(1), 215–222.CrossRef
go back to reference Mohanty, S. (2011). Phonotactic model for spoken language identification in Indian language perspective. International Journal of Computers and Applications,19(9), 18–24.CrossRef Mohanty, S. (2011). Phonotactic model for spoken language identification in Indian language perspective. International Journal of Computers and Applications,19(9), 18–24.CrossRef
go back to reference Reddy, V. R., Maity, S., & Rao, K. S. (2013). Identification of Indian languages using multi-level spectral and prosodic features. International Journal of Speech Technology,16(4), 489–511.CrossRef Reddy, V. R., Maity, S., & Rao, K. S. (2013). Identification of Indian languages using multi-level spectral and prosodic features. International Journal of Speech Technology,16(4), 489–511.CrossRef
go back to reference Roy, P. (2010). Language recognition of three Indian languages based on clustering and supervised learning. In Proceedings of the international conference on computer applications—telecommunications (pp. 77–82). Roy, P. (2010). Language recognition of three Indian languages based on clustering and supervised learning. In Proceedings of the international conference on computer applications—telecommunications (pp. 77–82).
go back to reference Sadanandam, M., & Kamakshi Prasad, V. (2013). Automatic text independent language identification using reduct set of feature vectors. In IEEE international conference on fuzzy systems (FUZZ-IEEE). Springer. Sadanandam, M., & Kamakshi Prasad, V. (2013). Automatic text independent language identification using reduct set of feature vectors. In IEEE international conference on fuzzy systems (FUZZ-IEEE). Springer.
Metadata
Title
Identification of regional dialects of Telugu language using text independent speech processing models
Authors
S. Shivaprasad
M. Sadanandam
Publication date
10-02-2020
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 2/2020
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-020-09678-y

Other articles of this Issue 2/2020

International Journal of Speech Technology 2/2020 Go to the issue