Skip to main content
Erschienen in: International Journal of Speech Technology 2/2020

10.02.2020 | Manuscript

Identification of regional dialects of Telugu language using text independent speech processing models

verfasst von: S. Shivaprasad, M. Sadanandam

Erschienen in: International Journal of Speech Technology | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Telugu language is one of the important languages in the world. The language that is spoken by most of the people in a region is called as dialect. In the recent days, speech recognition system is present in almost all electronic devices. In this, dialects of particular language perform a vital role. The accurate dialects identification technique helps in not only enhancing its features but also expected to provide in modern services in health and telemedicine for older and homebound peoples. Like any other language, even Telugu language has diversified itself into different dialects viz., Telangana, Kostha Andhra, and Rayalaseema. Combination of all the dialects is the language TELUGU and it is a perfect blend of elegance in Sanskrit, sweetness in Tamil along with the essence of Kannada language. The formation of dialects can be of different reasons. For speech processing research, till today there is no standard speech database created for Telugu dialects. In this paper we developed a speech database that can be utilized for the recognition of Telugu dialects and we had applied two modeling techniques that are, Hidden Markov Model (HMM) and Gaussian mixture model (GMM) in order to recognize the dialects of Telugu language by using speech independant utterances. We imposed Mel-Frequency Cepstral Coefficient for extracting the spectral features from the obtained speech data and observed that GMM provides better accurate results than HMM.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Al-Walaie, M. A., & Khan, M. B. (2017). Arabic dialects classification using text mining techniques. In International conference on computer and applications (ICCA). Al-Walaie, M. A., & Khan, M. B. (2017). Arabic dialects classification using text mining techniques. In International conference on computer and applications (ICCA).
Zurück zum Zitat Bailey, C. N. (1968). Is there a midland dialect? Washington, D.C.: ERIC Clearinghouse. Bailey, C. N. (1968). Is there a midland dialect? Washington, D.C.: ERIC Clearinghouse.
Zurück zum Zitat Balleda, J., Murthy, H. A., & Nagarajan, T. (2000). Language identification from short segments of speech. In Proceedings of the INTERSPEECH (pp. 1033–1036). Balleda, J., Murthy, H. A., & Nagarajan, T. (2000). Language identification from short segments of speech. In Proceedings of the INTERSPEECH (pp. 1033–1036).
Zurück zum Zitat Chen, M., Wang, L., & Xu, C.-Z. (2017). A novel approach of system design for dialect speech interaction with NAO robot. In 18th international conference on advanced robotics (ICAR). Chen, M., Wang, L., & Xu, C.-Z. (2017). A novel approach of system design for dialect speech interaction with NAO robot. In 18th international conference on advanced robotics (ICAR).
Zurück zum Zitat Chittaragi, N. B., & Koolagudi, S. G. (2017). Acoustic features based word level dialect classification using SVM and ensemble methods. In IC3, Noida, 10–12 August 2017. Chittaragi, N. B., & Koolagudi, S. G. (2017). Acoustic features based word level dialect classification using SVM and ensemble methods. In IC3, Noida, 10–12 August 2017.
Zurück zum Zitat Grierson, G. A. (1886). Linguistic survey of India (LSI). In Seventh international oriental congress. Grierson, G. A. (1886). Linguistic survey of India (LSI). In Seventh international oriental congress.
Zurück zum Zitat Ibrahim, J., & Lestari, D. P. (2017). Classification and clustering to identify spoken dialects in Indonesian. In International conference on data and software engineering (ICoDSE). Ibrahim, J., & Lestari, D. P. (2017). Classification and clustering to identify spoken dialects in Indonesian. In International conference on data and software engineering (ICoDSE).
Zurück zum Zitat Ismail, T., & Deka, G.K. (2017). Identification of Kamrupi dialect and similar languages. In 4th International conference on signal processing and integrated networks, SPIN. Ismail, T., & Deka, G.K. (2017). Identification of Kamrupi dialect and similar languages. In 4th International conference on signal processing and integrated networks, SPIN.
Zurück zum Zitat Jothilakshmi, S., Ramalingam, V., & Palanivil, S. (2012). A hierarchical language identification system for Indian languages. Digital Signal Process,22(3), 544–553.MathSciNetCrossRef Jothilakshmi, S., Ramalingam, V., & Palanivil, S. (2012). A hierarchical language identification system for Indian languages. Digital Signal Process,22(3), 544–553.MathSciNetCrossRef
Zurück zum Zitat Khan, S., Ali, H., & Ullah, K. (2017). Pashto language dialect recognition using mel frequency cepstral coefficient and support vector machines. In International conference on innovations in electrical engineering and computational technologies (ICIEECT). Khan, S., Ali, H., & Ullah, K. (2017). Pashto language dialect recognition using mel frequency cepstral coefficient and support vector machines. In International conference on innovations in electrical engineering and computational technologies (ICIEECT).
Zurück zum Zitat Mahnoosh, M., & Hansen, J. H. L. (2015). Automatic analysis of dialect/language sets. International Journal of Speech Technology,18(3), 277–286.CrossRef Mahnoosh, M., & Hansen, J. H. L. (2015). Automatic analysis of dialect/language sets. International Journal of Speech Technology,18(3), 277–286.CrossRef
Zurück zum Zitat Manwani, N., Mitra, S. K., & Joshi, M. V. (2007). Spoken language identification for Indian languages using split and merge EMAlgorithm. In A. Ghosh, R. K. De, & S. K. Pal (Eds.), Pattern recognition and machine intelligence. Editions. PReMI 2007. Lecture notes in computer science (Vol. 4815, pp. 463–468). Manwani, N., Mitra, S. K., & Joshi, M. V. (2007). Spoken language identification for Indian languages using split and merge EMAlgorithm. In A. Ghosh, R. K. De, & S. K. Pal (Eds.), Pattern recognition and machine intelligence. Editions. PReMI 2007. Lecture notes in computer science (Vol. 4815, pp. 463–468).
Zurück zum Zitat Mengistu, A. D., & Melesew, D. (2017). Text independent Amharic language dialect recognition: A hybrid approach of VQ and GMM. International Journal of Signal Processing, Image Processing and Pattern Recognition, 10(1), 215–222.CrossRef Mengistu, A. D., & Melesew, D. (2017). Text independent Amharic language dialect recognition: A hybrid approach of VQ and GMM. International Journal of Signal Processing, Image Processing and Pattern Recognition, 10(1), 215–222.CrossRef
Zurück zum Zitat Mohanty, S. (2011). Phonotactic model for spoken language identification in Indian language perspective. International Journal of Computers and Applications,19(9), 18–24.CrossRef Mohanty, S. (2011). Phonotactic model for spoken language identification in Indian language perspective. International Journal of Computers and Applications,19(9), 18–24.CrossRef
Zurück zum Zitat Reddy, V. R., Maity, S., & Rao, K. S. (2013). Identification of Indian languages using multi-level spectral and prosodic features. International Journal of Speech Technology,16(4), 489–511.CrossRef Reddy, V. R., Maity, S., & Rao, K. S. (2013). Identification of Indian languages using multi-level spectral and prosodic features. International Journal of Speech Technology,16(4), 489–511.CrossRef
Zurück zum Zitat Roy, P. (2010). Language recognition of three Indian languages based on clustering and supervised learning. In Proceedings of the international conference on computer applications—telecommunications (pp. 77–82). Roy, P. (2010). Language recognition of three Indian languages based on clustering and supervised learning. In Proceedings of the international conference on computer applications—telecommunications (pp. 77–82).
Zurück zum Zitat Sadanandam, M., & Kamakshi Prasad, V. (2013). Automatic text independent language identification using reduct set of feature vectors. In IEEE international conference on fuzzy systems (FUZZ-IEEE). Springer. Sadanandam, M., & Kamakshi Prasad, V. (2013). Automatic text independent language identification using reduct set of feature vectors. In IEEE international conference on fuzzy systems (FUZZ-IEEE). Springer.
Metadaten
Titel
Identification of regional dialects of Telugu language using text independent speech processing models
verfasst von
S. Shivaprasad
M. Sadanandam
Publikationsdatum
10.02.2020
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 2/2020
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-020-09678-y

Weitere Artikel der Ausgabe 2/2020

International Journal of Speech Technology 2/2020 Zur Ausgabe

Neuer Inhalt