nach oben

International Journal of Speech Technology

Erschienen in:

10.02.2020 | Manuscript

Identification of regional dialects of Telugu language using text independent speech processing models

verfasst von: S. Shivaprasad, M. Sadanandam

Erschienen in: International Journal of Speech Technology | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Telugu language is one of the important languages in the world. The language that is spoken by most of the people in a region is called as dialect. In the recent days, speech recognition system is present in almost all electronic devices. In this, dialects of particular language perform a vital role. The accurate dialects identification technique helps in not only enhancing its features but also expected to provide in modern services in health and telemedicine for older and homebound peoples. Like any other language, even Telugu language has diversified itself into different dialects viz., Telangana, Kostha Andhra, and Rayalaseema. Combination of all the dialects is the language TELUGU and it is a perfect blend of elegance in Sanskrit, sweetness in Tamil along with the essence of Kannada language. The formation of dialects can be of different reasons. For speech processing research, till today there is no standard speech database created for Telugu dialects. In this paper we developed a speech database that can be utilized for the recognition of Telugu dialects and we had applied two modeling techniques that are, Hidden Markov Model (HMM) and Gaussian mixture model (GMM) in order to recognize the dialects of Telugu language by using speech independant utterances. We imposed Mel-Frequency Cepstral Coefficient for extracting the spectral features from the obtained speech data and observed that GMM provides better accurate results than HMM.

Vorheriger Artikel New and robust composite micro structure descriptor (CMSD) for CBIR

Nächster Artikel ASIC implementation of distributed arithmetic based FIR filter using RNS for high speed DSP systems

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Al-Walaie, M. A., & Khan, M. B. (2017). Arabic dialects classification using text mining techniques. In International conference on computer and applications (ICCA).

Bailey, C. N. (1968). Is there a midland dialect? Washington, D.C.: ERIC Clearinghouse.

Balleda, J., Murthy, H. A., & Nagarajan, T. (2000). Language identification from short segments of speech. In Proceedings of the INTERSPEECH (pp. 1033–1036).

Chen, M., Wang, L., & Xu, C.-Z. (2017). A novel approach of system design for dialect speech interaction with NAO robot. In 18th international conference on advanced robotics (ICAR).

Chittaragi, N. B., & Koolagudi, S. G. (2017). Acoustic features based word level dialect classification using SVM and ensemble methods. In IC3, Noida, 10–12 August 2017.

Grierson, G. A. (1886). Linguistic survey of India (LSI). In Seventh international oriental congress.

Ibrahim, J., & Lestari, D. P. (2017). Classification and clustering to identify spoken dialects in Indonesian. In International conference on data and software engineering (ICoDSE).

Ismail, T., & Singh, L. J. (2017). Dialect identification of Assamese language using spectral features. Indian Journal of Science and Technology,10(20), 1–7. https://doi.org/10.17485/ijst/2017/v10i20/115033.CrossRef

Ismail, T., & Deka, G.K. (2017). Identification of Kamrupi dialect and similar languages. In 4th International conference on signal processing and integrated networks, SPIN.

Jothilakshmi, S., Ramalingam, V., & Palanivil, S. (2012). A hierarchical language identification system for Indian languages. Digital Signal Process,22(3), 544–553.MathSciNetCrossRef

Khan, S., Ali, H., & Ullah, K. (2017). Pashto language dialect recognition using mel frequency cepstral coefficient and support vector machines. In International conference on innovations in electrical engineering and computational technologies (ICIEECT).

Mahnoosh, M., & Hansen, J. H. L. (2015). Automatic analysis of dialect/language sets. International Journal of Speech Technology,18(3), 277–286.CrossRef

Manwani, N., Mitra, S. K., & Joshi, M. V. (2007). Spoken language identification for Indian languages using split and merge EMAlgorithm. In A. Ghosh, R. K. De, & S. K. Pal (Eds.), Pattern recognition and machine intelligence. Editions. PReMI 2007. Lecture notes in computer science (Vol. 4815, pp. 463–468).

Mengistu, A. D., & Melesew, D. (2017). Text independent Amharic language dialect recognition: A hybrid approach of VQ and GMM. International Journal of Signal Processing, Image Processing and Pattern Recognition, 10(1), 215–222.CrossRef

Mohanty, S. (2011). Phonotactic model for spoken language identification in Indian language perspective. International Journal of Computers and Applications,19(9), 18–24.CrossRef

Reddy, V. R., Maity, S., & Rao, K. S. (2013). Identification of Indian languages using multi-level spectral and prosodic features. International Journal of Speech Technology,16(4), 489–511.CrossRef

Roy, P. (2010). Language recognition of three Indian languages based on clustering and supervised learning. In Proceedings of the international conference on computer applications—telecommunications (pp. 77–82).

Sadanandam, M., & Kamakshi Prasad, V. (2013). Automatic text independent language identification using reduct set of feature vectors. In IEEE international conference on fuzzy systems (FUZZ-IEEE). Springer.

Titel: Identification of regional dialects of Telugu language using text independent speech processing models
verfasst von: S. Shivaprasad
M. Sadanandam
Publikationsdatum: 10.02.2020
Verlag: Springer US
Erschienen in: International Journal of Speech Technology / Ausgabe 2/2020
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI: https://doi.org/10.1007/s10772-020-09678-y

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Internationaler Motorenkongress/© [M] ATZlive | Chisnikov / Fotolia.com, Search Icon, Banner Hanser, Benedikt Bonnmann von Adesso/© Adesso, Teilzeit/© Fokussiert / stock.adobe.com, Hans-Joachim Lefeld/© Lucht Probst Associates GmbH, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, 2023_Antrieb/© supervisuell, ATZ-Webinar: Prototypenfreie Entwicklung durch Offline- und Driver-in-the-Loop-HiL-Tests /© (c) VI-grade, chassis.tech plus 2023/© [M] ATZlive / TÜV SÜD PRODUCT SERVICE GMBH

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 2/2020

Bi-directional LSTM–CNN Combined method for Sentiment Analysis in Part of Speech Tagging (PoS)

ASIC implementation of distributed arithmetic based FIR filter using RNS for high speed DSP systems

Synthesis of phased array antenna for side lobe level reduction using the differential evolution algorithm

A genetic model for acoustic and phonetic decoding of standard arabic vowels in continuous speech

A probabilistic stochastic model for analysis on the epileptic syndrome using speech synthesis and state space representation

Audio compression with multi-algorithm fusion and its impact in speech emotion recognition

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.