nach oben

International Journal of Speech Technology

Erschienen in:

09.04.2017

An automatic speech recognition system for spontaneous Punjabi speech corpus

verfasst von: Yogesh Kumar, Navdeep Singh

Erschienen in: International Journal of Speech Technology | Ausgabe 2/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Automatic speech recognition is the central part of the wheel towards the natural person-to-machine interaction technique. Due to the high disparity of speaking styles, speech recognition surely demands composite methods to constitute this irregularity. A speech recognition method can work in numerous distinct states such as speaker dependent/independent speech, isolated/continuous/spontaneous speech recognition, for less to very large vocabulary. The Punjabi language is being spoken by concerning 104 million peoples in India, Pakistan and other countries with Punjabi migrants. The Punjabi language is written in Gurmukhi writing in Indian Punjab, while in Shahmukhi writing in Pakistani Punjab. In the paper, the objective is to build the speaker independent automatic spontaneous speech recognition system for the Punjabi language. The system is also capable to recognize the spontaneous Punjabi live speech. So far, no work has to be achieved in the area of spontaneous speech recognition system for the Punjabi language. The user interfaces for Punjabi live speech system is created by using the java programming. Till now, automatic speech system is trained with 6012 Punjabi words and 1433 Punjabi sentences. The performance measured in terms of recognition accuracy which is 93.79% for Punjabi words and 90.8% for Punjabi sentences.

Vorheriger Artikel Extraction of terms and semantic relationships from Arabic texts for automatic construction of an ontology

Nächster Artikel Discourse prosody planning in native (L1) and nonnative (L2) (L1-Bengali) English: a comparative study

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Alhawiti, K. M. (2015) Advances in artificial intelligence using speech recognition. International Journal of Computer, Electrical, Automation, Control and Information Engineering, 9(6), 1439–1442.

Ankit, A., Mishra, S. K., Shaikh, R., Gupta, C. K., Mathur, P., Pawar, S. (2016) A survey paper on acoustic speech recognition techniques. International Journal of Recent Advances in Engineering and Technology, 7(7), 2347–2812.

Bhardwaj, B., Kumar, D. (2016) Free Model speech recognition system using MFCC model. International Journal of Innovative Research in Computer and Communication Engineering, 4(5), 10065–10073.

Hoesen, D., Satriawan, C. H., Lestari, D. P., Khodra, M. L. (2016): Towards robust Indonesian speech recognition with spontaneous-speech adapted acoustic models. Procedia Computer Science 81, 167–173.CrossRef

Kumar, Y., Singh, N. (2015) A first step towards an automatic spontaneous speech recognition system for Punjabi language. International Journal of Statistics and Reliability Engineering, 2, 81–93.

Kumar, Y., Singh, N. (2016) Automatic spontaneous speech recognition for Punjabi language interview speech corpus. I.J. Education and Management Engineering, 6, 64–73.

Kumar, Y., Singh, N. (2016) An automatic spontaneous live speech recognition system for Punjabi language corpus. IJCTA, 9(20), 259–266.

Muda, L., Begam, M., Elamvazuthi, I. (2010) Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping techniques. Journal of Computing, 2(3),138–143.

Narayanan, A., Wang, D. (2015) Improving robustness of deep neural network acoustic models via speech separation and joint adaptive training. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(1), 92–101.

Patadia, J., Reshamwala, A. (2016) Feature extraction approach in emotional speech recognition system. International Journal of Advanced Research in Computer Science and Software Engineering, 6(5), 706–710.

Swamy, S., Ramakrishnan, K. V. (2013) An efficient speech recognition system. An International Journal, 3(4), 21–27.

Titel: An automatic speech recognition system for spontaneous Punjabi speech corpus
verfasst von: Yogesh Kumar
Navdeep Singh
Publikationsdatum: 09.04.2017
Verlag: Springer US
Erschienen in: International Journal of Speech Technology / Ausgabe 2/2017
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI: https://doi.org/10.1007/s10772-017-9408-2

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Nachhaltigkeitsaward Key Visual/© Cometis AG/Global ESG Monitor | Daniel Rupp | Generiert mit KI, Search Icon, Banner Hanser, Interview Entropie Bild 1/© Bernhard Weßling, Joerg Schweinsberg/© Datacore Software, Smart Factory Symbolbild/© TensorSpark | Generated with AI | Getty Images, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Sustainibility Finance/© Robert Kneschke / stock.adobe.com / Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 2/2017

Semantic role labeling for Arabic language using case-based reasoning approach

Speech enhancement using MMSE estimation under phase uncertainty

Multi-pitch estimation based on multi-scale product analysis, improved comb filter and dynamic programming

Extraction of terms and semantic relationships from Arabic texts for automatic construction of an ontology

Supervector-based approaches in a discriminative framework for speaker verification in noisy environments

Discourse prosody planning in native (L1) and nonnative (L2) (L1-Bengali) English: a comparative study

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.