Skip to main content
Erschienen in: International Journal of Speech Technology 2/2017

09.04.2017

An automatic speech recognition system for spontaneous Punjabi speech corpus

verfasst von: Yogesh Kumar, Navdeep Singh

Erschienen in: International Journal of Speech Technology | Ausgabe 2/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Automatic speech recognition is the central part of the wheel towards the natural person-to-machine interaction technique. Due to the high disparity of speaking styles, speech recognition surely demands composite methods to constitute this irregularity. A speech recognition method can work in numerous distinct states such as speaker dependent/independent speech, isolated/continuous/spontaneous speech recognition, for less to very large vocabulary. The Punjabi language is being spoken by concerning 104 million peoples in India, Pakistan and other countries with Punjabi migrants. The Punjabi language is written in Gurmukhi writing in Indian Punjab, while in Shahmukhi writing in Pakistani Punjab. In the paper, the objective is to build the speaker independent automatic spontaneous speech recognition system for the Punjabi language. The system is also capable to recognize the spontaneous Punjabi live speech. So far, no work has to be achieved in the area of spontaneous speech recognition system for the Punjabi language. The user interfaces for Punjabi live speech system is created by using the java programming. Till now, automatic speech system is trained with 6012 Punjabi words and 1433 Punjabi sentences. The performance measured in terms of recognition accuracy which is 93.79% for Punjabi words and 90.8% for Punjabi sentences.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Alhawiti, K. M. (2015) Advances in artificial intelligence using speech recognition. International Journal of Computer, Electrical, Automation, Control and Information Engineering, 9(6), 1439–1442. Alhawiti, K. M. (2015) Advances in artificial intelligence using speech recognition. International Journal of Computer, Electrical, Automation, Control and Information Engineering, 9(6), 1439–1442.
Zurück zum Zitat Ankit, A., Mishra, S. K., Shaikh, R., Gupta, C. K., Mathur, P., Pawar, S. (2016) A survey paper on acoustic speech recognition techniques. International Journal of Recent Advances in Engineering and Technology, 7(7), 2347–2812. Ankit, A., Mishra, S. K., Shaikh, R., Gupta, C. K., Mathur, P., Pawar, S. (2016) A survey paper on acoustic speech recognition techniques. International Journal of Recent Advances in Engineering and Technology, 7(7), 2347–2812.
Zurück zum Zitat Bhardwaj, B., Kumar, D. (2016) Free Model speech recognition system using MFCC model. International Journal of Innovative Research in Computer and Communication Engineering, 4(5), 10065–10073. Bhardwaj, B., Kumar, D. (2016) Free Model speech recognition system using MFCC model. International Journal of Innovative Research in Computer and Communication Engineering, 4(5), 10065–10073.
Zurück zum Zitat Hoesen, D., Satriawan, C. H., Lestari, D. P., Khodra, M. L. (2016): Towards robust Indonesian speech recognition with spontaneous-speech adapted acoustic models. Procedia Computer Science 81, 167–173.CrossRef Hoesen, D., Satriawan, C. H., Lestari, D. P., Khodra, M. L. (2016): Towards robust Indonesian speech recognition with spontaneous-speech adapted acoustic models. Procedia Computer Science 81, 167–173.CrossRef
Zurück zum Zitat Kumar, Y., Singh, N. (2015) A first step towards an automatic spontaneous speech recognition system for Punjabi language. International Journal of Statistics and Reliability Engineering, 2, 81–93. Kumar, Y., Singh, N. (2015) A first step towards an automatic spontaneous speech recognition system for Punjabi language. International Journal of Statistics and Reliability Engineering, 2, 81–93.
Zurück zum Zitat Kumar, Y., Singh, N. (2016) Automatic spontaneous speech recognition for Punjabi language interview speech corpus. I.J. Education and Management Engineering, 6, 64–73. Kumar, Y., Singh, N. (2016) Automatic spontaneous speech recognition for Punjabi language interview speech corpus. I.J. Education and Management Engineering, 6, 64–73.
Zurück zum Zitat Kumar, Y., Singh, N. (2016) An automatic spontaneous live speech recognition system for Punjabi language corpus. IJCTA, 9(20), 259–266. Kumar, Y., Singh, N. (2016) An automatic spontaneous live speech recognition system for Punjabi language corpus. IJCTA, 9(20), 259–266.
Zurück zum Zitat Muda, L., Begam, M., Elamvazuthi, I. (2010) Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping techniques. Journal of Computing, 2(3),138–143. Muda, L., Begam, M., Elamvazuthi, I. (2010) Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping techniques. Journal of Computing, 2(3),138–143.
Zurück zum Zitat Narayanan, A., Wang, D. (2015) Improving robustness of deep neural network acoustic models via speech separation and joint adaptive training. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(1), 92–101. Narayanan, A., Wang, D. (2015) Improving robustness of deep neural network acoustic models via speech separation and joint adaptive training. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(1), 92–101.
Zurück zum Zitat Patadia, J., Reshamwala, A. (2016) Feature extraction approach in emotional speech recognition system. International Journal of Advanced Research in Computer Science and Software Engineering, 6(5), 706–710. Patadia, J., Reshamwala, A. (2016) Feature extraction approach in emotional speech recognition system. International Journal of Advanced Research in Computer Science and Software Engineering, 6(5), 706–710.
Zurück zum Zitat Swamy, S., Ramakrishnan, K. V. (2013) An efficient speech recognition system. An International Journal, 3(4), 21–27. Swamy, S., Ramakrishnan, K. V. (2013) An efficient speech recognition system. An International Journal, 3(4), 21–27.
Metadaten
Titel
An automatic speech recognition system for spontaneous Punjabi speech corpus
verfasst von
Yogesh Kumar
Navdeep Singh
Publikationsdatum
09.04.2017
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 2/2017
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-017-9408-2

Weitere Artikel der Ausgabe 2/2017

International Journal of Speech Technology 2/2017 Zur Ausgabe

Neuer Inhalt