Skip to main content
Top

2017 | OriginalPaper | Chapter

Hindi Dialect (Bangro) Spoken Language Recognition (HD-SLR) System Using Sphinx3

Authors : Virender Kadyan, Amitoj Singh, Parth Wadhwa

Published in: Proceeding of International Conference on Intelligent Communication, Control and Devices

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A Hindi dialect (Bangro) Spoken Language Recognition (HD-SLR) System is designed to recognize language from a given spoken utterance. Paper focuses on the influence of Hindi dialects, i.e., Haryanvi spoken by males and females of different age groups ranging from 18 to 40 years. The system is trained and tested with the help of Sphinx3 toolkit on Linux platform. Also, it has been tried with semicontinuous speech corpus in clean environment of around 5 h that includes 1000 distinct Hindi dialect words spoken in different parts of Haryana. The dialectal information of the input speech signals is extracted with the help of MFCC technique and the same system is then tested on the basis of utterance level. The Speaker Independent Semicontinuous (SISC) word recognition system has an average of 75–85 % accuracy rate by native and nonnative speakers of Hindi dialect.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Huggins-Daines, D., Kumar, M., Chan, A., Black, A. W., Ravishankar, M., and Rudnicky, A. “Pocketsphinx: A free, real-time continuous speech recognition system for hand-held devices.” IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Vol. 1. IEEE, (2006) 1–1. Huggins-Daines, D., Kumar, M., Chan, A., Black, A. W., Ravishankar, M., and Rudnicky, A. “Pocketsphinx: A free, real-time continuous speech recognition system for hand-held devices.” IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Vol. 1. IEEE, (2006) 1–1.
2.
go back to reference Rao, K. S., and Koolagudi, S. G. “Identification of Hindi dialects and emotions using spectral and prosodic features of speech”. IJSCI: International Journal of Systemics, Cybernetics and Informatics 9.4 (2011) 24–33. Rao, K. S., and Koolagudi, S. G. “Identification of Hindi dialects and emotions using spectral and prosodic features of speech”. IJSCI: International Journal of Systemics, Cybernetics and Informatics 9.4 (2011) 24–33.
3.
go back to reference Sinha, S., Jain, A., and Agrawal, S. S. “Speech processing for Hindi dialect recognition.” Advances in Signal Processing and Intelligent Recognition Systems. Springer International Publishing, (2014) 161–169. Sinha, S., Jain, A., and Agrawal, S. S. “Speech processing for Hindi dialect recognition.” Advances in Signal Processing and Intelligent Recognition Systems. Springer International Publishing, (2014) 161–169.
4.
go back to reference Huang, R., Hansen, J. H., n Angkititrakul, P. “Dialect/accent classification using unrestricted audio.” Audio, Speech, IEEE Transactions and Language Processing, 15(2) (2007) 453–464. Huang, R., Hansen, J. H., n Angkititrakul, P. “Dialect/accent classification using unrestricted audio.” Audio, Speech, IEEE Transactions and Language Processing, 15(2) (2007) 453–464.
5.
go back to reference Liu, M., Xu, B., Hunng, T., Deng, Y., and Li, C. “Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling.” IEEE International Conference, Acoustics, Speech, and Signal Processing. ICASSP. Proceedings Vol. 2. IEEE, (2000) 1025–1028. Liu, M., Xu, B., Hunng, T., Deng, Y., and Li, C. “Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling.” IEEE International Conference, Acoustics, Speech, and Signal Processing. ICASSP. Proceedings Vol. 2. IEEE, (2000) 1025–1028.
6.
go back to reference Rabiner, L. R. “A tutorial on hidden Markov models and selected applications in speech recognition.” Proceedings of the IEEE 77.2 (1989) 257–286. Rabiner, L. R. “A tutorial on hidden Markov models and selected applications in speech recognition.” Proceedings of the IEEE 77.2 (1989) 257–286.
7.
go back to reference Aggarwal, R. K., and Dave, M. “Acoustic modeling problem for automatic speech recognition system: conventional methods (Part I)”. International Journal of Speech Technology, 14(4), (2011) 297–308. Aggarwal, R. K., and Dave, M. “Acoustic modeling problem for automatic speech recognition system: conventional methods (Part I)”. International Journal of Speech Technology, 14(4), (2011) 297–308.
Metadata
Title
Hindi Dialect (Bangro) Spoken Language Recognition (HD-SLR) System Using Sphinx3
Authors
Virender Kadyan
Amitoj Singh
Parth Wadhwa
Copyright Year
2017
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-1708-7_116

Premium Partner