2018 | Original Paper | Book Chapter

Speech Recognition System Using Open-Source Speech Engine for Indian Names

Authors: Nitin Arun Kallole, R. Prakash

Published in: Intelligent Embedded Systems

Publisher: Springer Singapore

Abstract

Speaker independence, continuous speech and large vocabularies pose some of the greatest challenges in automatic speech recognition. This paper describes Sphinx, a library that makes accurate, large-vocabulary, speaker-independent, continuous speech recognition feasible. Using speech for device control is a proven hands-free solution, and several products already use speech input for hands-free control. However, they usually cater to users with US or UK accents. In this paper, a speech recognition system is developed for a hands-free control system to be deployed in an automotive environment for Indian users. The paper demonstrates the methodology and the challenges of customizing an open-source speech recognition engine for Indian users, using two applications: speech-based control of a smartphone and rear-view mirror rotation. The open-source packages used are PocketSphinx for speech recognition and Festival for text-to-speech and pronunciation generation. All implementations run on a single-board computer, the Raspberry Pi.
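
Customizing a recognizer for names typically involves supplying a pronunciation dictionary and a restricted command grammar. The following is a minimal sketch of that idea, not the authors' implementation: it writes dictionary lines in the "word followed by phones" layout used by Sphinx-style dictionaries and a small JSGF grammar. The names, the phone sequences, the `call <name>` command shape and the helper function names are all illustrative assumptions; the abstract does not specify them, and the casing conventions of a real acoustic model's dictionary should be checked before use.

```python
# Illustrative sketch only: names, phones and command shape are assumptions,
# not taken from the paper.

# Hypothetical pronunciations written with ARPAbet-style phone symbols.
NAME_PRONUNCIATIONS = {
    "nitin": ["N", "IH", "T", "IH", "N"],
    "prakash": ["P", "R", "AH", "K", "AA", "SH"],
}
VERB_PRONUNCIATIONS = {
    "call": ["K", "AO", "L"],
}


def build_dictionary(entries):
    """Return dictionary text with one 'word PH PH ...' line per entry."""
    lines = [f"{word} {' '.join(phones)}" for word, phones in sorted(entries.items())]
    return "\n".join(lines) + "\n"


def build_jsgf(grammar_name, verbs, names):
    """Return a JSGF grammar accepting '<verb> <name>' utterances."""
    verb_alternatives = " | ".join(sorted(verbs))
    name_alternatives = " | ".join(sorted(names))
    return (
        "#JSGF V1.0;\n"
        f"grammar {grammar_name};\n"
        "public <command> = <verb> <name>;\n"
        f"<verb> = {verb_alternatives};\n"
        f"<name> = {name_alternatives};\n"
    )


def missing_words(grammar_words, dictionary_entries):
    """List grammar words that have no pronunciation in the dictionary."""
    return sorted(w for w in grammar_words if w not in dictionary_entries)


if __name__ == "__main__":
    entries = {**NAME_PRONUNCIATIONS, **VERB_PRONUNCIATIONS}
    print(build_dictionary(entries), end="")
    print(build_jsgf("names", VERB_PRONUNCIATIONS, NAME_PRONUNCIATIONS), end="")
```

Checking with `missing_words` before loading the grammar is a cheap safeguard, since a grammar word without a dictionary entry cannot be recognized. In practice the phone sequences for unfamiliar names would come from a pronunciation generator rather than being typed by hand.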

Metadata
Title
Speech Recognition System Using Open-Source Speech Engine for Indian Names
Authors
Nitin Arun Kallole
R. Prakash
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-8575-8_26
