2021 | OriginalPaper | Chapter

Speech Recognition Using Neural Network for Mobile Robot Navigation

Authors: Prashant Patel, Arockia Selvakumar Arockia Doss, L. PavanKalyan, Parag J. Tarwadi

Published in: Trends in Mechanical and Biomedical Design

Publisher: Springer Singapore

Abstract

Automatic speech recognition (ASR) has become popular in mobile robotics, where commands can be sent to the robot wirelessly to maneuver it. Combining a navigation system with ASR is difficult to implement, because the system struggles to recognize voice commands when the environment already contains disturbances such as road noise, air conditioning, music, and passengers. The objective of this research is to operate a mobile robot with a single-arm manipulator that can perceive speech and react swiftly and precisely to individual commands spoken by the operator. To recognize the speech, a mel-frequency cepstral coefficient (MFCC) speech recognition algorithm was chosen and implemented in MATLAB. The MFCC algorithm was trained and tested repeatedly, since it must process speech data in real time and respond to it. After training and testing on voice commands collected from five test subjects, both male and female, the speech recognition system achieved 89% efficiency on the test database.
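
As an illustration of the MFCC feature extraction named in the abstract, the following is a minimal sketch in Python using only the standard library. It is not the authors' MATLAB implementation. The frame length, hop size, number of mel filters, number of coefficients, sample rate, and all function names are assumptions chosen for the example, and steps that a practical system might add (pre-emphasis, liftering, a fast FFT) are omitted. The sketch follows the textbook pipeline: Hamming window, power spectrum, triangular mel filterbank, logarithm, and DCT-II.

```python
import cmath
import math

def hz_to_mel(f):
    # Common mel-scale formula (one of several conventions in use).
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Hamming-window one frame and return its one-sided power spectrum (naive DFT)."""
    n = len(frame)
    windowed = [s * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, s in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(windowed))
        spec.append(abs(acc) ** 2 / n)
    return spec

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale from 0 Hz to Nyquist."""
    top = hz_to_mel(sample_rate / 2.0)
    edges_hz = [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * f / sample_rate)) for f in edges_hz]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = bins[m - 1], bins[m], bins[m + 1]
        row = [0.0] * (n_fft // 2 + 1)
        for k in range(lo, mid):          # rising edge of the triangle
            row[k] = (k - lo) / (mid - lo)
        for k in range(mid, hi):          # falling edge of the triangle
            row[k] = (hi - k) / (hi - mid)
        bank.append(row)
    return bank

def dct2(values, n_out):
    """Unnormalized DCT-II, keeping the first n_out coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n)
                for i, v in enumerate(values))
            for k in range(n_out)]

def mfcc(signal, sample_rate, frame_len=256, hop=128, n_filters=20, n_coeffs=13):
    """Return one list of n_coeffs cepstral coefficients per full frame."""
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    features = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        spec = power_spectrum(signal[start:start + frame_len])
        energies = [sum(w * p for w, p in zip(row, spec)) for row in bank]
        # Floor the energies so the logarithm is always defined.
        log_e = [math.log(max(e, 1e-10)) for e in energies]
        features.append(dct2(log_e, n_coeffs))
    return features

# Usage: a synthetic 440 Hz tone stands in for a recorded voice command.
sr = 8000
tone = [math.sin(2 * math.pi * 440 * t / sr) for t in range(1024)]
feats = mfcc(tone, sr)
print(len(feats), len(feats[0]))  # 7 frames, 13 coefficients each
```

In a recognizer of the kind the abstract describes, per-frame vectors like these would be passed to a classifier that maps each utterance to a robot command. The classifier is outside the scope of this sketch.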


Metadata
Title
Speech Recognition Using Neural Network for Mobile Robot Navigation
Authors
Prashant Patel
Arockia Selvakumar Arockia Doss
L. PavanKalyan
Parag J. Tarwadi
Copyright Year
2021
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-4488-0_56
