
2024 | OriginalPaper | Chapter

Voice Enabled Form Filling Using Hidden Markov Model

Authors: Babu Sallagundla, Bharath Naik Kethavath, Shaik Arshad Hussain Mitaigiri, Siddartha Kata, Kodandaram Sri Satya Sai Merla

Published in: Advanced Computing

Publisher: Springer Nature Switzerland


Abstract

Speech recognition technology is widely used for voice-enabled form filling. Filling out forms manually by typing has become increasingly challenging and time-consuming, an issue that is particularly evident in settings such as job and internship applications. To address this problem, we propose a system that automates form filling using speech recognition. The ability to operate systems by voice command is a crucial factor in today’s environment. The proposed system fills out forms automatically: it analyses the user’s unique voice, identifies the user’s speech, and then transcribes that speech into text. This paper proposes a machine-learning model built on the Hidden Markov Model. The model is trained and tested within this system, with Mel Frequency Cepstral Coefficients as the pre-processing method, a technique that has been widely used in automatic voice recognition. The results demonstrate that the system transcribes user speech into text effectively and accurately, significantly simplifying the form-filling process. With these results, we hope to demonstrate how this technology has the potential to revolutionize data entry and accessibility, while also establishing a strong case for speech recognition as a convenient way to speed up form completion.
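
To make the recognition step described in the abstract more concrete, the following is a minimal sketch of how a Hidden Markov Model can pick the most likely word for a sequence of acoustic observations. It is not the authors' implementation. It assumes that each audio frame has already been reduced to a discrete symbol (for example by quantizing its Mel Frequency Cepstral Coefficient vector), and the two word models "name" and "email", their states, and all probabilities are hypothetical values chosen only for illustration.

```python
import math


def forward_log_likelihood(obs, start, trans, emit):
    """Return log P(obs | model) for a discrete HMM (scaled forward algorithm)."""
    n_states = len(start)
    # Initialise with the start distribution weighted by the first emission.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    log_lik = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through the transitions, then weight by the emission.
            alpha = [
                sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][obs[t]]
                for s in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # the model cannot produce this sequence
        # Rescale to avoid underflow and accumulate the log of the scale factor.
        alpha = [a / scale for a in alpha]
        log_lik += math.log(scale)
    return log_lik


# Hypothetical two-state, left-to-right word models over three symbols (0, 1, 2).
# Assumption: each symbol stands for one quantized MFCC frame.
MODELS = {
    "name": {
        "start": [1.0, 0.0],
        "trans": [[0.7, 0.3], [0.0, 1.0]],
        "emit": [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1]],
    },
    "email": {
        "start": [1.0, 0.0],
        "trans": [[0.7, 0.3], [0.0, 1.0]],
        "emit": [[0.1, 0.1, 0.8], [0.1, 0.8, 0.1]],
    },
}


def recognise(obs, models):
    """Return the word whose HMM assigns the highest likelihood to obs."""
    return max(
        models,
        key=lambda w: forward_log_likelihood(
            obs, models[w]["start"], models[w]["trans"], models[w]["emit"]
        ),
    )


print(recognise([0, 0, 1, 1], MODELS))
print(recognise([2, 2, 1], MODELS))
```

In a form-filling setting, the recognised word would then be written into the matching form field. A practical system would typically model continuous MFCC vectors, for instance with Gaussian emissions, rather than the discrete symbols assumed here.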


Metadata
Copyright Year
2024
DOI
https://doi.org/10.1007/978-3-031-56700-1_18
