2024 | Original Paper | Book chapter

Voice Enabled Form Filling Using Hidden Markov Model

Written by: Babu Sallagundla, Bharath Naik Kethavath, Shaik Arshad Hussain Mitaigiri, Siddartha Kata, Kodandaram Sri Satya Sai Merla

Published in: Advanced Computing

Publisher: Springer Nature Switzerland

Abstract

Speech recognition technology is widely used for voice-enabled form filling. Filling out forms manually by typing has become increasingly challenging and time-consuming, a problem that is particularly evident in contexts such as job and internship applications. To address it, this paper proposes a system that automates the form-filling process using speech recognition. The ability to operate devices by voice command is a crucial factor in today's environment. The proposed system fills out forms automatically: it analyses the user's unique voice, identifies the user's speech, and then transcribes the speech into text. The underlying machine-learning model builds on the Hidden Markov Model (HMM). The model is trained and tested within this system, and Mel Frequency Cepstral Coefficients (MFCCs), which are widely used in automatic speech recognition, serve as the pre-processing method. The results demonstrate that the system accurately transcribes user speech into text, simplifying the form-filling process significantly. With these results, we hope to demonstrate how this technology has the potential to revolutionize data entry and accessibility, and to establish a strong case for speech recognition as a convenient way to speed up form completion.
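
The abstract does not describe the model in detail, so the following is only a minimal sketch of how HMM-based recognition of isolated words is commonly set up, not the authors' implementation. It assumes each word has its own discrete HMM and that MFCC frames have already been quantized into integer symbols; the names `forward_log_likelihood`, `recognise`, and `MODELS`, and all probabilities, are hypothetical and chosen for illustration.

```python
import math


def forward_log_likelihood(obs, start, trans, emit):
    """Return log P(obs | model) for a discrete HMM (scaled forward algorithm)."""
    n_states = len(start)
    # Initialise with the start distribution times the first emission.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    log_prob = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through the transitions, then weight by the emission.
            alpha = [
                sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][obs[t]]
                for s in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # the model cannot produce this sequence
        # Rescale to avoid underflow; the log of each scale sums to log P(obs).
        alpha = [a / scale for a in alpha]
        log_prob += math.log(scale)
    return log_prob


def recognise(obs, models):
    """Pick the word whose HMM assigns the observation the highest likelihood."""
    return max(models, key=lambda word: forward_log_likelihood(obs, *models[word]))


# Hypothetical two-state, two-symbol word models: (start, transitions, emissions).
# Symbols 0 and 1 stand in for quantized MFCC frames (an assumption).
LEFT_TO_RIGHT = [[0.7, 0.3], [0.0, 1.0]]
MODELS = {
    "yes": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.9, 0.1], [0.1, 0.9]]),
    "no": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.1, 0.9], [0.9, 0.1]]),
}

if __name__ == "__main__":
    print(recognise([0, 0, 1, 1], MODELS))
    print(recognise([1, 1, 0, 0], MODELS))
```

In a form-filling setting, the recognized word would then be written into the active form field. A practical system would more likely use continuous (for example Gaussian) emissions over the MFCC vectors rather than quantized symbols.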

Metadata
Title
Voice Enabled Form Filling Using Hidden Markov Model
Written by
Babu Sallagundla
Bharath Naik Kethavath
Shaik Arshad Hussain Mitaigiri
Siddartha Kata
Kodandaram Sri Satya Sai Merla
Copyright year
2024
DOI
https://doi.org/10.1007/978-3-031-56700-1_18
