Skip to main content
Erschienen in: International Journal of Speech Technology 3/2022

30.04.2021

RETRACTED ARTICLE: Nonlinear acoustic noise cancellation based automatic speech recognition system (NANC-ASR) with convolutional neural networks

verfasst von: Rabie A. Ramadan, Kusum Yadav

Erschienen in: International Journal of Speech Technology | Ausgabe 3/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Automatic Speech Recognition (ASR) is a self-governing, computer-based spoken language transcript for real-time applications. It is used in various real time applications and it listens the speech signals through a microphone, identifies the words, and assists a network in converting the written text. When we use the ASR system in multiple environments there is a possibility of ambient noise captured by a microphone unit and ASR system doesn’t predicting correct words. The Non-linear Acoustic Noise Cancellation (NANC) approach based automatic speech recognition method focused on the properties of non-linear sound noise cancellation. There are several distinct small segments in this approach, such as speech signal sounds, syllables, and so on. As an acrylic symbol associated with organs, these units analyze syllables to find acoustic properties of speech signals. This experimental study has adopted Convolutional Neural Network (CNN) based noise reduction in the speech recognition system with an accuracy of 98.5%. Finally, a speech signal has been identified through the ASR's vocabulary, which has been obtained with correct words after all phonetic signs are present.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Ambalika & Saini, E. S. (2016) Performance comparison of speech enhancement algorithms using different parameters. International Journal of Advanced Research in Electronics and Communication Engineering, 5(7), 21-31 Ambalika & Saini, E. S. (2016) Performance comparison of speech enhancement algorithms using different parameters. International Journal of Advanced Research in Electronics and Communication Engineering, 5(7), 21-31
Zurück zum Zitat Anusuya, A. M., & Katti, K. S. (2010). Speech recognition by machine, a review. International Journal of Computer Science and Information Security, 6(3), 321–335 Anusuya, A. M., & Katti, K. S. (2010). Speech recognition by machine, a review. International Journal of Computer Science and Information Security, 6(3), 321–335
Zurück zum Zitat Aranda-Uribe, O., Nakano-Miyatake, M., & Perez-Meana, H. (2006). Environmental sounds recognition. Telecommunications and Radio Engineering, 65(3), 271–279CrossRef Aranda-Uribe, O., Nakano-Miyatake, M., & Perez-Meana, H. (2006). Environmental sounds recognition. Telecommunications and Radio Engineering, 65(3), 271–279CrossRef
Zurück zum Zitat Avci, E. (2007). An automatic system for Turkish word recognition using Discrete Wavelet Neural Network based on adaptive entropy. Arabian Journal for Science and Engineering, 32(3), 239–250 Avci, E. (2007). An automatic system for Turkish word recognition using Discrete Wavelet Neural Network based on adaptive entropy. Arabian Journal for Science and Engineering, 32(3), 239–250
Zurück zum Zitat Boll, S. F. (2018). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics on Speech and Signal Processing, 27(2), 113–120CrossRef Boll, S. F. (2018). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics on Speech and Signal Processing, 27(2), 113–120CrossRef
Zurück zum Zitat Chattopadhyay, S., Pratihar, D., & Sarkar, S. (2011). A comparative study of fuzzy c-means algorithm and entropy-based fuzzy clustering algorithms. Computing and Informatics, 30(2), 701–720MATH Chattopadhyay, S., Pratihar, D., & Sarkar, S. (2011). A comparative study of fuzzy c-means algorithm and entropy-based fuzzy clustering algorithms. Computing and Informatics, 30(2), 701–720MATH
Zurück zum Zitat Chi, H. F., Gao, S. X., Soli, S. D., & Alwan, A. (2003). Band-limited feedback cancellation with a modified filtered-X LMS algorithm for hearing aids. Speech Communication, 39(1), 147–161CrossRefMATH Chi, H. F., Gao, S. X., Soli, S. D., & Alwan, A. (2003). Band-limited feedback cancellation with a modified filtered-X LMS algorithm for hearing aids. Speech Communication, 39(1), 147–161CrossRefMATH
Zurück zum Zitat Choudhary, A. (2013a). Automatic speech recognition system for isolated & connected words of Hindi language by using hidden Markov model toolkit. Association of Computer Electronics and Electrical Engineers, 2(4), 38–48 Choudhary, A. (2013a). Automatic speech recognition system for isolated & connected words of Hindi language by using hidden Markov model toolkit. Association of Computer Electronics and Electrical Engineers, 2(4), 38–48
Zurück zum Zitat Choudhary, A., Chauhan, M. R. & Gupta, M. G. (2013). Automatic speech recognition system for isolated and connected words of Hindi language by using hidden Markov Model Toolkit (HTK). In Proceedings of international conference on emerging trends in engineering and technology (pp 244–252) Choudhary, A., Chauhan, M. R. & Gupta, M. G. (2013). Automatic speech recognition system for isolated and connected words of Hindi language by using hidden Markov Model Toolkit (HTK). In Proceedings of international conference on emerging trends in engineering and technology (pp 244–252)
Zurück zum Zitat Chougule, A. (2014). Survey of noise estimation algorithms for speech enhancement using spectral subtraction. International Journal on Recent and Innovation Trends in Computing and Communication, 2(12), 345–358 Chougule, A. (2014). Survey of noise estimation algorithms for speech enhancement using spectral subtraction. International Journal on Recent and Innovation Trends in Computing and Communication, 2(12), 345–358
Zurück zum Zitat Cohen, I., & Baruch, B. (2009). Speech enhancement for non-stationary noise environments. Signal Processing, 81(2), 2403–2418MATH Cohen, I., & Baruch, B. (2009). Speech enhancement for non-stationary noise environments. Signal Processing, 81(2), 2403–2418MATH
Zurück zum Zitat Cohen, I., & Berdugo, B. (2002). Noise estimation by minima controlled recursive averaging for robust speech enhancement. Signal Processing Letters, 9(3), 12–15CrossRef Cohen, I., & Berdugo, B. (2002). Noise estimation by minima controlled recursive averaging for robust speech enhancement. Signal Processing Letters, 9(3), 12–15CrossRef
Zurück zum Zitat El-Emary, I. M., Fezari, M., & Attoui, H. (2011). Hidden Markov model/Gaussian mixture models (HMM/GMM) based voice command system: A way to improve the control of remotely operated robot arm TR45. Scientific Research and Essays, 6(2), 341–350 El-Emary, I. M., Fezari, M., & Attoui, H. (2011). Hidden Markov model/Gaussian mixture models (HMM/GMM) based voice command system: A way to improve the control of remotely operated robot arm TR45. Scientific Research and Essays, 6(2), 341–350
Zurück zum Zitat Elminir, H. K. (2012). Evaluation of different feature extraction techniques for continuous speech recognition. International Journal of Science and Technology, 2(2), 346–357 Elminir, H. K. (2012). Evaluation of different feature extraction techniques for continuous speech recognition. International Journal of Science and Technology, 2(2), 346–357
Zurück zum Zitat Nikita, B., & Prajakta, C. (2017). Speech enhancement based on spectral subtraction involving magnitude and phase components. International Research Journal of Engineering and Technology, 4(3), 2483–2488 Nikita, B., & Prajakta, C. (2017). Speech enhancement based on spectral subtraction involving magnitude and phase components. International Research Journal of Engineering and Technology, 4(3), 2483–2488
Zurück zum Zitat Venkateswarlu, S. C., Reddy, A. S. R., & Prasad, K. S. (2014). Speech enhancement using Boll’s spectral subtraction method based on Gaussian window. Global Journal of Researches in Engineering: Electrical and Electronics Engineering, 14(6), 201–212 Venkateswarlu, S. C., Reddy, A. S. R., & Prasad, K. S. (2014). Speech enhancement using Boll’s spectral subtraction method based on Gaussian window. Global Journal of Researches in Engineering: Electrical and Electronics Engineering, 14(6), 201–212
Metadaten
Titel
RETRACTED ARTICLE: Nonlinear acoustic noise cancellation based automatic speech recognition system (NANC-ASR) with convolutional neural networks
verfasst von
Rabie A. Ramadan
Kusum Yadav
Publikationsdatum
30.04.2021
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 3/2022
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-021-09848-6

Weitere Artikel der Ausgabe 3/2022

International Journal of Speech Technology 3/2022 Zur Ausgabe

Neuer Inhalt