Skip to main content
Erschienen in: Wireless Personal Communications 2/2020

24.01.2020

A Wavelet Based Hybrid Threshold Transform Method for Speech Intelligibility and Quality in Noisy Speech Patterns of English Language

verfasst von: Harjeet Kaur Ojhla, Sharada Patil

Erschienen in: Wireless Personal Communications | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The paper proposes a method to improve the performance of speech communication system in a highly noisy industrial environment. For the improvement, different speech signals are considered which includes signals from different environments such as car noise, railway station, babble noise, street noise which are corrupted with additional noise as input data set for processing. This database is processed using suitable filters which will remove the effect of noise to some extent. Different algorithms have been proposed to minimize the effect of noise to a certain limit. The denoising algorithms are generally the different wavelet thresholding method which removes the noise from the speech signal. Many researchers have worked on soft and hard thresholding for image processing. The proposed method of hybrid thresholding comprises of both soft and hard thresholding process which is comparatively better method than the previous methods. The method can be implemented for the non-stationary noise and it also removes the problems of edges. Unlike the traditional way of using single value, different values are used for the adaptive filtering to remove the edges. During the course of experiments, the dataset of IIIT-H with a set of noisy files from Noizeus and AURORA database having sampling rate of 16 kHz has been used. Results are calculated with subjective and objective measures for fine and broad level quality assessment. SNR, SSNR, PSNR, NRMSE, and PESQ parameters are used as performance parameters and outperform with other combinations as compared to conventional methods. The hybrid threshold method yields better results with significant improvement in speech quality and intelligibility.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Singh, S., Tripathy, M., & Anand, R. S. (2015). A wavelet based method for removal of highly non-stationary noises from single channel Hindi speech patterns of low input SNR. International Journal of Speech Technology,18(2), 157–166.CrossRef Singh, S., Tripathy, M., & Anand, R. S. (2015). A wavelet based method for removal of highly non-stationary noises from single channel Hindi speech patterns of low input SNR. International Journal of Speech Technology,18(2), 157–166.CrossRef
2.
Zurück zum Zitat Singh, S., Tripathy, M., & Anand, R. S. (2015). Binary mask based method for enhancement of mixed noise speech of low SNR input. International Journal of Speech Technology,18(4), 609–617.CrossRef Singh, S., Tripathy, M., & Anand, R. S. (2015). Binary mask based method for enhancement of mixed noise speech of low SNR input. International Journal of Speech Technology,18(4), 609–617.CrossRef
3.
Zurück zum Zitat Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing,27(2), 113–120.CrossRef Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing,27(2), 113–120.CrossRef
4.
Zurück zum Zitat Widrow, B., Glover, J. G. R., & Mccool, J. M. (1975). Adaptive noise cancelling: principles and applications. Proceedings of IEEE,63(12), 1692–1716.CrossRef Widrow, B., Glover, J. G. R., & Mccool, J. M. (1975). Adaptive noise cancelling: principles and applications. Proceedings of IEEE,63(12), 1692–1716.CrossRef
5.
Zurück zum Zitat Deller, J. R., Hansen, J. H. L., & Proakis, J. G. (2000). Discrete time processing of speech signals. 2nd, IEEE Press, New York. Deller, J. R., Hansen, J. H. L., & Proakis, J. G. (2000). Discrete time processing of speech signals. 2nd, IEEE Press, New York.
6.
Zurück zum Zitat Ephraim, Y., & Malah, D. (1984). Speech enhancement using a minimum mean square error short time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing,32(6), 1109–1121.CrossRef Ephraim, Y., & Malah, D. (1984). Speech enhancement using a minimum mean square error short time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing,32(6), 1109–1121.CrossRef
7.
Zurück zum Zitat Ephraim, Y., & Malah, D. (1985). Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing,23(2), 443–445.CrossRef Ephraim, Y., & Malah, D. (1985). Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing,23(2), 443–445.CrossRef
8.
Zurück zum Zitat Singh, S., Tripathy, M., & Anand, R. S. (2014). Single channel speech enhancement for mixed noise stationary noise environments. Advances in Signal processing and intelligent Recognition Systems,264, 545–555.CrossRef Singh, S., Tripathy, M., & Anand, R. S. (2014). Single channel speech enhancement for mixed noise stationary noise environments. Advances in Signal processing and intelligent Recognition Systems,264, 545–555.CrossRef
9.
Zurück zum Zitat Singh, S., Tripathy, M., & Anand, R. S. (2017). A wavelet packet based approach for speech enhancement using modulation selection. Wireless Personal Communications,95(4), 4441–4456.CrossRef Singh, S., Tripathy, M., & Anand, R. S. (2017). A wavelet packet based approach for speech enhancement using modulation selection. Wireless Personal Communications,95(4), 4441–4456.CrossRef
10.
Zurück zum Zitat Weiss, M., Aschkenasy, E., & Parsons, T. W. (1974). Study and the development of the INTEL technique for improving speech intelligibility. Technical Report NSC-FR/4023, Nicolet Scientific Corporation. Weiss, M., Aschkenasy, E., & Parsons, T. W. (1974). Study and the development of the INTEL technique for improving speech intelligibility. Technical Report NSC-FR/4023, Nicolet Scientific Corporation.
11.
Zurück zum Zitat Huang, Y. A., & Benesty, J. (2012). A multi-frame approach to the frequency domain single channel noise reduction problem. IEEE Transactions on Audio, Speech, and Language Processing,20(4), 1256–1269.CrossRef Huang, Y. A., & Benesty, J. (2012). A multi-frame approach to the frequency domain single channel noise reduction problem. IEEE Transactions on Audio, Speech, and Language Processing,20(4), 1256–1269.CrossRef
12.
Zurück zum Zitat Loizou, P. C. (2005). Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum. IEEE Transactions on Audio, Speech, and Language Processing,13(5), 857–869.CrossRef Loizou, P. C. (2005). Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum. IEEE Transactions on Audio, Speech, and Language Processing,13(5), 857–869.CrossRef
13.
Zurück zum Zitat Loizou, P. C., & Kim, G. (2011). Reasons why current speech enhancement algorithms do not improve speech intelligibility and suggest solutions. IEEE Transactions on Audio, Speech, and Language Processing,19(1), 47–56.CrossRef Loizou, P. C., & Kim, G. (2011). Reasons why current speech enhancement algorithms do not improve speech intelligibility and suggest solutions. IEEE Transactions on Audio, Speech, and Language Processing,19(1), 47–56.CrossRef
14.
Zurück zum Zitat Singh, S., & Mutawa, A. M. (2016). A wavelet based transform method for quality improvement in noisy speech patterns of Arabic Language. International Journal of Speech Technology, 18(2), 157–166.CrossRef Singh, S., & Mutawa, A. M. (2016). A wavelet based transform method for quality improvement in noisy speech patterns of Arabic Language. International Journal of Speech Technology, 18(2), 157–166.CrossRef
15.
Zurück zum Zitat Kaur, H., & Talwar, R. (2015). Overlapping frame approach to estimate and reduce noise from single channel speech. International Journal of Signal processing, Image Processing and Pattern Recognition,8(4), 49–58.CrossRef Kaur, H., & Talwar, R. (2015). Overlapping frame approach to estimate and reduce noise from single channel speech. International Journal of Signal processing, Image Processing and Pattern Recognition,8(4), 49–58.CrossRef
16.
Zurück zum Zitat Lollmann, H. W., & Vary, P. (2009). A blind speech enhancement algorithm for the suppression of late reverberation and noise. In IEEE international conference on acoustics, speech and signal processing (pp. 3989–3992). Lollmann, H. W., & Vary, P. (2009). A blind speech enhancement algorithm for the suppression of late reverberation and noise. In IEEE international conference on acoustics, speech and signal processing (pp. 3989–3992).
17.
Zurück zum Zitat Ruwei, L., Changchun, B., Bingyin, X., & Jaoshen, J. (2012). Speech enhancement using the combination of adaptive wavelet threshold and spectral subtraction based on wavelet packet decomposition. In International conference on signal processing ICSP (pp. 481–484. Ruwei, L., Changchun, B., Bingyin, X., & Jaoshen, J. (2012). Speech enhancement using the combination of adaptive wavelet threshold and spectral subtraction based on wavelet packet decomposition. In International conference on signal processing ICSP (pp. 481–484.
18.
Zurück zum Zitat Polikar, R. (1996). The wavelet tutorial. [Internet] [Cited 2017 March 30]. Polikar, R. (1996). The wavelet tutorial. [Internet] [Cited 2017 March 30].
19.
Zurück zum Zitat Donoho, D. L. (1995). De-noising by soft thresholding. IEEE Transactions on Information Theory,41(3), 933–936.MathSciNetCrossRef Donoho, D. L. (1995). De-noising by soft thresholding. IEEE Transactions on Information Theory,41(3), 933–936.MathSciNetCrossRef
20.
Zurück zum Zitat Hamid, M. E., Molla, M. K. I., Dang, X., & Nakai, T. (2013). Single channel speech enhancement using adaptive soft-thresholding with bivariate EMD. ISRN Signal Processing,2013(2013), 1–9.CrossRef Hamid, M. E., Molla, M. K. I., Dang, X., & Nakai, T. (2013). Single channel speech enhancement using adaptive soft-thresholding with bivariate EMD. ISRN Signal Processing,2013(2013), 1–9.CrossRef
21.
Zurück zum Zitat Aggarwal, R., Singh, J., Gupta, V., & Khare, A. (2011). Elimination of white noise from speech signal using wavelet transform by soft and hard thresholding. IJEECE,1(2), 62–71. Aggarwal, R., Singh, J., Gupta, V., & Khare, A. (2011). Elimination of white noise from speech signal using wavelet transform by soft and hard thresholding. IJEECE,1(2), 62–71.
Metadaten
Titel
A Wavelet Based Hybrid Threshold Transform Method for Speech Intelligibility and Quality in Noisy Speech Patterns of English Language
verfasst von
Harjeet Kaur Ojhla
Sharada Patil
Publikationsdatum
24.01.2020
Verlag
Springer US
Erschienen in
Wireless Personal Communications / Ausgabe 2/2020
Print ISSN: 0929-6212
Elektronische ISSN: 1572-834X
DOI
https://doi.org/10.1007/s11277-020-07093-9

Weitere Artikel der Ausgabe 2/2020

Wireless Personal Communications 2/2020 Zur Ausgabe

Neuer Inhalt