nach oben

Wireless Personal Communications

Erschienen in:

24.01.2020

A Wavelet Based Hybrid Threshold Transform Method for Speech Intelligibility and Quality in Noisy Speech Patterns of English Language

verfasst von: Harjeet Kaur Ojhla, Sharada Patil

Erschienen in: Wireless Personal Communications | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

The paper proposes a method to improve the performance of speech communication system in a highly noisy industrial environment. For the improvement, different speech signals are considered which includes signals from different environments such as car noise, railway station, babble noise, street noise which are corrupted with additional noise as input data set for processing. This database is processed using suitable filters which will remove the effect of noise to some extent. Different algorithms have been proposed to minimize the effect of noise to a certain limit. The denoising algorithms are generally the different wavelet thresholding method which removes the noise from the speech signal. Many researchers have worked on soft and hard thresholding for image processing. The proposed method of hybrid thresholding comprises of both soft and hard thresholding process which is comparatively better method than the previous methods. The method can be implemented for the non-stationary noise and it also removes the problems of edges. Unlike the traditional way of using single value, different values are used for the adaptive filtering to remove the edges. During the course of experiments, the dataset of IIIT-H with a set of noisy files from Noizeus and AURORA database having sampling rate of 16 kHz has been used. Results are calculated with subjective and objective measures for fine and broad level quality assessment. SNR, SSNR, PSNR, NRMSE, and PESQ parameters are used as performance parameters and outperform with other combinations as compared to conventional methods. The hybrid threshold method yields better results with significant improvement in speech quality and intelligibility.

Vorheriger Artikel HTMS: Fuzzy Based Hierarchical Trust Management Scheme in WSN

Nächster Artikel An Energy Efficient Stable Clustering Approach Using Fuzzy Type-2 Bat Flower Pollinator for Wireless Sensor Networks

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Singh, S., Tripathy, M., & Anand, R. S. (2015). A wavelet based method for removal of highly non-stationary noises from single channel Hindi speech patterns of low input SNR. International Journal of Speech Technology,18(2), 157–166.CrossRef

Singh, S., Tripathy, M., & Anand, R. S. (2015). Binary mask based method for enhancement of mixed noise speech of low SNR input. International Journal of Speech Technology,18(4), 609–617.CrossRef

Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing,27(2), 113–120.CrossRef

Widrow, B., Glover, J. G. R., & Mccool, J. M. (1975). Adaptive noise cancelling: principles and applications. Proceedings of IEEE,63(12), 1692–1716.CrossRef

Deller, J. R., Hansen, J. H. L., & Proakis, J. G. (2000). Discrete time processing of speech signals. 2nd, IEEE Press, New York.

Ephraim, Y., & Malah, D. (1984). Speech enhancement using a minimum mean square error short time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing,32(6), 1109–1121.CrossRef

Ephraim, Y., & Malah, D. (1985). Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing,23(2), 443–445.CrossRef

Singh, S., Tripathy, M., & Anand, R. S. (2014). Single channel speech enhancement for mixed noise stationary noise environments. Advances in Signal processing and intelligent Recognition Systems,264, 545–555.CrossRef

Singh, S., Tripathy, M., & Anand, R. S. (2017). A wavelet packet based approach for speech enhancement using modulation selection. Wireless Personal Communications,95(4), 4441–4456.CrossRef

10.

Weiss, M., Aschkenasy, E., & Parsons, T. W. (1974). Study and the development of the INTEL technique for improving speech intelligibility. Technical Report NSC-FR/4023, Nicolet Scientific Corporation.

11.

Huang, Y. A., & Benesty, J. (2012). A multi-frame approach to the frequency domain single channel noise reduction problem. IEEE Transactions on Audio, Speech, and Language Processing,20(4), 1256–1269.CrossRef

12.

Loizou, P. C. (2005). Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum. IEEE Transactions on Audio, Speech, and Language Processing,13(5), 857–869.CrossRef

13.

Loizou, P. C., & Kim, G. (2011). Reasons why current speech enhancement algorithms do not improve speech intelligibility and suggest solutions. IEEE Transactions on Audio, Speech, and Language Processing,19(1), 47–56.CrossRef

14.

Singh, S., & Mutawa, A. M. (2016). A wavelet based transform method for quality improvement in noisy speech patterns of Arabic Language. International Journal of Speech Technology, 18(2), 157–166.CrossRef

15.

Kaur, H., & Talwar, R. (2015). Overlapping frame approach to estimate and reduce noise from single channel speech. International Journal of Signal processing, Image Processing and Pattern Recognition,8(4), 49–58.CrossRef

16.

Lollmann, H. W., & Vary, P. (2009). A blind speech enhancement algorithm for the suppression of late reverberation and noise. In IEEE international conference on acoustics, speech and signal processing (pp. 3989–3992).

17.

Ruwei, L., Changchun, B., Bingyin, X., & Jaoshen, J. (2012). Speech enhancement using the combination of adaptive wavelet threshold and spectral subtraction based on wavelet packet decomposition. In International conference on signal processing ICSP (pp. 481–484.

18.

Polikar, R. (1996). The wavelet tutorial. [Internet] [Cited 2017 March 30].

19.

Donoho, D. L. (1995). De-noising by soft thresholding. IEEE Transactions on Information Theory,41(3), 933–936.MathSciNetCrossRef

20.

Hamid, M. E., Molla, M. K. I., Dang, X., & Nakai, T. (2013). Single channel speech enhancement using adaptive soft-thresholding with bivariate EMD. ISRN Signal Processing,2013(2013), 1–9.CrossRef

21.

Aggarwal, R., Singh, J., Gupta, V., & Khare, A. (2011). Elimination of white noise from speech signal using wavelet transform by soft and hard thresholding. IJEECE,1(2), 62–71.

22.

Farouk, M. H. (2018). Application of wavelets in speech processing. Springer briefs in speech technology (2nd ed.). Springer. https://doi.org/10.1007/978-3-319-69002-5.

23.

Hirsch, H. G., & Pearce, D. (2000). The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In ISCA ITRW ASR2000, Paris, France, 18–20, 2000. http://www.utdallas.edu/*loizou/speech/noizeus/.

Titel: A Wavelet Based Hybrid Threshold Transform Method for Speech Intelligibility and Quality in Noisy Speech Patterns of English Language
verfasst von: Harjeet Kaur Ojhla
Sharada Patil
Publikationsdatum: 24.01.2020
Verlag: Springer US
Erschienen in: Wireless Personal Communications / Ausgabe 2/2020
Print ISSN: 0929-6212
Elektronische ISSN: 1572-834X
DOI: https://doi.org/10.1007/s11277-020-07093-9

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Die Gewinner und Laudatoren des Sustainability Award in Automotive 2024/© Uli Regenscheit | ATZlive, Search Icon, Banner Hanser, Sebastian Glenschek/© Hermes International, Dinko Eror/© Red Hat GmbH, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, ATZ-Webinar: Prototypenfreie Entwicklung durch Offline- und Driver-in-the-Loop-HiL-Tests /© (c) VI-grade, chassis.tech plus 2023/© [M] ATZlive / TÜV SÜD PRODUCT SERVICE GMBH, adäsion-Webinar-Matinee/© krystiannawrocki_ Getty Images

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 2/2020

Theoretical Analysis of NOMA Within Massive MIMO Systems

TelMED: Dynamic User Clustering Resource Allocation Technique for MooM Datasets Under Optimizing Telemedicine Network

VLSI Architecture of Block Matching Algorithms for Motion Estimation in High Efficiency Video Coding

Implementation of Reconfigurable Transceiver using GNU Radio and HackRF One

FreeBW-RPL: A New RPL Protocol Objective Function for Internet of Multimedia Things

A Crest Factor Reduction Scheme with Optimum Spacing Peak Cancellation for Intra-band Non-contiguous Carrier Aggregated OFDM Signals

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.