Skip to main content
Top

2019 | OriginalPaper | Chapter

A Threshold Denoising Algorithm Based on Mathematical Morphology for Speech Enhancement

Authors : Guangyan Li, Caixia Zheng, Tingfa Xu, Xiaolin Cao, Mao Xingpeng, Shuangwei Wang

Published in: Communications, Signal Processing, and Systems

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The presence of noise in speech signals can significantly degrade the performance of speech recognition systems. A threshold denoising method based on mathematical morphology is proposed to reduce background white noise. In the method we consider speech spectrograms as images and construct binary images from a normalized 256-level gray scale spectrogram image. We take advantage of a sudden slowing in the average value (ratio of the number of ‘1’ pixels to the total pixel number) of the binary image, and use it as the threshold value to zero spectrogram elements below the threshold, normalize the spectrogram, and finally, reconstruct the original speech signal to achieve the goal of speech enhancement. The main advantage of the algorithm is fast speed that is highly desired in real-time speech processing.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Ajmera, P.K., Jadhav, D.V., Holambe, R.S.: Text-independent speaker identification using Radon and discrete cosine transforms based features from speech spectrogram. Pattern Recogn. 44, 2749–2759 (2011) Ajmera, P.K., Jadhav, D.V., Holambe, R.S.: Text-independent speaker identification using Radon and discrete cosine transforms based features from speech spectrogram. Pattern Recogn. 44, 2749–2759 (2011)
go back to reference Alsteris, L.D., Paliwal, K.K.: Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra. Comput. Speech Lang. 21, 174–186 (2007) Alsteris, L.D., Paliwal, K.K.: Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra. Comput. Speech Lang. 21, 174–186 (2007)
go back to reference Berouti, M., Schwartz, R., Makhoul, J.: Enhancement of speech corrupted by acoustic noise. In: IEEE, pp. 208–211 (1979) Berouti, M., Schwartz, R., Makhoul, J.: Enhancement of speech corrupted by acoustic noise. In: IEEE, pp. 208–211 (1979)
go back to reference Boll, S.F.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. 27, 113–120 (1979) Boll, S.F.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. 27, 113–120 (1979)
go back to reference Cohen, L.: Time-frequency distributions - a review. Proc. IEEE 77, 941–981 (1989) Cohen, L.: Time-frequency distributions - a review. Proc. IEEE 77, 941–981 (1989)
go back to reference Dennis, J., Tran, H.D., Li, H.: Spectrogram image feature for sound event classification in mismatched conditions. IEEE Signal Process. Lett. 18, 130–133 (2011) Dennis, J., Tran, H.D., Li, H.: Spectrogram image feature for sound event classification in mismatched conditions. IEEE Signal Process. Lett. 18, 130–133 (2011)
go back to reference Mallawaarachchi, A., Ong, S.H., Chitre, M., Taylor, E.: Spectrogram denoising and automated extraction of the fundamental frequency variation of dolphin whistles. J. Acoust. Soc. Am. 124, 1159–1170 (2008) Mallawaarachchi, A., Ong, S.H., Chitre, M., Taylor, E.: Spectrogram denoising and automated extraction of the fundamental frequency variation of dolphin whistles. J. Acoust. Soc. Am. 124, 1159–1170 (2008)
go back to reference Pinkowski, B.: Principal component analysis of speech spectrogram images. Pattern Recogn. 30, 777–787 (1997) Pinkowski, B.: Principal component analysis of speech spectrogram images. Pattern Recogn. 30, 777–787 (1997)
go back to reference Soille, P.: Morphological image analysis: principles and applications. Springer Science & Business Media, Heidelberg (2013) Soille, P.: Morphological image analysis: principles and applications. Springer Science & Business Media, Heidelberg (2013)
go back to reference Steinberg, R., Shaughnessy, D.O.: Segmentation of a speech spectrogram using mathematical morphology. In: IEEE, pp. 1637–1640 (2008) Steinberg, R., Shaughnessy, D.O.: Segmentation of a speech spectrogram using mathematical morphology. In: IEEE, pp. 1637–1640 (2008)
go back to reference Xu, H., Tan, Z.-H., Dalsgaard, P., Lindberg, B.: Robust speech recognition by nonlocal means denoising processing. IEEE Signal Process. Lett. 15, 701–704 (2008) Xu, H., Tan, Z.-H., Dalsgaard, P., Lindberg, B.: Robust speech recognition by nonlocal means denoising processing. IEEE Signal Process. Lett. 15, 701–704 (2008)
Metadata
Title
A Threshold Denoising Algorithm Based on Mathematical Morphology for Speech Enhancement
Authors
Guangyan Li
Caixia Zheng
Tingfa Xu
Xiaolin Cao
Mao Xingpeng
Shuangwei Wang
Copyright Year
2019
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-6571-2_215