Skip to main content
Top

2013 | OriginalPaper | Chapter

Variable Quantile Level Based Noise Suppression for Robust Speech Recognition

Authors : Kangyeoul Lee, Gil-Jin Jang, Jeong-Sik Park, Ji-Hwan Kim

Published in: Information Technology Convergence

Publisher: Springer Netherlands

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper addresses the issues of single microphone based noise estimation technique for speech recognition in noisy environments. Many researches have been performed on the environmental noise estimation; however, most of them require voice activity detection (VAD) for accurate estimation of noise characteristics. We propose an approach for efficient noise estimation without VAD, aiming at improving the conventional quantile-based noise estimation (QBNE). We fostered the QBNE by adjusting the quantile level according to the relative amount of added noise to the target speech. From the observation that the power spectral density (PSD) of noise is close to the Gaussian distribution, while that of speech is more narrowly populated, the level of additive noise is measured by the selected Gaussianity functions. We compared the proposed method with the conventional QBNE and minimum statistics based method on a simple speech recognition task in various SNR levels. The experimental results show that the proposed method is superior to the conventional methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Boll S (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans Acoust Speech Signal Process 2:113–120CrossRef Boll S (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans Acoust Speech Signal Process 2:113–120CrossRef
2.
go back to reference Martin R (1994) Spectral subtraction based on minimum statistics. In: Proceeding of 7th European signal processing conference, EUSIPCO-94, Edinburgh, Scotland, pp 1182–1185 Martin R (1994) Spectral subtraction based on minimum statistics. In: Proceeding of 7th European signal processing conference, EUSIPCO-94, Edinburgh, Scotland, pp 1182–1185
3.
go back to reference Stahl V, Fischer A, Bippus R (2000) Quantile based noise estimation for spectral subtraction and wiener filtering. In: Proceeding of ICASSP, vol 3, pp 1875–1878 Stahl V, Fischer A, Bippus R (2000) Quantile based noise estimation for spectral subtraction and wiener filtering. In: Proceeding of ICASSP, vol 3, pp 1875–1878
4.
go back to reference Lee T-W, Girolami M, Sejnowski TJ (1999) Independent component analysis using an extended infomax algorithm for mixed subgaussian and supergaussian sources. Neural Comput 11(2):417–441CrossRef Lee T-W, Girolami M, Sejnowski TJ (1999) Independent component analysis using an extended infomax algorithm for mixed subgaussian and supergaussian sources. Neural Comput 11(2):417–441CrossRef
5.
go back to reference Cooke M, Hershey J, Rennie S (2010) Monaural speech separation and recognition challenge. Comput Speech Lang 24(1):1–15CrossRef Cooke M, Hershey J, Rennie S (2010) Monaural speech separation and recognition challenge. Comput Speech Lang 24(1):1–15CrossRef
6.
go back to reference Jang G-J, Cho H-Y (2011) Efficient spectrum estimation of noise using line spectral pairs for robust speech recognition. Electron Lett 47(25):1399–1401CrossRef Jang G-J, Cho H-Y (2011) Efficient spectrum estimation of noise using line spectral pairs for robust speech recognition. Electron Lett 47(25):1399–1401CrossRef
7.
go back to reference Pearce D, Hirsch H (2000) The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy condition. In: Proceeding of INTERSPEECH, pp 29–32 Pearce D, Hirsch H (2000) The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy condition. In: Proceeding of INTERSPEECH, pp 29–32
Metadata
Title
Variable Quantile Level Based Noise Suppression for Robust Speech Recognition
Authors
Kangyeoul Lee
Gil-Jin Jang
Jeong-Sik Park
Ji-Hwan Kim
Copyright Year
2013
Publisher
Springer Netherlands
DOI
https://doi.org/10.1007/978-94-007-6996-0_110