2008 | OriginalPaper | Chapter

43. Fundamentals of Noise Reduction

Authors: Dr. Jingdong Chen, Prof. Jacob Benesty, Dr. Yiteng (Arden) Huang, Dr. Eric J. Diethorn

Published in: Springer Handbook of Speech Processing

Publisher: Springer Berlin Heidelberg

Abstract

The existence of noise is inevitable. In all applications that are related to voice and speech, from sound recording, telecommunications, and telecollaboration to human-machine interfaces, the signal of interest that is picked up by a microphone is generally contaminated by noise. As a result, the microphone signal has to be cleaned up with digital signal-processing tools before it is stored, analyzed, transmitted, or played out. The cleaning process, which is often referred to as either noise reduction or speech enhancement, has attracted a considerable amount of research and engineering attention for several decades. Remarkable advances have already been made, and this area is continuing to progress, with the aim of creating processors that can extract the desired speech signal as if there were no noise. This chapter presents a methodical overview of the state of the art of noise-reduction algorithms. Based on their theoretical origin, the algorithms are categorized into three fundamental classes: filtering techniques, spectral restoration, and model-based methods. We outline the basic ideas underlying these approaches, discuss their characteristics, explain their intrinsic relationships, and review their advantages and disadvantages.

Print ISBN: 978-3-540-49125-5

Electronic ISBN: 978-3-540-49127-9

Copyright Year: 2008
DOI: https://doi.org/10.1007/978-3-540-49127-9_43
