2008 | OriginalPaper | Chapter

43. Fundamentals of Noise Reduction

Authors: Jingdong Chen, Dr., Jacob Benesty, Prof., Yiteng (Arden) Huang, Dr., Eric J. Diethorn, Dr.

Published in: Springer Handbook of Speech Processing

Publisher: Springer Berlin Heidelberg

Abstract

The existence of noise is inevitable. In all applications that are related to voice and speech, from sound recording, telecommunications, and telecollaboration to human-machine interfaces, the signal of interest that is picked up by a microphone is generally contaminated by noise. As a result, the microphone signal has to be cleaned up with digital signal-processing tools before it is stored, analyzed, transmitted, or played out. The cleaning process, which is often referred to as either noise reduction or speech enhancement, has attracted a considerable amount of research and engineering attention for several decades. Remarkable advances have already been made, and this area is continuing to progress, with the aim of creating processors that can extract the desired speech signal as if no noise were present. This chapter presents a methodical overview of the state of the art of noise-reduction algorithms. Based on their theoretical origin, the algorithms are categorized into three fundamental classes: filtering techniques, spectral restoration, and model-based methods. We outline the basic ideas underlying these approaches, discuss their characteristics, explain their intrinsic relationships, and review their advantages and disadvantages.

Metadata
Title
Fundamentals of Noise Reduction
Authors
Jingdong Chen, Dr.
Jacob Benesty, Prof.
Yiteng (Arden) Huang, Dr.
Eric J. Diethorn, Dr.
Copyright Year
2008
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-540-49127-9_43