Elsevier

Signal Processing

Volume 130, January 2017, Pages 118-128
Signal Processing

A new pre-whitening transform domain LMS algorithm and its application to speech denoising

https://doi.org/10.1016/j.sigpro.2016.06.021Get rights and content

Highlights

  • A novel pre-whitening transform domain LMS algorithm is proposed.

  • This pre-whitening is introduced before applying the transform to reinforce its decorrelation using a simple finite impulse response decorrelation filter of order one.

  • In adaptive speech denoising, the proposed algorithm outperforms the conventional algorithms for different transforms in terms of the computational complexity, mean square error convergence speed, reached steady state level, residual noise in the denoised signal, steady state excess mean square error, misadjustment and output SNR.

Abstract

In this paper, we propose a new pre-whitening transform domain LMS algorithm. The main idea is to introduce a pre-whitening using a simple finite impulse response decorrelation filter of order one before applying the transform to reinforce its decorrelation. The resulting algorithm has the advantage of using any transform even with low decorrelation. This advantage can be exploited to consider transforms having lower computational and structural complexities than those of the classical transforms. For this purpose, we also investigate the use of other transforms, namely the parametric Fourier and Hartley transforms. This investigation is accomplished by studying the eigenvalue spreads obtained by a given parametric transform and then finding the value of the parameter corresponding to the minimum eigenvalue spread, which is equivalent to the best mean square error (MSE) convergence behavior. This approach provides new attractive transforms for the proposed algorithm. Moreover, we consider the adaptive speech denoising as an application to evaluate the performance of the proposed algorithm. The comparisons between the proposed and conventional algorithms for different transforms are performed in terms of the computational complexity, MSE convergence speed, reached steady state level, residual noise in the denoised signal, steady state excess MSE, misadjustment and output SNR.

Introduction

The least mean square (LMS) algorithm is the mostly used in the adaptive filtering for its simplicity and robustness [1]. However, it has a slow convergence in the case of highly correlated input signals [2], [3]. This is due to the fact that the autocorrelation matrix of the input signal has a large eigenvalue spread. To overcome this problem by reducing the eigenvalue spread, whitening or decorrelated adaptive algorithms have been proposed for time-domain LMS [4], [5], [6]. In [4], the authors proposed a joint decorrelation of both the input and error signals. The decorrelation is achieved in the time domain using an adaptive decorrelation filter based on the concept of prediction. The resulting decorrelated normalized LMS (NLMS) structure has the advantage of improving the mean square error (MSE) convergence speed and steady state compared to the conventional LMS and NLMS algorithms. The decorrelation of the input signal can also be achieved by an orthogonal transformation followed by power normalization [7]. These two operations have led to a new configuration named transform-domain LMS (TDLMS) adaptive filters, which outperform the time-domain LMS adaptive filters in terms of the MSE convergence speed and steady state [8], [9], [10]. The orthogonal transforms such as the discrete Fourier transform (DFT), the discrete Hartley transform (DHT) and the discrete cosine transform (DCT) have been used in the TDLMS [7]. The resulting adaptive filters have been named as DFT–LMS, DHT–LMS and DCT–LMS, respectively [11]. The convergence speed of these filters depends on the used transform [8], [11], [12]. In general, the DCT–LMS filter presents a convergence performance better than those of the DFT–LMS and DHT–LMS filters [11]. This is mainly due to the fact that the DCT is suboptimal in terms of decorrelation [13], [14]. Therefore, it is highly desirable to use in the TDLMS adaptive filters a transform having better decorrelation. However, the existing transforms are fixed and hence it is not possible to increase their decorrelation ability. Since the introduction of a decorrelation in the time-domain LMS has brought interesting improvements, it is important to investigate the introduction of such a decorrelation in the TDLMS. However, to the best of author’s knowledge, this investigation has not been reported in the literature.

One of the applications of LMS adaptive filters is speech ;denoising. It is a very crucial operation to enhance the quality and intelligibility of the voice and reduce communication fatigue in modern communication systems such as mobile phones, hands-free telephony and voice-controlled systems, which are generally used in noisy environments [15]. Most of the audio frequency acoustic noises, such as computer fan noise and noise from people and cars, have low-frequency spectra and hence are colored [15], [16]. These noises often corrupt the speech signal, which is also colored. Therefore, it is important to consider colored noises in designing speech denoising techniques.

In this paper, we propose a new pre-whitening TDLMS (PW-TDLMS) algorithm. It maintains the structure of the conventional TDLMS algorithm and introduces a pre-whitenning before applying the transform. This is a novel and interesting strategy to reinforce the decorrelation of the used fixed transform in the TDLMS. The proposed pre-whitening is achieved by using a simple finite impulse response (FIR) decorrelation filter of order one based on a fixed prediction concept. The resulting PW-TDLMS has the advantage of using any transform even with low decorrelation. For this purpose, we investigate the use of other transforms such as the parametric DFT and DHT transforms [17] and study the performance of the proposed PW-TDLMS algorithm in terms of MSE convergence speed and steady state. We carry out this comparative study by considering adaptive speech denoising as an application of the proposed algorithm and show the simulation results of the proposed PW-TDLMS and TDLMS for different transforms. Moreover, we compute the eigenvalue spread of the autocorrelation matrix obtained after applying the parametric DFT or DHT transform and power normalization in the case of highly correlated Markov-1 noise, which is equivalent to the first order autoregressive (AR) process [11], for different values of the independent parameter of the transform. The transform with the value of the parameter corresponding to a good compromise between the eigenvalue spread and computational complexity is selected for the proposed PW-TDLMS. It should be noted that the eigenvalue spreads in the case of the DCT and classical DFT and DHT transforms are known in the literature [9], [11], [18].

The reminder of this paper is organized as follows. Section 2 briefly presents the parametric DFT and DHT. In Section 3, we analyze the TDLMS adaptive filter by considering the stability, steady state and convergence performances in the case of a first order AR process. For the convergence analysis, we review the eigenvalue spreads in the cases of the DFT, DHT and DCT, and then find the eigenvalue spreads in the cases of the parametric DFT and DHT. The proposed PW-TDLMS adaptive filter is developed and compared with the conventional TDLMS adaptive filter in Section 4 in terms of the eigenvalue spreads in the cases of the DCT, DFT, DHT, parametric DFT and DHT transforms. Section 5 presents the computational complexities of the DCT–LMS and proposed parametric DHT-based PW-LMS algorithms. In order to compare the performance of the proposed PW-TDLMS with that of the TDLMS for different transforms, we consider in Section 6 the speech denoising application for the cases of speech-like and real speech signals. The simulation results and comparisons are given therein in terms of the MSE convergence speed, reached steady state level, residual noise in the denoised signal, steady state excess MSE, misadjustment and output SNR. Some concluding remarks are given in Section 7.

Section snippets

Parametric discrete Fourier and Hartley transforms

The three-parameter DFT transform of a complex sequence x(k) of order N=2r, r>3, is defined in [17] asXa,b,c(n)=k=0N1x(k)vFa,b,c(nkmodN),0nN1where vFa,b,c(i), 0iN1, are the entries of the parametric kernel vector given byVFa,b,c=[1VcjV1VcjV]with the vectorV=[WN1WN(N/16)1aWN(N/16)+1WN(N/8)1bWN(N/8)+1WN(3N/16)1ja*WN(3N/16)+1WN(N/4)1],

WN=exp(j(2π/N)), j=1, (.)* denotes the complex conjugate transpose, and a, b and c are three nonzero parameters. The matrix form of (1) can

Transform domain LMS adaptive filter analysis

In the TDLMS adaptive filter presented in Fig. 1, the correlated tap delayed input vector xk=[xk,xk1,,xkN+1]T is transformed into the vector uk=TNxk using a fixed data-independent orthogonal transform matrix TN, where N is the filter length. The resulting vector uk is less correlated then the vector xk. The transformed vector is then power normalized using the diagonal matrix Pk defined in [9], [11], [19] asPk=diag([σk2(i),i=0,1,,N1])where σk2(i)is the power estimate of the ith input uk(i)

Proposed pre-whitening transform domain LMS adaptive filter

In this section, we propose a new pre-whitening TDLMS (PW-TDLMS) adaptive noise canceler by introducing a pre-whitening filter g at the input of the TDLMS as presented in Fig. 4. The purpose of this filter is to obtain a decorrelated version x̃k of the signal xk at the input of TDLMS. This has a direct impact on further reducing the eigenvalue spread of the autocorrelation matrix SN obtained after transformation and power-normalization of the signal x̃k in TDLMS and consequently on accelerating

Computational complexity

According to [24], the computational complexity of the conventional TDLMS (without including the complexity of the used transform) is 6N+1 multiplications and 3N additions for each iteration. The computational complexity of the proposed PW-TDLMS is increased by only one addition and one multiplication due to the first order pre-whitening filter. Therefore, by including the complexity of the transform, the computational complexities of the DCT–LMS and proposed PW-DHTπ/6-LMS are given in Table 3

Simulation results

In this section, we present a set of simulations to test and compare the performance of the proposed PW-TDLMS with that of the TDLMS using the DCT, DFT, DHT, DFT-π/6 and DHT-π/6 transforms. The algorithms are implemented as adaptive noise cancelers as shown in Fig. 4.

Conclusion

In this paper, a new pre-whitening transform domain LMS (PW-TDLMS) algorithm has been developed by introducing a pre-whitening using a simple finite impulse response decorrelation filter of order one before applying the transform to reinforce its decorrelation. It has been shown that the proposed PW-TDLMS algorithm significantly decreases the eigenvalue spreads compared to the conventional TDLMS for all considered transforms, namely the DCT, DFT, DHT, and parametric DFT and DHT. In both the

References (27)

  • M. Mboup et al.

    LMS coupled adaptive prediction and system identification: a statistical model and transient mean analysis

    IEEE Trans. Signal Process.

    (1994)
  • S. Narayan et al.

    Transform domain LMS algorithm

    IEEE Trans. Acoust. Speech Signal Process.

    (1983)
  • F. Beaufays

    Transform-domain adaptive filters: an analytical approach

    IEEE Trans. Signal Process.

    (1995)
  • Cited by (16)

    • On-line harmonic signal denoising from the measurement with non-stationary and non-Gaussian noise

      2022, Signal Processing
      Citation Excerpt :

      Harmonic signals exist in many scenarios, such as mechanical condition monitoring [1,2], helicopter detection [3,4], speech acquisition [5–7], biomedical diagnosis [8], etc.

    • Peak-aware guided filtering for spectrum signal denoising

      2022, Chemometrics and Intelligent Laboratory Systems
      Citation Excerpt :

      The analysis of spectrum signals includes detecting and measuring peak value, reducing noise, improving the resolution of overlapping peak value, compensating instrument pseudo-factor, and decomposing complex signal into its component parts [1]. It is particularly important to identify and process the peak of a signal in a wide range of applications, including shape and the maxima-minima (QRS and ST-T level) in electrocardiogram (ECG) processing [2], seismological research [3], speech signal processing [4,5], and so on. However, signals are usually polluted by noise during data acquisition, storage and transmission, and the noise is larger at the peak [1].

    • Applying segmentation and classification to improve performance of smoothing

      2021, Digital Signal Processing: A Review Journal
      Citation Excerpt :

      For example, in seismological research, the resolution of seismic waveforms will directly affect the reliability of seismic data [1]. In speech signal processing, high-quality speech signals can help improve speech recognition [2,3]. The image restoration is to restore a clear scene from an image degraded by optical system aberration, atmospheric disturbance, motion, defocus, and other noise [4].

    • Wireless signal enhancement based on generative adversarial networks

      2020, Ad Hoc Networks
      Citation Excerpt :

      Linear methods of signal enhancement have been widely used for noise removal due to their relative simplicity. Typical techniques used for signal enhancement in received signals are least mean square (LMS) [2,3] and its variants [4]. However, these algorithms have limited performance on nonlinear signals and cannot achieve a global optimum solution for noise and interference elimination.

    • Enhancing Biomedical ReQA With Adversarial Hard In-Batch Negative Samples

      2023, IEEE/ACM Transactions on Computational Biology and Bioinformatics
    View all citing articles on Scopus
    View full text