Blind separation of speech signals based on wavelet transform and independent component analysis

Wu, Xiao; He, Jingjing; Jin, Shijiu; Xu, Antao; Wang, Weikui

doi:10.1007/s12209-010-0022-5

Blind separation of speech signals based on wavelet transform and independent component analysis

Published: 11 March 2010

Volume 16, pages 123–128, (2010)
Cite this article

Transactions of Tianjin University Aims and scope Submit manuscript

Xiao Wu (吴晓)^1,2,
Jingjing He (何静菁)¹,
Shijiu Jin (靳世久)¹,
Antao Xu (徐安桃)² &
…
Weikui Wang (王伟魁)¹

103 Accesses
5 Citations
Explore all metrics

Abstract

Speech signals in frequency domain were separated based on discrete wavelet transform (DWT) and independent component analysis (ICA). First, mixed speech signals were decomposed into different frequency domains by DWT and the subbands of speech signals were separated using ICA in each wavelet domain; then, the permutation and scaling problems of frequency domain blind source separation (BSS) were solved by utilizing the correlation between adjacent bins in speech signals; at last, source signals were reconstructed from single branches. Experiments were carried out with 2 sources and 6 microphones using speech signals at sampling rate of 40 kHz. The microphones were aligned with 2 sources in front of them, on the left and right. The separation of one male and one female speeches lasted 2.5 s. It is proved that the new method is better than single ICA method and the signal to noise ratio is improved by 1 dB approximately.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Proposed Integration Algorithm to Optimize the Separation of Audio Signals Using the ICA and Wavelet Transform

Blind signal separation with Noise Reduction for efficient speaker identification

Article 16 January 2021

Dual-Transform Source Separation Using Sparse Nonnegative Matrix Factorization

Article 23 October 2020

References

Brandstein M, Ward D (editors). Microphone Arrays: Signal Processing Techniques and Applications[M]. Springer-Verlag, Berlin, 2001.
Google Scholar
Park H, Shekhar Dhir C, Oh S et al. A filter bank approach to independent component analysis for convolved mixtures[J]. Neurocomputing, 2006, 69(16–18): 2065–2077.
Article Google Scholar
Makino S. Blind source separation of convolutive mixtures[C]. In: Proceedings of SPIE—The International Society for Optical Engineering. Kissimmee, FL, USA, 2006.
Robledo-Arnuncio E, Juang B. Blind source separation of acoustic mixtures with distributed microphones[C]. In: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP’ 07. Honolulu, HI, USA. 2007. 949–952.
Ukai S, Takatam T, Saruwatari H et al. Multistage SIMOmodel-based blind source separation combining frequencydomain ICA and time-domain ICA[J]. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 2005, E88-A(3): 642–649.
Article Google Scholar
Sawada H, Mukai R, Araki S et al. A robust and precise method for solving the permutation problem of frequencydomain blind source separation[J]. IEEE Transactions on Speech and Audio Processing, 2004, 12(5): 530–538.
Article Google Scholar
Reju V G, Koh S N, Soon I Y. Partial separation method for solving permutation problem in frequency domain blind source separation of speech signals[J]. Neurocomputing, 2008, 71(10–12): 2098–2112.
Article Google Scholar
Li Wanlong, Ju L, Du Jun et al. Solving permutation problem in frequency-domain blind source separation using microphone sub-arrays[C]. In: IEEE International Conference Neural Networks and Signal Processing, ICNNSP. Zhejiang, China, 2008. 67–72.
Rennie S J, Aarabi P, Frey B J. Variational probabilistic speech separation using microphone arrays[J]. IEEE Transactions on Audio, Speech and Language Processing, 2007, 15(1): 135–149.
Article Google Scholar
Makino S, Sawada H, Mukai R et al. Blind source separa tion of convolutive mixtures of speech in frequency domain[J]. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 2005, E88-A(7): 1640–1654.
Article Google Scholar
Prasad R, Saruwatari H, Shikano K. Effect of central limit theorem non-compliance on blind separation of speech by negentropy maximization[J]. Speech Communication, 2005, 26(6): 511–522.
Google Scholar
Hyvarinen A. Fast and robust fixed-point algorithms for independent component analysis[J]. IEEE Transactions on Neural Networks, 1999, 10(3): 626–634.
Article Google Scholar
Hyvarinen A, Oja E. Independent component analysis: Algorithms and applications[J]. Neural Networks, 2000, 13(4/5): 411–430.
Article Google Scholar
Nishikawa T, Abe H, Saruwatari H et al. Overdetermined blind separation for real convolutive mixtures of speech based on multistage ICA using subarray processing[J]. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 2004, E87-A(8): 1924–1932.
Google Scholar
Saruwatari H, Kawamura T, Nishikawa T et al. Fast-convergence algorithm for blind source separation based on array signal processing[J]. IEEE Workshop on Statistical Signal Processing Proceedings, 2003, E86-A(3): 634–639.
Google Scholar
Sawada H, Mukai R, Araki S et al. A robust approach to the permutation problem of frequency-domain blind source separation[C]. In: IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings. Hongkong, China, 2003. 381–384.
Murata N, Ikeda S, Ziehe A. An approach to blind source separation based on temporal structure of speech signals[J]. Neurocomputing, 2001, 41: 1–24.
Article MATH Google Scholar
Mukai R, Sawada H, de la Kethulle de Ryhove S et al. Array geometry arrangement for frequency domain blind source separation[C]. In: International Workshop on Acoustic Echo and Noise Control (IWAENC2003). Kyoto, Japan, 2003. 219–222.
Sawada H, Mukai R, Araki S et al. A robust and precise method for solving the permutation problem of frequencydomain blind source separation[J]. IEEE Transactions on Speech and Audio Processing, 2004, 12(5): 530–538.
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, Tianjin, 300072, China
Xiao Wu (吴晓), Jingjing He (何静菁), Shijiu Jin (靳世久) & Weikui Wang (王伟魁)
Department of Automotive Engineering, Military Transportation Institute of Tianjin, Tianjin, 300161, China
Xiao Wu (吴晓) & Antao Xu (徐安桃)

Authors

Xiao Wu (吴晓)
View author publications
You can also search for this author in PubMed Google Scholar
Jingjing He (何静菁)
View author publications
You can also search for this author in PubMed Google Scholar
Shijiu Jin (靳世久)
View author publications
You can also search for this author in PubMed Google Scholar
Antao Xu (徐安桃)
View author publications
You can also search for this author in PubMed Google Scholar
Weikui Wang (王伟魁)
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shijiu Jin (靳世久).

Additional information

Supported by Tianjin Municipal Science and Technology Commission (No.09JCYBJC02200).

WU Xiao, born in 1979, male, doctorate student.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, X., He, J., Jin, S. et al. Blind separation of speech signals based on wavelet transform and independent component analysis. Trans. Tianjin Univ. 16, 123–128 (2010). https://doi.org/10.1007/s12209-010-0022-5

Download citation

Accepted: 14 May 2009
Published: 11 March 2010
Issue Date: April 2010
DOI: https://doi.org/10.1007/s12209-010-0022-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Blind separation of speech signals based on wavelet transform and independent component analysis

Abstract

Access this article

Similar content being viewed by others

Proposed Integration Algorithm to Optimize the Separation of Audio Signals Using the ICA and Wavelet Transform

Blind signal separation with Noise Reduction for efficient speaker identification

Dual-Transform Source Separation Using Sparse Nonnegative Matrix Factorization

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Blind separation of speech signals based on wavelet transform and independent component analysis

Abstract

Access this article

Similar content being viewed by others

Proposed Integration Algorithm to Optimize the Separation of Audio Signals Using the ICA and Wavelet Transform

Blind signal separation with Noise Reduction for efficient speaker identification

Dual-Transform Source Separation Using Sparse Nonnegative Matrix Factorization

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation