Skip to main content
Erschienen in: International Journal of Speech Technology 1/2013

01.03.2013

Blind separation of audio signals using trigonometric transforms and Kalman filtering

verfasst von: Mussa M. Ahmed, Fathi E. Abd El-Samie

Erschienen in: International Journal of Speech Technology | Ausgabe 1/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper deals with the problem of blind separation of audio signals from noisy mixtures. It proposes the application of a blind separation algorithm on the Discrete Cosine Transform (DCT) or the Discrete Sine Transform (DST) of the mixed signals, instead of performing the separation on the mixtures in the time domain. Kalman Filtering of the noisy separated signals is recommended in this paper as a post-processing step for noise reduction. Both the DCT and the DST have an energy compaction property, which concentrates most of the signal energy in a few coefficients in the transform domain, leaving the rest of the transform-domain coefficients close to zero. As a result, the separation is performed on a few coefficients in the transform domain. Another advantage of signal separation in transform domains is that the effect of noise on the signals in the transform domains is smaller than that in the time domain due to the averaging effect of the transform equations. The simulation results confirm the effectiveness of transform-domain signal separation and the feasibility of the post-processing Kalman filtering step.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Bozic, S. M. (1994). Digital and Kalman filtering (2Rev ed.). British library cataloguing in publication data. Bozic, S. M. (1994). Digital and Kalman filtering (2Rev ed.). British library cataloguing in publication data.
Zurück zum Zitat Chan, D. C. (1997). Blind Signal Separation. A PhD dissertation, University of Cambridge, January. Chan, D. C. (1997). Blind Signal Separation. A PhD dissertation, University of Cambridge, January.
Zurück zum Zitat Crochiere, R. E., Tribolet, J. E., & Rabiner, L. R. (1980). An interpretation of the log likelihood ratio as a measure of waveform coder performance. IEEE Transactions on Acoustics, Speech, and Signal Processing, Assp-28(3), 318–323. CrossRef Crochiere, R. E., Tribolet, J. E., & Rabiner, L. R. (1980). An interpretation of the log likelihood ratio as a measure of waveform coder performance. IEEE Transactions on Acoustics, Speech, and Signal Processing, Assp-28(3), 318–323. CrossRef
Zurück zum Zitat Curnew, S. R., & How, J. (2007). Blind signal separation in MIMO OFDM systems using ICA and fractional sampling. In Signals, systems and electronics, 2007. ISSSE ’07. International symposium (pp. 67–70). CrossRef Curnew, S. R., & How, J. (2007). Blind signal separation in MIMO OFDM systems using ICA and fractional sampling. In Signals, systems and electronics, 2007. ISSSE ’07. International symposium (pp. 67–70). CrossRef
Zurück zum Zitat Dam, H. H., Nordholm, S., Low, S. Y., & Cantoni, A. (2007). Blind signal separation using steepest descent method. IEEE Transactions on Signal Processing, 55(8), 4198–4207. MathSciNetCrossRef Dam, H. H., Nordholm, S., Low, S. Y., & Cantoni, A. (2007). Blind signal separation using steepest descent method. IEEE Transactions on Signal Processing, 55(8), 4198–4207. MathSciNetCrossRef
Zurück zum Zitat Hammam, H., Abu El-Azm, A. E., Elhalawany, M. E., & Abd El-Samie, F. E. (2010). Blind separation of audio signals using trigonometric transforms and wavelet denoising. Int. J. Speech Technol., 13(1), 1–12. doi:10.1007/s10772-010-9066-0. CrossRef Hammam, H., Abu El-Azm, A. E., Elhalawany, M. E., & Abd El-Samie, F. E. (2010). Blind separation of audio signals using trigonometric transforms and wavelet denoising. Int. J. Speech Technol., 13(1), 1–12. doi:10.​1007/​s10772-010-9066-0. CrossRef
Zurück zum Zitat Kalman, R. E. (1960). A new approach to linear filtering and prediction problems. Trans. ASME J. of Basic Engineering, 35–45. Kalman, R. E. (1960). A new approach to linear filtering and prediction problems. Trans. ASME J. of Basic Engineering, 35–45.
Zurück zum Zitat Kubichek, R. (1993). Mel-cepstral distance measure for objective speech quality assessment. In Proceedings of the IEEE pacific Rim conference on communications, computers and signal processing (pp. 125–128). CrossRef Kubichek, R. (1993). Mel-cepstral distance measure for objective speech quality assessment. In Proceedings of the IEEE pacific Rim conference on communications, computers and signal processing (pp. 125–128). CrossRef
Zurück zum Zitat Manmontri, U., & Naylor, P. A. (2008). A class of Frobenius norm-based algorithms using penalty term and natural gradient for blind signal separation. IEEE Transactions on Audio, Speech, and Language Processing, 16(6), 1181–1193. CrossRef Manmontri, U., & Naylor, P. A. (2008). A class of Frobenius norm-based algorithms using penalty term and natural gradient for blind signal separation. IEEE Transactions on Audio, Speech, and Language Processing, 16(6), 1181–1193. CrossRef
Zurück zum Zitat Moreau, M., Pesquet, J. C., & Thirion-Moreau, N. (2007). Convolutive blind signal separation based on asymmetrical contrast functions. IEEE Transactions on Signal Processing, 55(1), 356–371. MathSciNetCrossRef Moreau, M., Pesquet, J. C., & Thirion-Moreau, N. (2007). Convolutive blind signal separation based on asymmetrical contrast functions. IEEE Transactions on Signal Processing, 55(1), 356–371. MathSciNetCrossRef
Zurück zum Zitat Prochazka, A., Uhlir, J., Rayner, P. J. W., & Kingsbury, N. J. (1998). Signal analysis and prediction. Basel: Birkhauser. MATH Prochazka, A., Uhlir, J., Rayner, P. J. W., & Kingsbury, N. J. (1998). Signal analysis and prediction. Basel: Birkhauser. MATH
Zurück zum Zitat Sakai, Y., & Mitsuhashi, W. (2008). A study on the property of blind source separation for preprocessing of an acoustic echo cancellar. In SICE annual conference (pp. 13–18). CrossRef Sakai, Y., & Mitsuhashi, W. (2008). A study on the property of blind source separation for preprocessing of an acoustic echo cancellar. In SICE annual conference (pp. 13–18). CrossRef
Zurück zum Zitat Szupiluk, R., Wojewnik, P., & Zabkowski, T. (2006). Blind signal separation methods for integration of neural networks results. In Information fusion, 9th international conference (pp. 1–6). CrossRef Szupiluk, R., Wojewnik, P., & Zabkowski, T. (2006). Blind signal separation methods for integration of neural networks results. In Information fusion, 9th international conference (pp. 1–6). CrossRef
Zurück zum Zitat Walker, J. S. (1999). A primer on wavelets and their scientific applications. Boca Raton: CRC Press. MATHCrossRef Walker, J. S. (1999). A primer on wavelets and their scientific applications. Boca Raton: CRC Press. MATHCrossRef
Zurück zum Zitat Wang, S., Sekey, A., & Gersho, A. (1992). An objective measure for predicting subjective quality of speech coders. IEEE Journal on Selected Areas in Communications, 10(5), 819–829. CrossRef Wang, S., Sekey, A., & Gersho, A. (1992). An objective measure for predicting subjective quality of speech coders. IEEE Journal on Selected Areas in Communications, 10(5), 819–829. CrossRef
Zurück zum Zitat Won, Y. G., & Lee, S. Y. (2008). Convolutive blind signal separation by estimating mixing channels in time domain. Electronics Letters, 44(21), 1277–1279. CrossRef Won, Y. G., & Lee, S. Y. (2008). Convolutive blind signal separation by estimating mixing channels in time domain. Electronics Letters, 44(21), 1277–1279. CrossRef
Zurück zum Zitat Yang, W., Benbouchta, M., & Yantorno, R. (1998). Performance of the modified bark spectral distortion as an objective speech quality measure. In Proceedings of the IEEE international conf. on acoustic, speech and signal processing (ICASSP), Washington, USA (Vol. 1, pp. 541–544). Yang, W., Benbouchta, M., & Yantorno, R. (1998). Performance of the modified bark spectral distortion as an objective speech quality measure. In Proceedings of the IEEE international conf. on acoustic, speech and signal processing (ICASSP), Washington, USA (Vol. 1, pp. 541–544).
Metadaten
Titel
Blind separation of audio signals using trigonometric transforms and Kalman filtering
verfasst von
Mussa M. Ahmed
Fathi E. Abd El-Samie
Publikationsdatum
01.03.2013
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 1/2013
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-012-9143-7

Weitere Artikel der Ausgabe 1/2013

International Journal of Speech Technology 1/2013 Zur Ausgabe

Neuer Inhalt