2017 | OriginalPaper | Book chapter

Acoustic Source Localization by Combination of Supervised Direction-of-Arrival Estimation with Disjoint Component Analysis

Authors: Jörn Anemüller, Hendrik Kayser

Published in: Latent Variable Analysis and Signal Separation

Publisher: Springer International Publishing

Abstract

Analysis and processing in reverberant, multi-source acoustic environments encompass a multitude of techniques that estimate from sensor signals a spatially resolved “image” of acoustic space, a high-level representation of physical sources that consolidates several source components into a single sound object, and filter parameters that permit enhancement of target and attenuation of interfering signal components.
The contribution of the present manuscript is a combination of algorithms from the fields of supervised learning, unsupervised subspace decomposition and multi-channel signal enhancement that accomplishes these goals.
Specifically, we propose a system that (1) uses a bank of trained support vector machine classifiers to estimate source activity probability for each spatial position and (2) employs disjoint component analysis (DCA) to obtain from this probabilistic spatial source activity map those components that pertain to individual sound objects. We conclude with a brief outline for (3) estimation of multi-channel filter parameters based on DCA components in order to perform target source enhancement.
We illustrate the proposed method with decomposition results obtained with a four-channel hearing aid geometry setup that comprises two localized sources plus isotropic background noise in an anechoic environment.
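
To make the idea of disjoint components in a spatial source activity map more concrete, the following is a minimal sketch and not the authors' algorithm. The toy map, the helper names `project` and `overlap`, and the overlap measure itself (mean product of absolute activations) are assumptions chosen for illustration; the abstract does not specify the DCA criterion. The sketch only shows that direction weightings which isolate individual sources yield component time courses that overlap far less than weightings which mix them.

```python
# Hypothetical illustration: all names and the overlap measure are assumptions,
# not taken from the paper.

def project(activity_map, weights):
    """Weighted sum over directions for every time frame (frames x directions)."""
    return [sum(w * p for w, p in zip(weights, frame)) for frame in activity_map]

def overlap(a, b):
    """Mean product of absolute activations; 0 means the two never co-occur."""
    return sum(abs(x) * abs(y) for x, y in zip(a, b)) / len(a)

# Toy source activity map: 6 time frames x 4 candidate directions.
# Source A (direction 0) dominates frames 0-2, source B (direction 3) frames 3-5.
activity_map = [
    [0.9, 0.1, 0.0, 0.0],
    [0.8, 0.2, 0.0, 0.0],
    [0.9, 0.0, 0.1, 0.0],
    [0.0, 0.0, 0.1, 0.9],
    [0.0, 0.1, 0.2, 0.8],
    [0.0, 0.0, 0.1, 0.9],
]

# Weightings that group directions belonging to one source each.
separated = (project(activity_map, [1, 1, 0, 0]),
             project(activity_map, [0, 0, 1, 1]))

# Weightings that mix directions of both sources.
mixed = (project(activity_map, [1, 0, 0, 1]),
         project(activity_map, [0, 1, 1, 1]))

overlap_separated = overlap(*separated)
overlap_mixed = overlap(*mixed)

print("overlap (separated):", overlap_separated)
print("overlap (mixed):    ", overlap_mixed)
```

In this toy setting, a decomposition that searches over direction weightings for low overlap would prefer the first pair, which corresponds to one component per sound object.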

Metadata
Title
Acoustic Source Localization by Combination of Supervised Direction-of-Arrival Estimation with Disjoint Component Analysis
Authors
Jörn Anemüller
Hendrik Kayser
Copyright year
2017
DOI
https://doi.org/10.1007/978-3-319-53547-0_10