Skip to main content
Top
Published in: Soft Computing 12/2020

15-10-2019 | Methodologies and Application

Underdetermined blind source separation using CapsNet

Authors: M. Kumar, V. E. Jayanthi

Published in: Soft Computing | Issue 12/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, we consider the problem of separating the speech source signal from the underdetermined convolutive mixture signals using capsule network (CapsNet). The objective of this paper is twofold. They are (1) to improve the underdetermined convolutive blind source separation algorithm in terms of signal-to-distortion ratio, signal-to-interference ratio and signal-to-artifact ratio; (2) to minimize the computational burden of the algorithm so that it is useful for applications like speech recognition system. The time–frequency points of the observed mixture signals are input to the first layer of CapsNet. In the first layer, single-source active point (SSP) is calculated using the ratio of mixtures. These SSPs are lower-level capsules in our system. In the second layer, we find a cluster center using a dynamic routing algorithm and these clusters are used to construct a binary mask. Finally, the algorithm solves the permutation problem by determining the correlation between the amplitudes of adjacent frequency bins. We test our algorithm on the live recording mixture signals obtained in the real environment and synthetically convoluted mixture signals. The test result shows the effectiveness of the proposed method when compared with the existing algorithms in terms of computational load, signal-to-distortion ratio and signal-to-interference ratio.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Araki S et al (2012) The 2011 signal separation evaluation campaign (SiSEC2011): audio source separation. In: Theis F, Cichocki A, Yeredor A, Zibulevsky M (eds) Latent variable analysis and signal separation. LVA/ICA 2012. Lecture notes in computer science, vol 7191. Springer, Berlin, Heidelberg Araki S et al (2012) The 2011 signal separation evaluation campaign (SiSEC2011): audio source separation. In: Theis F, Cichocki A, Yeredor A, Zibulevsky M (eds) Latent variable analysis and signal separation. LVA/ICA 2012. Lecture notes in computer science, vol 7191. Springer, Berlin, Heidelberg
go back to reference Blin A, Araki S, Makino S (2005) Underdetermined blind separation of convolutive mixtures of speech using time–frequency mask and mixing matrix estimation. IEICE Trans Fundam Electron Commun Comput Sci E88A(7):1693–1700CrossRef Blin A, Araki S, Makino S (2005) Underdetermined blind separation of convolutive mixtures of speech using time–frequency mask and mixing matrix estimation. IEICE Trans Fundam Electron Commun Comput Sci E88A(7):1693–1700CrossRef
go back to reference Fevotte C, Gribonval R, Vincent E (2005) BSS_EVAL toolbox user guide—revision 2.0 [Technical Report]: 19 inria-00564760 Fevotte C, Gribonval R, Vincent E (2005) BSS_EVAL toolbox user guide—revision 2.0 [Technical Report]: 19 inria-00564760
Metadata
Title
Underdetermined blind source separation using CapsNet
Authors
M. Kumar
V. E. Jayanthi
Publication date
15-10-2019
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 12/2020
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-04430-4

Other articles of this Issue 12/2020

Soft Computing 12/2020 Go to the issue

Premium Partner