Skip to main content
main-content
Top

Hint

Swipe to navigate through the articles of this issue

18-04-2018 | Issue 2/2018

International Journal of Speech Technology 2/2018

Singing voice separation using mono-channel mask

Journal:
International Journal of Speech Technology > Issue 2/2018
Authors:
Pallavi P. Ingale, Sanjay L. Nalbalwar

Abstract

Separating singing voice from monaural song recording is a highly difficult task. Still it is important because it has many applications such as singer identification, lyrics recognition, and melody extraction. Difficulty arises due to many musical instruments involved and time-varying spectral overlap between singing voice and music. The goal of singing voice separation is to extract singing voice from the given monaural song recording with minimum artefacts and musical interference. We propose a three stage system for singing voice separation which helps to improve intelligibility and perceptual quality of the separated output. In the first stage, modified sub-harmonic summation algorithm finds pitch of the singing voice and its harmonic components. Here, we create a binary mask. In the second stage, frames i.e. the masked spectral amplitudes are classified as singing and non-singing frames by using a combination of Gammatone frequency cepstral coefficients (GFCC) and Mel-frequency cepstral coefficients (MFCC) features. Lastly, mono-channel mask is created and signal amplitude correction is done using kurtosis measure. We synthesize the estimate of singing voice using both binary mask and mono-channel mask. It is observed that the singing voice separated using mono-channel mask improves the GNSDR score. Performance of the proposed system is compared with the other methods, where it presents excellent improvement in terms of GNSDR. It produces higher GNSDR scores in case of two different datasets.

Please log in to get access to this content

To get access to this content you need the following product:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 69.000 Bücher
  • über 500 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Umwelt
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Testen Sie jetzt 30 Tage kostenlos.

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 50.000 Bücher
  • über 380 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Umwelt
  • Maschinenbau + Werkstoffe




Testen Sie jetzt 30 Tage kostenlos.

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 58.000 Bücher
  • über 300 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Testen Sie jetzt 30 Tage kostenlos.

Literature
About this article

Other articles of this Issue 2/2018

International Journal of Speech Technology 2/2018 Go to the issue