Skip to main content
Erschienen in: Mobile Networks and Applications 6/2017

18.05.2017

Speaker Recognition Exploiting D2D Communications Paradigm: Performance Evaluation of Multiple Observations Approaches

verfasst von: Igor Bisio, Fabio Lavagetto, Chiara Garibotto, Andrea Sciarrone

Erschienen in: Mobile Networks and Applications | Ausgabe 6/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The diffusion of Device-to-Device (D2D) communications opens the door to exploit the contributions of multiple Mobile Devices (MDs) to accomplish collaborative tasks. In this paper a speaker recognition algorithm for MDs based on a multiple-observations approach is presented. We propose various fusion and clustering algorithms aimed at efficiently exploiting data coming from MDs. Numerical results show that in many cases our multiple-observation approach is able to significantly improve the accuracy of the considered speaker recognition algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
2.
Zurück zum Zitat Bao HC, Juan ZC (2012) The research of speaker recognition based on gmm and svm. In: 2012 international conference on system science and engineering (ICSSE), pp 373–375 Bao HC, Juan ZC (2012) The research of speaker recognition based on gmm and svm. In: 2012 international conference on system science and engineering (ICSSE), pp 373–375
3.
Zurück zum Zitat Barghi A, Bayani H (2014) Design and impelmentation of a speaker verification system using i-vector and support vector machines. In: 2014 second RSI/ISM international conference on robotics and mechatronics (ICROm), pp 434–439 Barghi A, Bayani H (2014) Design and impelmentation of a speaker verification system using i-vector and support vector machines. In: 2014 second RSI/ISM international conference on robotics and mechatronics (ICROm), pp 434–439
4.
Zurück zum Zitat Bisio I, Delfino A, Lavagetto F, Marchese M, Sciarrone A (2013) Gender-driven emotion recognition through speech signals for ambient intelligence applications. IEEE Trans Emerg Topics Comput 1(2):244–257CrossRef Bisio I, Delfino A, Lavagetto F, Marchese M, Sciarrone A (2013) Gender-driven emotion recognition through speech signals for ambient intelligence applications. IEEE Trans Emerg Topics Comput 1(2):244–257CrossRef
5.
Zurück zum Zitat Bisio I, Lavagetto F, Marchese M, Sciarrone A, Frá C, Valla M (2015) Spectra: A speech processing platform as smartphone application. In: 2015 IEEE international conference on communications (ICC), pp 7030–7035 Bisio I, Lavagetto F, Marchese M, Sciarrone A, Frá C, Valla M (2015) Spectra: A speech processing platform as smartphone application. In: 2015 IEEE international conference on communications (ICC), pp 7030–7035
7.
Zurück zum Zitat Golan SM, Gannot S, Cohen I (2010) Subspace tracking of multiple sources and its application to speakers extraction. In: International conference on acoustics, speech and signal processing, pp 201–204 Golan SM, Gannot S, Cohen I (2010) Subspace tracking of multiple sources and its application to speakers extraction. In: International conference on acoustics, speech and signal processing, pp 201–204
8.
Zurück zum Zitat Hansen JHL, Hasan T (2015) Speaker recognition by machines and humans: a tutorial review. IEEE Signal Proc Mag 32(6):74–99CrossRef Hansen JHL, Hasan T (2015) Speaker recognition by machines and humans: a tutorial review. IEEE Signal Proc Mag 32(6):74–99CrossRef
9.
Zurück zum Zitat Hermansky H (1990) Perceptual linear predictive (plp) analysis of speech. J Acoust Soc Am 87(4):1738–1752CrossRef Hermansky H (1990) Perceptual linear predictive (plp) analysis of speech. J Acoust Soc Am 87(4):1738–1752CrossRef
10.
Zurück zum Zitat Homayounpour MM, Rezaian I (2008) Robust speaker verification based on multi stage vector quantization of mfcc parameters on narrow bandwidth channels. In: International conference on advanced communication technology, vol 1, pp 336–340 Homayounpour MM, Rezaian I (2008) Robust speaker verification based on multi stage vector quantization of mfcc parameters on narrow bandwidth channels. In: International conference on advanced communication technology, vol 1, pp 336–340
11.
Zurück zum Zitat Hsu C-W, Lin C-J (2002) A comparison of methods for multiclass support vector machines. IEEE Trans Neural Netw 13(2):415–425CrossRef Hsu C-W, Lin C-J (2002) A comparison of methods for multiclass support vector machines. IEEE Trans Neural Netw 13(2):415–425CrossRef
12.
Zurück zum Zitat Li H, Ma B, Lee K-A, Sun H, Zhu D, Sim KC, You C, Tong R, Kärkkäinen I, Huang C-L et al (2009) The i4u system in nist 2008 speaker recognition evaluation. In: International conference on acoustics, speech and signal processing, pp 4201–4204 Li H, Ma B, Lee K-A, Sun H, Zhu D, Sim KC, You C, Tong R, Kärkkäinen I, Huang C-L et al (2009) The i4u system in nist 2008 speaker recognition evaluation. In: International conference on acoustics, speech and signal processing, pp 4201–4204
13.
Zurück zum Zitat Liu Y, Fu T, Fan Y, Qian Y, Yu K (2014) Speaker verification with deep features. In: 2014 International joint conference on neural networks (IJCNN), pp 747–753 Liu Y, Fu T, Fan Y, Qian Y, Yu K (2014) Speaker verification with deep features. In: 2014 International joint conference on neural networks (IJCNN), pp 747–753
14.
Zurück zum Zitat McLaren M, van Leeuwen D (2012) Source-norMalized lda for robust speaker recognition using i-vectors from multiple speech sources. IEEE Trans Audio Speech Lang Process 20(3):755–766CrossRef McLaren M, van Leeuwen D (2012) Source-norMalized lda for robust speaker recognition using i-vectors from multiple speech sources. IEEE Trans Audio Speech Lang Process 20(3):755–766CrossRef
15.
Zurück zum Zitat Moattar MH, Homayounpour MM (2009) A simple but efficient real-time voice activity detection algorithm Signal Processing Conference, 2009 17th European, pp 2549–2553 Moattar MH, Homayounpour MM (2009) A simple but efficient real-time voice activity detection algorithm Signal Processing Conference, 2009 17th European, pp 2549–2553
16.
Zurück zum Zitat Reynolds DA, Rose RC (1995) Robust text-independent speaker identification using gaussian mixture speaker models. IEEE Trans Speech Audio Process 3(1):72–83CrossRef Reynolds DA, Rose RC (1995) Robust text-independent speaker identification using gaussian mixture speaker models. IEEE Trans Speech Audio Process 3(1):72–83CrossRef
17.
Zurück zum Zitat Stolcke A, Friedland G, Imseng D (2010) Leveraging speaker diarization for meeting recognition from distant microphones. In: 2010 IEEE International conference on acoustics speech and signal processing (ICASSP), pp 4390–4393 Stolcke A, Friedland G, Imseng D (2010) Leveraging speaker diarization for meeting recognition from distant microphones. In: 2010 IEEE International conference on acoustics speech and signal processing (ICASSP), pp 4390–4393
18.
Zurück zum Zitat Tripathy A, Kumar L, Hegde RM (2012) Robust two dimensional source localization using the music-group delay spectrum International Conference on Signal Processing and Communications (SPCOM), pp 1–5 Tripathy A, Kumar L, Hegde RM (2012) Robust two dimensional source localization using the music-group delay spectrum International Conference on Signal Processing and Communications (SPCOM), pp 1–5
Metadaten
Titel
Speaker Recognition Exploiting D2D Communications Paradigm: Performance Evaluation of Multiple Observations Approaches
verfasst von
Igor Bisio
Fabio Lavagetto
Chiara Garibotto
Andrea Sciarrone
Publikationsdatum
18.05.2017
Verlag
Springer US
Erschienen in
Mobile Networks and Applications / Ausgabe 6/2017
Print ISSN: 1383-469X
Elektronische ISSN: 1572-8153
DOI
https://doi.org/10.1007/s11036-017-0876-z

Weitere Artikel der Ausgabe 6/2017

Mobile Networks and Applications 6/2017 Zur Ausgabe

Neuer Inhalt