Skip to main content

2019 | OriginalPaper | Buchkapitel

Speaker Recognition Using Occurrence Pattern of Speech Signal

verfasst von : Saptarshi Sengupta, Ghazaala Yasmin, Arijit Ghosal

Erschienen in: Recent Trends in Signal and Image Processing

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Speaker recognition is a highly studied area in the field of speech processing. Its application domains are many ranging from the forensic sciences to telephone banking and intelligent voice-driven applications such as answering machines. The area of study of this paper is a sub-field of speaker recognition called speaker identification. A new approach for tackling this problem with the use of one of the most powerful features of audio signals i.e. MFCC is proposed in this paper. Our work also makes use of the concept of co-occurrence matrices and derives statistical measures from it which are incorporated into the proposed feature vector. Finally, we apply a classifier which correctly identifies the person based on their speech sample. The work proposed here is perhaps one of the first to make use of such an arrangement, and results show that it is a highly promising strategy.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Reynolds DA (1995) Automatic speaker recognition using Gaussian mixture speaker models. Lincoln Lab J Reynolds DA (1995) Automatic speaker recognition using Gaussian mixture speaker models. Lincoln Lab J
2.
Zurück zum Zitat Dudeja K, Kharbanda A (2015) Applications of digital signal processing to speech recognition. Int J Res 2(5):191–194 Dudeja K, Kharbanda A (2015) Applications of digital signal processing to speech recognition. Int J Res 2(5):191–194
3.
Zurück zum Zitat XU HH (2015) Text dependent speaker recognition study XU HH (2015) Text dependent speaker recognition study
4.
Zurück zum Zitat Revathi A, Ganapathy R, Venkataramani Y (2009) Text independent speaker recognition and speaker independent speech recognition using iterative clustering approach. Int J Comput Sci Inf Technol (IJCSIT) 1(2):30–42 Revathi A, Ganapathy R, Venkataramani Y (2009) Text independent speaker recognition and speaker independent speech recognition using iterative clustering approach. Int J Comput Sci Inf Technol (IJCSIT) 1(2):30–42
5.
Zurück zum Zitat Reynolds DA, Quatieri TF, Dunn RB (2000) Speaker verification using adapted Gaussian mixture models. Digit Signal Process 10(1–3):19–41CrossRef Reynolds DA, Quatieri TF, Dunn RB (2000) Speaker verification using adapted Gaussian mixture models. Digit Signal Process 10(1–3):19–41CrossRef
6.
Zurück zum Zitat Kua JMK et al (2010) Investigation of spectral centroid magnitude and frequency for speaker recognition. Odyssey 34–39 Kua JMK et al (2010) Investigation of spectral centroid magnitude and frequency for speaker recognition. Odyssey 34–39
7.
Zurück zum Zitat Doddington GR (2001) Speaker recognition based on idiolectal differences between speakers. Interspeech 2521–2524 Doddington GR (2001) Speaker recognition based on idiolectal differences between speakers. Interspeech 2521–2524
8.
Zurück zum Zitat Suraina K, Vig R (2015) A mfcc integrated vector quantization model for speaker recognition. Int J Comput Sci Mob Comput 4(5):294–400 Suraina K, Vig R (2015) A mfcc integrated vector quantization model for speaker recognition. Int J Comput Sci Mob Comput 4(5):294–400
9.
Zurück zum Zitat Paul D, Parekh Ranjan (2011) Automated speech recognition of isolated words using neural networks. Int J Eng Sci Technol (IJEST) 3(6):4993–5000 Paul D, Parekh Ranjan (2011) Automated speech recognition of isolated words using neural networks. Int J Eng Sci Technol (IJEST) 3(6):4993–5000
10.
Zurück zum Zitat Otero PL (2015) Improved strategies for speaker segmentation and emotional state detection Otero PL (2015) Improved strategies for speaker segmentation and emotional state detection
11.
Zurück zum Zitat Campbell JP (1997) Speaker recognition: a tutorial. Proc IEEE 85(9):1437–1462CrossRef Campbell JP (1997) Speaker recognition: a tutorial. Proc IEEE 85(9):1437–1462CrossRef
12.
Zurück zum Zitat Atame S, Shanthi Therese S, Madhuri G (2015) A survey on: continuous voice recognition techniques. Int J Emerg Trends Technol Comput Sci (IJETTCS) 4(3):37–41 Atame S, Shanthi Therese S, Madhuri G (2015) A survey on: continuous voice recognition techniques. Int J Emerg Trends Technol Comput Sci (IJETTCS) 4(3):37–41
13.
Zurück zum Zitat Mermelstein P (1976) Distance measures for speech recognition, psychological and instrumental. Pattern Recog Artif Intell 116:374–388 Mermelstein P (1976) Distance measures for speech recognition, psychological and instrumental. Pattern Recog Artif Intell 116:374–388
14.
Zurück zum Zitat Haralick RM (1979) Statistical and structural approaches to texture. Proc IEEE 67(5):786–804CrossRef Haralick RM (1979) Statistical and structural approaches to texture. Proc IEEE 67(5):786–804CrossRef
15.
Zurück zum Zitat Lartillot O, Toiviainen P, Eerola T (2008) A matlab toolbox for music information retrieval. In: Data analysis, machine learning and applications, pp 261–268CrossRef Lartillot O, Toiviainen P, Eerola T (2008) A matlab toolbox for music information retrieval. In: Data analysis, machine learning and applications, pp 261–268CrossRef
16.
Zurück zum Zitat Perrachione TK (2017) Speaker recognition across languages. Oxford University Press Perrachione TK (2017) Speaker recognition across languages. Oxford University Press
Metadaten
Titel
Speaker Recognition Using Occurrence Pattern of Speech Signal
verfasst von
Saptarshi Sengupta
Ghazaala Yasmin
Arijit Ghosal
Copyright-Jahr
2019
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-8863-6_21

Neuer Inhalt