Skip to main content

2015 | OriginalPaper | Buchkapitel

An Artificial Neural Networks Model by Using Wavelet Analysis for Speaker Recognition

verfasst von : Kanaka Durga Returi, Y. Radhika

Erschienen in: Information Systems Design and Intelligent Applications

Verlag: Springer India

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

An Artificial Neural Networks Model by using Wavelet Analysis for Speaker Recognition has been presented in this paper. The wavelet analysis was used to extract the features. These extracted features were trained using Artificial Neural Networks with popular Back Propagation Learning Algorithm. In this analysis of testing, the speakers speak out the same set of words, with these set words the features were extracted and fed into the training of the neural network. The neural network notifies the identity of the speaker. In order to test the system, the voice data of the speakers were recorded. The experiments were carried out by using 800 data sets of total 40 individual speakers. For each of these speakers, 20 speech signals were used for training. All these signals were used for training, validation and testing. This approach reveals that the overall performance of system is 95 %.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Daqrouq, K.: Wavelet entropy and neural network for text-independent speaker identification. Eng. Appl. Artif. Intell. 24(5), 796–802 (2011)CrossRef Daqrouq, K.: Wavelet entropy and neural network for text-independent speaker identification. Eng. Appl. Artif. Intell. 24(5), 796–802 (2011)CrossRef
2.
Zurück zum Zitat Adami, A.G., Barone, D.A.C.: A speaker identification system using a model of artificial neural networks for an elevator application. Inf. Sci. 138(1–4), 1–5 (2001) Adami, A.G., Barone, D.A.C.: A speaker identification system using a model of artificial neural networks for an elevator application. Inf. Sci. 138(1–4), 1–5 (2001)
3.
Zurück zum Zitat Kinnunen,T., Sidoroff, I., Tuononen, M., Fränti, P.: Comparison of clustering methods: a case study of text-independent speaker modeling. Patt. Recogn. Lett. 32(13), 1604–1617 (2011) Kinnunen,T., Sidoroff, I., Tuononen, M., Fränti, P.: Comparison of clustering methods: a case study of text-independent speaker modeling. Patt. Recogn. Lett. 32(13), 1604–1617 (2011)
4.
Zurück zum Zitat Ganchev, T.D., Fakotakis, N.D.: Generalized locally recurrent probabilistic neural networks with application to text independent speaker verification. Neurocomputing 70(7–9), 1424–1438 (2007) Ganchev, T.D., Fakotakis, N.D.: Generalized locally recurrent probabilistic neural networks with application to text independent speaker verification. Neurocomputing 70(7–9), 1424–1438 (2007)
5.
Zurück zum Zitat Tomi, K., Lib, H.: An overview of text-independent speaker recognition: from features to supervectors. Speech Commun. 52(1), 12–40 (2010)CrossRef Tomi, K., Lib, H.: An overview of text-independent speaker recognition: from features to supervectors. Speech Commun. 52(1), 12–40 (2010)CrossRef
6.
Zurück zum Zitat Solera-Ureña, R., Martín-Iglesias, D., Gallardo-Antolín, A., Peláez-Moreno, C., Díaz-de-María, F.: Robust ASR using support vector machines. Speech Commun. 49(4), 253–267 (2007)CrossRef Solera-Ureña, R., Martín-Iglesias, D., Gallardo-Antolín, A., Peláez-Moreno, C., Díaz-de-María, F.: Robust ASR using support vector machines. Speech Commun. 49(4), 253–267 (2007)CrossRef
7.
Zurück zum Zitat Huenupán, F., Yoma, N.B., Molina, C., Garretón. C.: Confidence based multiple classifier fusion in speaker verification. Patt. Recogn. Lett. 29(7), 957–966 (2008) Huenupán, F., Yoma, N.B., Molina, C., Garretón. C.: Confidence based multiple classifier fusion in speaker verification. Patt. Recogn. Lett. 29(7), 957–966 (2008)
8.
Zurück zum Zitat Lina, S.-Y., Guhb, R.-S., Shiuec, Y.-R.: Effective recognition of control chart patterns in auto correlated data using a support vector machine based approach. Comput. Ind. Eng. 61(4), 1123–1134 (2011)CrossRef Lina, S.-Y., Guhb, R.-S., Shiuec, Y.-R.: Effective recognition of control chart patterns in auto correlated data using a support vector machine based approach. Comput. Ind. Eng. 61(4), 1123–1134 (2011)CrossRef
9.
Zurück zum Zitat Moattar, M.H., Homayounpour, M.M.: Variational conditional random fields for online speaker detection and tracking. Speech Commun. 54(6), 763–780 (2012)CrossRef Moattar, M.H., Homayounpour, M.M.: Variational conditional random fields for online speaker detection and tracking. Speech Commun. 54(6), 763–780 (2012)CrossRef
10.
Zurück zum Zitat Nemati, S., Basiri, M.E.: Text-independent speaker verification using ant colony optimization-based selected features. Expert Syst. Appl. 38(1), 620–630 (2011) Nemati, S., Basiri, M.E.: Text-independent speaker verification using ant colony optimization-based selected features. Expert Syst. Appl. 38(1), 620–630 (2011)
11.
Zurück zum Zitat Chen, L., Mao, X., Xue, Y., Lee, L.C.: Speech emotion recognition: features and classification models. Digital Signal Process. 22(6), 1154–1160 (2012) Chen, L., Mao, X., Xue, Y., Lee, L.C.: Speech emotion recognition: features and classification models. Digital Signal Process. 22(6), 1154–1160 (2012)
12.
Zurück zum Zitat Gajšek, R., Mihelič, F., Dobrišek, S.: Speaker state recognition using an HMM-based feature extraction method. Comput. Speech Lang. 27(1), 135–150 (2013) Gajšek, R., Mihelič, F., Dobrišek, S.: Speaker state recognition using an HMM-based feature extraction method. Comput. Speech Lang. 27(1), 135–150 (2013)
13.
Zurück zum Zitat Bennani, Y., Gallinari, P.: Neural networks for discrimination and modelization of speakers. Speech Commun. 17(1–2), 159–175 (1995) Bennani, Y., Gallinari, P.: Neural networks for discrimination and modelization of speakers. Speech Commun. 17(1–2), 159–175 (1995)
14.
Zurück zum Zitat Mak, M.W., Allen, W.G., Sexton, G.G.: Speaker identification using multilayer perceptrons and radial basis function networks. Neurocomputing 6(1), 99–117 (1994) Mak, M.W., Allen, W.G., Sexton, G.G.: Speaker identification using multilayer perceptrons and radial basis function networks. Neurocomputing 6(1), 99–117 (1994)
15.
Zurück zum Zitat Thévenaz, P., Hügli, H.: Usefulness of the LPC-residue in text-independent speaker verification. Speech Commun. 17(1–2), 145–157 (1995) Thévenaz, P., Hügli, H.: Usefulness of the LPC-residue in text-independent speaker verification. Speech Commun. 17(1–2), 145–157 (1995)
16.
Zurück zum Zitat Gong, Y.: Speech recognition in noisy environments: a survey. Speech Commun. 16(3), 261–291 (1995)CrossRef Gong, Y.: Speech recognition in noisy environments: a survey. Speech Commun. 16(3), 261–291 (1995)CrossRef
Metadaten
Titel
An Artificial Neural Networks Model by Using Wavelet Analysis for Speaker Recognition
verfasst von
Kanaka Durga Returi
Y. Radhika
Copyright-Jahr
2015
Verlag
Springer India
DOI
https://doi.org/10.1007/978-81-322-2247-7_87

Premium Partner