Skip to main content

2016 | OriginalPaper | Buchkapitel

Influence of Noise and Voice Activity Detection on Speaker Verification

verfasst von : Adam Dustor

Erschienen in: Computer Networks

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The scope of this paper is to check influence of voice activity detection VAD procedure and its accuracy on speaker verification error rates. It is shown that for speech of high quality, it is absolutely necessary to remove silence from the signal as the errors increase radically. It is better to remove more than less from the signal as the equal error rate EER is the worst for the original speech with silence. Additionally influence of white noise, which was added to speech utterances, was examined. Presented results show that in order to achieve highly reliable speaker verification system it must be insensitive to low quality of speech, since noise is the most important factor responsible for high error rates.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Dustor, A.: Voice verification based on nonlinear Ho-Kashyap classifier. In: International Conference on Computational Technologies in Electrical and Electronics Engineering SIBIRCON 2008, pp. 296–300. Novosibirsk (2008) Dustor, A.: Voice verification based on nonlinear Ho-Kashyap classifier. In: International Conference on Computational Technologies in Electrical and Electronics Engineering SIBIRCON 2008, pp. 296–300. Novosibirsk (2008)
2.
Zurück zum Zitat Dustor, A., Szwarc, P.: Application of GMM models to spoken language recognition. In: Napieralski, A. (ed.) MIXDES 2009: Proceedings of the 16th International Conference Mixed Design of Integrated Circuits and Systems Lodz, Poland, pp. 603–606 (2009) Dustor, A., Szwarc, P.: Application of GMM models to spoken language recognition. In: Napieralski, A. (ed.) MIXDES 2009: Proceedings of the 16th International Conference Mixed Design of Integrated Circuits and Systems Lodz, Poland, pp. 603–606 (2009)
3.
Zurück zum Zitat Dustor, A.: Speaker verification based on fuzzy classifier. In: Cyran, K.A., Kozielski, S., Peters, J.F., Stańczyk, U., Wakulicz-Deja, A. (eds.) Man-Machine Interactions. AISC, vol. 59, pp. 389–397. Springer, Heidelberg (2009)CrossRef Dustor, A.: Speaker verification based on fuzzy classifier. In: Cyran, K.A., Kozielski, S., Peters, J.F., Stańczyk, U., Wakulicz-Deja, A. (eds.) Man-Machine Interactions. AISC, vol. 59, pp. 389–397. Springer, Heidelberg (2009)CrossRef
4.
Zurück zum Zitat Dustor, A., Szwarc, P.: Spoken language identification based on GMM models. In: Pulka, A., Golonek, T. (eds.) Inetrnational Conference on Signals and Electronic Systems (ICSES 2010): Conference Proceedings, Poland, Gliwice, pp. 105–108 (2010) Dustor, A., Szwarc, P.: Spoken language identification based on GMM models. In: Pulka, A., Golonek, T. (eds.) Inetrnational Conference on Signals and Electronic Systems (ICSES 2010): Conference Proceedings, Poland, Gliwice, pp. 105–108 (2010)
5.
Zurück zum Zitat Dustor, A., Kłosowski, P.: Biometric voice identification based on fuzzy kernel classifier. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2013. CCIS, vol. 370, pp. 456–465. Springer, Heidelberg (2013)CrossRef Dustor, A., Kłosowski, P.: Biometric voice identification based on fuzzy kernel classifier. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2013. CCIS, vol. 370, pp. 456–465. Springer, Heidelberg (2013)CrossRef
6.
Zurück zum Zitat Kłosowski, P., Dustor, A.: Automatic speech segmentation for automatic speech translation. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2013. CCIS, vol. 370, pp. 466–475. Springer, Heidelberg (2013)CrossRef Kłosowski, P., Dustor, A.: Automatic speech segmentation for automatic speech translation. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2013. CCIS, vol. 370, pp. 466–475. Springer, Heidelberg (2013)CrossRef
7.
Zurück zum Zitat Dustor, A., Kłosowski, P., Izydorczyk, J.: Influence of feature dimensionality and model complexity on speaker verification performance. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2014. CCIS, vol. 431, pp. 177–186. Springer, Heidelberg (2014)CrossRef Dustor, A., Kłosowski, P., Izydorczyk, J.: Influence of feature dimensionality and model complexity on speaker verification performance. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2014. CCIS, vol. 431, pp. 177–186. Springer, Heidelberg (2014)CrossRef
8.
Zurück zum Zitat Dustor, A., Klosowski, P., Izydorczyk, J.: Speaker recognition system with good generalization properties. In: 2014 International Conference on Multimedia Computing and Systems (ICMCS), Marrakech, Morocco, pp. 206–210 (2014) Dustor, A., Klosowski, P., Izydorczyk, J.: Speaker recognition system with good generalization properties. In: 2014 International Conference on Multimedia Computing and Systems (ICMCS), Marrakech, Morocco, pp. 206–210 (2014)
9.
Zurück zum Zitat Kłosowski, P., Dustor, A., Izydorczyk, J., Kotas, J., Ślimok, J.: Speech recognition based on open source speech processing software. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2014. CCIS, vol. 431, pp. 308–317. Springer, Heidelberg (2014)CrossRef Kłosowski, P., Dustor, A., Izydorczyk, J., Kotas, J., Ślimok, J.: Speech recognition based on open source speech processing software. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2014. CCIS, vol. 431, pp. 308–317. Springer, Heidelberg (2014)CrossRef
10.
Zurück zum Zitat Dustor, A., Kłosowski, P., Izydorczyk, J., Kopański, R.: Influence of corpus size on speaker verification. In: Gaj, P., Kwiecień, A., Stera, P. (eds.) CN 2015. CCIS, vol. 522, pp. 242–249. Springer, Heidelberg (2015)CrossRef Dustor, A., Kłosowski, P., Izydorczyk, J., Kopański, R.: Influence of corpus size on speaker verification. In: Gaj, P., Kwiecień, A., Stera, P. (eds.) CN 2015. CCIS, vol. 522, pp. 242–249. Springer, Heidelberg (2015)CrossRef
11.
Zurück zum Zitat Kłosowski, P., Dustor, A., Izydorczyk, J.: Speaker verification performance evaluation based on open source speech processing software and TIMIT speech corpus. In: Gaj, P., Kwiecień, A., Stera, P. (eds.) CN 2015. CCIS, vol. 522, pp. 400–409. Springer, Heidelberg (2015)CrossRef Kłosowski, P., Dustor, A., Izydorczyk, J.: Speaker verification performance evaluation based on open source speech processing software and TIMIT speech corpus. In: Gaj, P., Kwiecień, A., Stera, P. (eds.) CN 2015. CCIS, vol. 522, pp. 400–409. Springer, Heidelberg (2015)CrossRef
12.
Zurück zum Zitat Fazel, A., Chakrabartty, S.: An overview of statistical pattern recognition techniques for speaker verification. IEEE Circuits Syst. Mag. 11(2), 62–81 (2011)CrossRef Fazel, A., Chakrabartty, S.: An overview of statistical pattern recognition techniques for speaker verification. IEEE Circuits Syst. Mag. 11(2), 62–81 (2011)CrossRef
13.
Zurück zum Zitat Adamczyk, B., Adamczyk, K., Trawiński, K.: Zasób mowy ROBOT. Biuletyn Instytutu Automatyki i Robotyki WAT 12, 179–192 (2000) Adamczyk, B., Adamczyk, K., Trawiński, K.: Zasób mowy ROBOT. Biuletyn Instytutu Automatyki i Robotyki WAT 12, 179–192 (2000)
Metadaten
Titel
Influence of Noise and Voice Activity Detection on Speaker Verification
verfasst von
Adam Dustor
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-39207-3_18

Premium Partner