Skip to main content
Top

2017 | OriginalPaper | Chapter

Speaker Verification Method Based on Two-Layer GMM-UBM Model in the Complex Environment

Authors : Qiang He, Zhijiang Wan, Haiyan Zhou, Jie Yang, Ning Zhong

Published in: Brain Informatics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In order to improve speaker verification accuracy in the complex environment, a two-layer Gaussian mixture model-universal background model (GMM-UBM) model based on speaker verification method is proposed. For different layer, a GMM-UBM model was trained by different combination of speech features. The voice data of 3 days (36 h) were recorded from the complex environment, and the collected data was manually segmented into four classes: quiet, noise, target speaker and other speaker. Not only the segment data can be used to train GMM-UBM model, but also it can provide a criterion to assess the effectiveness of the model. The results show that the highest recall for the second and third day were 0.75 and 0.74 respectively, and the corresponding specificity were 0.29 and 0.19, which indicates the proposed GMM-UBM model is viable to verify the target speaker in the complex environment.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Chen, J., Zhong, N.: Data-brain modeling for systematic brain informatics. In: Brain Informatics, International Conference, BI, Beijing, China, October 22–24 (2009) Chen, J., Zhong, N.: Data-brain modeling for systematic brain informatics. In: Brain Informatics, International Conference, BI, Beijing, China, October 22–24 (2009)
2.
go back to reference Zhong, N., Motomura, S.: Agent-enriched data mining: a case study in brain informatics. IEEE Intell. Syst. 24, 38–45 (2009)CrossRef Zhong, N., Motomura, S.: Agent-enriched data mining: a case study in brain informatics. IEEE Intell. Syst. 24, 38–45 (2009)CrossRef
3.
go back to reference Li, N., Mak, M.W.: SNR-invariant PLDA modeling in nonparametric subspace for robust speaker verification. IEEE/ACM Trans. Audio Speech/Language Process. 23, 1648–1659 (2015)CrossRef Li, N., Mak, M.W.: SNR-invariant PLDA modeling in nonparametric subspace for robust speaker verification. IEEE/ACM Trans. Audio Speech/Language Process. 23, 1648–1659 (2015)CrossRef
4.
go back to reference Ding, I.J., Yen, C.T., Ou, D.C.: A method to integrate GMM, SVM and DTW for speaker recognition. Int. J. Eng. Technol. Innov. 4, 38–47 (2014) Ding, I.J., Yen, C.T., Ou, D.C.: A method to integrate GMM, SVM and DTW for speaker recognition. Int. J. Eng. Technol. Innov. 4, 38–47 (2014)
5.
go back to reference Wu, H., Wang, Y., Huang, J.: Blind detection of electronic disguised voice. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 3013–3017 (2013) Wu, H., Wang, Y., Huang, J.: Blind detection of electronic disguised voice. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 3013–3017 (2013)
6.
go back to reference Wang, X., Yang, T., Yu, Y., Zhang, R., Guo, F.: Footstep-identification system based on walking interval. Intell. Syst. IEEE 30, 46–52 (2015)CrossRef Wang, X., Yang, T., Yu, Y., Zhang, R., Guo, F.: Footstep-identification system based on walking interval. Intell. Syst. IEEE 30, 46–52 (2015)CrossRef
7.
go back to reference Turner, C., Joseph, A.: A wavelet packet and mel-frequency cepstral coefficients-based feature extraction method for speaker identification. Procedia Comput. Sci. 61, 416–421 (2015)CrossRef Turner, C., Joseph, A.: A wavelet packet and mel-frequency cepstral coefficients-based feature extraction method for speaker identification. Procedia Comput. Sci. 61, 416–421 (2015)CrossRef
8.
go back to reference Haris, B.C., Sinha, R.: Low-Complexity Speaker Verification with Decimated Supervector Representations. Elsevier Science Publishers B. V., Amsterdam (2015) Haris, B.C., Sinha, R.: Low-Complexity Speaker Verification with Decimated Supervector Representations. Elsevier Science Publishers B. V., Amsterdam (2015)
9.
go back to reference Wang, Y., Wu, H., Huang, J.: Verification of Hidden Speaker Behind Transformation Disguised Voices. Academic Press, Inc, Orlando (2015) Wang, Y., Wu, H., Huang, J.: Verification of Hidden Speaker Behind Transformation Disguised Voices. Academic Press, Inc, Orlando (2015)
10.
go back to reference Kanagasundaram, A., Dean, D., Sridharan, S.: Improving PLDA speaker verification with limited development data. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1665–1669 (2014) Kanagasundaram, A., Dean, D., Sridharan, S.: Improving PLDA speaker verification with limited development data. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1665–1669 (2014)
11.
go back to reference Rajan, P., Afanasyev, A., Hautamki, V., Kinnunen, T.: From single to multiple enrollment i-vectors: practical PLDA scoring variants for speaker verification. Digit. Signal Proc. 31, 93–101 (2014)CrossRef Rajan, P., Afanasyev, A., Hautamki, V., Kinnunen, T.: From single to multiple enrollment i-vectors: practical PLDA scoring variants for speaker verification. Digit. Signal Proc. 31, 93–101 (2014)CrossRef
12.
go back to reference Xu, L., Yang, Z.: Speaker identification based on state space model. Int. J. Speech Technol. 19, 1–8 (2016)CrossRef Xu, L., Yang, Z.: Speaker identification based on state space model. Int. J. Speech Technol. 19, 1–8 (2016)CrossRef
13.
go back to reference Rakhmanenko, I., Meshcheryakov, R.: Speech Features Evaluation for Small Set Automatic Speaker Verification Using GMM-UBM System (2016) Rakhmanenko, I., Meshcheryakov, R.: Speech Features Evaluation for Small Set Automatic Speaker Verification Using GMM-UBM System (2016)
14.
go back to reference Sarkar, A.K., Tan, Z.H.: Text Dependent Speaker Verification Using un-supervised HMM-UBM and Temporal GMM-UBM. In: Interspeech (2016) Sarkar, A.K., Tan, Z.H.: Text Dependent Speaker Verification Using un-supervised HMM-UBM and Temporal GMM-UBM. In: Interspeech (2016)
15.
go back to reference Dehak, N., Kenny, P.J., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Language Process. 19, 788–798 (2011)CrossRef Dehak, N., Kenny, P.J., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Language Process. 19, 788–798 (2011)CrossRef
16.
go back to reference Jagtap, S.S., Bhalke, D.G.: Speaker verification using Gaussian mixture model. In: International Conference on Pervasive Computing, pp. 1–5 (2015) Jagtap, S.S., Bhalke, D.G.: Speaker verification using Gaussian mixture model. In: International Conference on Pervasive Computing, pp. 1–5 (2015)
Metadata
Title
Speaker Verification Method Based on Two-Layer GMM-UBM Model in the Complex Environment
Authors
Qiang He
Zhijiang Wan
Haiyan Zhou
Jie Yang
Ning Zhong
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-70772-3_14

Premium Partner