Top

Published in:

2017 | OriginalPaper | Chapter

Speaker Verification Method Based on Two-Layer GMM-UBM Model in the Complex Environment

Authors : Qiang He, Zhijiang Wan, Haiyan Zhou, Jie Yang, Ning Zhong

Published in: Brain Informatics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

In order to improve speaker verification accuracy in the complex environment, a two-layer Gaussian mixture model-universal background model (GMM-UBM) model based on speaker verification method is proposed. For different layer, a GMM-UBM model was trained by different combination of speech features. The voice data of 3 days (36 h) were recorded from the complex environment, and the collected data was manually segmented into four classes: quiet, noise, target speaker and other speaker. Not only the segment data can be used to train GMM-UBM model, but also it can provide a criterion to assess the effectiveness of the model. The results show that the highest recall for the second and third day were 0.75 and 0.74 respectively, and the corresponding specificity were 0.29 and 0.19, which indicates the proposed GMM-UBM model is viable to verify the target speaker in the complex environment.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

previous chapter A Study on Automatic Sleep Stage Classification Based on Clustering Algorithm

next chapter Emotion Recognition from EEG Using Rhythm Synchronization Patterns with Joint Time-Frequency-Space Correlation

Chen, J., Zhong, N.: Data-brain modeling for systematic brain informatics. In: Brain Informatics, International Conference, BI, Beijing, China, October 22–24 (2009)

Zhong, N., Motomura, S.: Agent-enriched data mining: a case study in brain informatics. IEEE Intell. Syst. 24, 38–45 (2009)CrossRef

Li, N., Mak, M.W.: SNR-invariant PLDA modeling in nonparametric subspace for robust speaker verification. IEEE/ACM Trans. Audio Speech/Language Process. 23, 1648–1659 (2015)CrossRef

Ding, I.J., Yen, C.T., Ou, D.C.: A method to integrate GMM, SVM and DTW for speaker recognition. Int. J. Eng. Technol. Innov. 4, 38–47 (2014)

Wu, H., Wang, Y., Huang, J.: Blind detection of electronic disguised voice. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 3013–3017 (2013)

Wang, X., Yang, T., Yu, Y., Zhang, R., Guo, F.: Footstep-identification system based on walking interval. Intell. Syst. IEEE 30, 46–52 (2015)CrossRef

Turner, C., Joseph, A.: A wavelet packet and mel-frequency cepstral coefficients-based feature extraction method for speaker identification. Procedia Comput. Sci. 61, 416–421 (2015)CrossRef

Haris, B.C., Sinha, R.: Low-Complexity Speaker Verification with Decimated Supervector Representations. Elsevier Science Publishers B. V., Amsterdam (2015)

Wang, Y., Wu, H., Huang, J.: Verification of Hidden Speaker Behind Transformation Disguised Voices. Academic Press, Inc, Orlando (2015)

10.

Kanagasundaram, A., Dean, D., Sridharan, S.: Improving PLDA speaker verification with limited development data. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1665–1669 (2014)

11.

Rajan, P., Afanasyev, A., Hautamki, V., Kinnunen, T.: From single to multiple enrollment i-vectors: practical PLDA scoring variants for speaker verification. Digit. Signal Proc. 31, 93–101 (2014)CrossRef

12.

Xu, L., Yang, Z.: Speaker identification based on state space model. Int. J. Speech Technol. 19, 1–8 (2016)CrossRef

13.

Rakhmanenko, I., Meshcheryakov, R.: Speech Features Evaluation for Small Set Automatic Speaker Verification Using GMM-UBM System (2016)

14.

Sarkar, A.K., Tan, Z.H.: Text Dependent Speaker Verification Using un-supervised HMM-UBM and Temporal GMM-UBM. In: Interspeech (2016)

15.

Dehak, N., Kenny, P.J., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Language Process. 19, 788–798 (2011)CrossRef

16.

Jagtap, S.S., Bhalke, D.G.: Speaker verification using Gaussian mixture model. In: International Conference on Pervasive Computing, pp. 1–5 (2015)

Title: Speaker Verification Method Based on Two-Layer GMM-UBM Model in the Complex Environment
Authors: Qiang He
Zhijiang Wan
Haiyan Zhou
Jie Yang
Ning Zhong
Publisher: Springer International Publishing
Book: Brain Informatics
Print ISBN: 978-3-319-70771-6

Electronic ISBN: 978-3-319-70772-3

Copyright Year: 2017
DOI: https://doi.org/10.1007/978-3-319-70772-3_14

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner