Speech/speaker recognition using a HMM/GMM hybrid model

Rodríguez, Elena; Ruíz, Belén; García-Crespo, Ángel; García, Fernando

doi:10.1007/BFb0016000

Elena Rodríguez¹,
Belén Ruíz¹,
Ángel García-Crespo¹ &
…
Fernando García¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1206))

Included in the following conference series:

International Conference on Audio- and Video-Based Biometric Person Authentication

2812 Accesses
6 Citations

Abstract

In this paper, a speaker recognition voice based system is presented [5]. We have implemented it in a Sun platform.We train (and test) the system using a Database recorded in several sessions in order to repair the huge effects that the speech variability with time has in the recognition rate system. Several experiments have been made in order to achieve the best configuration in the system set up. This is an important point to take into account in a real world system in which users train the system once and the models generated in the training process are not updated for strategic reasons. The recognition rate obtained for the proposed system is around 93% if the speech came from a microphone is around 90% when the speech came from a phone line.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Furui & Sondhi “Advances in Speech Signal Processing”. Ed. MARCEL DEKKER, INC. 1989.
Google Scholar
Reynolds. (95) “Robust Test-Independent Speaker Identification Using Gaussian Mixture Speaker Models”, Speech Communication 17 (1995) 91–108
Google Scholar
Ruiz-Mezcua, Lorenzo-Speranzini, García-Gomez. “Sistema de verificación automática de locutores”, Internal Documente ALCATEL-SESA.
Google Scholar
Ruiz-Mezcua, Gerbolés-Espina, Escrihuela-Langa, Gomez-Mena, Veiga. (92) “Reconocimiento de grandes vocabularios independientes del locutor”. URSI92 Conference.
Google Scholar
Ruiz-Mezcua, Hernadez, Domingo, Rodriguez. “Acceso a servicios multimedia a traves de la voz”. URSI96 Conference.
Google Scholar
Veth& Bourlard “Comparison of Hidden Markov Model techniques for automatic speaker verification in real-world conditions”, Speech Communication 17 (1995) 81–90
Google Scholar

Download references

Author information

Authors and Affiliations

Universidad Carlos III de Madrid, c/ Butarque, 15, 28911, Leganés(Madrid), Spain
Elena Rodríguez, Belén Ruíz, Ángel García-Crespo & Fernando García

Authors

Elena Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Belén Ruíz
View author publications
You can also search for this author in PubMed Google Scholar
Ángel García-Crespo
View author publications
You can also search for this author in PubMed Google Scholar
Fernando García
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Josef Bigün Gérard Chollet Gunilla Borgefors

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rodríguez, E., Ruíz, B., García-Crespo, Á., García, F. (1997). Speech/speaker recognition using a HMM/GMM hybrid model. In: Bigün, J., Chollet, G., Borgefors, G. (eds) Audio- and Video-based Biometric Person Authentication. AVBPA 1997. Lecture Notes in Computer Science, vol 1206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0016000

Download citation

DOI: https://doi.org/10.1007/BFb0016000
Published: 10 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62660-2
Online ISBN: 978-3-540-68425-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics