Skip to main content

Speech/speaker recognition using a HMM/GMM hybrid model

  • Text-dependent Speaker Authentication
  • Conference paper
  • First Online:
Audio- and Video-based Biometric Person Authentication (AVBPA 1997)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1206))

Abstract

In this paper, a speaker recognition voice based system is presented [5]. We have implemented it in a Sun platform.We train (and test) the system using a Database recorded in several sessions in order to repair the huge effects that the speech variability with time has in the recognition rate system. Several experiments have been made in order to achieve the best configuration in the system set up. This is an important point to take into account in a real world system in which users train the system once and the models generated in the training process are not updated for strategic reasons. The recognition rate obtained for the proposed system is around 93% if the speech came from a microphone is around 90% when the speech came from a phone line.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Furui & Sondhi “Advances in Speech Signal Processing”. Ed. MARCEL DEKKER, INC. 1989.

    Google Scholar 

  2. Reynolds. (95) “Robust Test-Independent Speaker Identification Using Gaussian Mixture Speaker Models”, Speech Communication 17 (1995) 91–108

    Google Scholar 

  3. Ruiz-Mezcua, Lorenzo-Speranzini, García-Gomez. “Sistema de verificación automática de locutores”, Internal Documente ALCATEL-SESA.

    Google Scholar 

  4. Ruiz-Mezcua, Gerbolés-Espina, Escrihuela-Langa, Gomez-Mena, Veiga. (92) “Reconocimiento de grandes vocabularios independientes del locutor”. URSI92 Conference.

    Google Scholar 

  5. Ruiz-Mezcua, Hernadez, Domingo, Rodriguez. “Acceso a servicios multimedia a traves de la voz”. URSI96 Conference.

    Google Scholar 

  6. Veth& Bourlard “Comparison of Hidden Markov Model techniques for automatic speaker verification in real-world conditions”, Speech Communication 17 (1995) 81–90

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Josef Bigün Gérard Chollet Gunilla Borgefors

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Rodríguez, E., Ruíz, B., García-Crespo, Á., García, F. (1997). Speech/speaker recognition using a HMM/GMM hybrid model. In: Bigün, J., Chollet, G., Borgefors, G. (eds) Audio- and Video-based Biometric Person Authentication. AVBPA 1997. Lecture Notes in Computer Science, vol 1206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0016000

Download citation

  • DOI: https://doi.org/10.1007/BFb0016000

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-62660-2

  • Online ISBN: 978-3-540-68425-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics