Skip to main content

2016 | OriginalPaper | Buchkapitel

Probabilistic Prediction for Text-Prompted Speaker Verification Capable of Accepting Spoken Words with the Same Meaning but Different Pronunciations

verfasst von : Shota Sakashita, Satoshi Takeguchi, Kazuya Matsuo, Shuichi Kurogi

Erschienen in: Neural Information Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

So far, we have presented a method of probabilistic prediction using GEBI (Gibbs-distribution based Bayesian inference) for flexible text-prompted speaker verification. For more flexible and practical verification, this paper presents a method of verification capable of accepting spoken words with the same meaning but different pronunciations. For example, Japanese language has different pronunciations for a digit, such as /yon/ and /shi/ for 4, /nana/ and /shichi/ for 7, which are usually uttered via unintentional selection, and then it is a practical problem in speech verification of words involving digits, such as ID numbers. With several assumptions, we present a modification of GEBI for dealing with such words. By means of numerical experiments using recorded real speech data, we examine the properties of the present method and show the validity and the effectiveness.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Kurogi, S., Sakashita, S., Takeguchi, S., Ueki, T., Matsuo, K.: Probabilistic prediction in multiclass classification derived for flexible text-prompted speaker verification. In: Arik, S., Huang, T., Lai, W.K., Liu, Q. (eds.) ICONIP 2015. LNCS, vol. 9489, pp. 216–225. Springer, Heidelberg (2015). doi:10.1007/978-3-319-26532-2_24 CrossRef Kurogi, S., Sakashita, S., Takeguchi, S., Ueki, T., Matsuo, K.: Probabilistic prediction in multiclass classification derived for flexible text-prompted speaker verification. In: Arik, S., Huang, T., Lai, W.K., Liu, Q. (eds.) ICONIP 2015. LNCS, vol. 9489, pp. 216–225. Springer, Heidelberg (2015). doi:10.​1007/​978-3-319-26532-2_​24 CrossRef
2.
Zurück zum Zitat Beigi, H.: Fundamentals of Speaker Recognition. Springer-Verlag New York Inc., New York (2011)CrossRefMATH Beigi, H.: Fundamentals of Speaker Recognition. Springer-Verlag New York Inc., New York (2011)CrossRefMATH
3.
Zurück zum Zitat Slingo, J., Palmer, T.: Uncertainty in weather and climate prediction. Phil. Trans. R. Soc. A 369, 4751–4767 (2011)CrossRefMATH Slingo, J., Palmer, T.: Uncertainty in weather and climate prediction. Phil. Trans. R. Soc. A 369, 4751–4767 (2011)CrossRefMATH
4.
Zurück zum Zitat Kurogi, S., Ueno, T., Sawa, M.: A batch learning method for competitive associative net and its application to function approximation. In: Proceedings of the SCI 2004, vol. V, pp. 24–28 (2004) Kurogi, S., Ueno, T., Sawa, M.: A batch learning method for competitive associative net and its application to function approximation. In: Proceedings of the SCI 2004, vol. V, pp. 24–28 (2004)
5.
Zurück zum Zitat Kurogi, S., Mineishi, S., Sato, S.: An analysis of speaker recognition using bagging CAN2 and pole distribution of speech signals. In: Wong, K.W., Mendis, B.S.U., Bouzerdoum, A. (eds.) ICONIP 2010. LNCS, vol. 6443, pp. 363–370. Springer, Heidelberg (2010). doi:10.1007/978-3-642-17537-4_45 CrossRef Kurogi, S., Mineishi, S., Sato, S.: An analysis of speaker recognition using bagging CAN2 and pole distribution of speech signals. In: Wong, K.W., Mendis, B.S.U., Bouzerdoum, A. (eds.) ICONIP 2010. LNCS, vol. 6443, pp. 363–370. Springer, Heidelberg (2010). doi:10.​1007/​978-3-642-17537-4_​45 CrossRef
6.
Zurück zum Zitat Campbell, J.P.: Speaker recognition: a tutorial. Proc. IEEE 85(9), 1437–1462 (1997)CrossRef Campbell, J.P.: Speaker recognition: a tutorial. Proc. IEEE 85(9), 1437–1462 (1997)CrossRef
7.
Zurück zum Zitat Kurogi, S., Ueki, T., Takeguchi, S., Mizobe, Y.: Properties of text-prompted multistep speaker verification using gibbs-distribution-based extended Bayesian inference for rejecting unregistered speakers. In: Loo, C.K., Yap, K.S., Wong, K.W., Teoh, A., Huang, K. (eds.) ICONIP 2014, Part II. LNCS, vol. 8835, pp. 35–43. Springer, Heidelberg (2014) Kurogi, S., Ueki, T., Takeguchi, S., Mizobe, Y.: Properties of text-prompted multistep speaker verification using gibbs-distribution-based extended Bayesian inference for rejecting unregistered speakers. In: Loo, C.K., Yap, K.S., Wong, K.W., Teoh, A., Huang, K. (eds.) ICONIP 2014, Part II. LNCS, vol. 8835, pp. 35–43. Springer, Heidelberg (2014)
Metadaten
Titel
Probabilistic Prediction for Text-Prompted Speaker Verification Capable of Accepting Spoken Words with the Same Meaning but Different Pronunciations
verfasst von
Shota Sakashita
Satoshi Takeguchi
Kazuya Matsuo
Shuichi Kurogi
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46681-1_38