
2013 | Original Paper | Book Chapter

29. Time-Delay Neural Network with 3 Frequency Bands Based on Voiced Speech Discrimination in Noise

Author: Jae Seung Choi

Published in: Future Information Communication Technology and Applications

Publisher: Springer Netherlands

Abstract

Information about how a speech signal varies over time is important when training a neural network on speech input. This paper therefore proposes a time-delay neural network with 3 frequency bands, based on voiced speech discrimination under background noise. The effectiveness of the proposed network is confirmed experimentally by measuring the correct discrimination rates for speech degraded by various noises.
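
The abstract does not give the network's architecture, so the following is only a minimal sketch of the general idea of a time-delay unit applied separately to three frequency bands. Everything specific here is an assumption made for illustration and is not taken from the paper: the equal-width band split, the number of delay taps, the sigmoid activation, and the hypothetical helper names `split_bands`, `time_delay_layer`, and `band_outputs`.

```python
import math


def split_bands(frame, n_bands=3):
    """Split one spectral frame into contiguous bands.

    Assumption: bands are equal-width; the last band takes any remainder.
    The paper's actual band edges are not stated in the abstract.
    """
    size = len(frame) // n_bands
    bands = []
    for i in range(n_bands):
        start = i * size
        end = (i + 1) * size if i < n_bands - 1 else len(frame)
        bands.append(frame[start:end])
    return bands


def time_delay_layer(frames, weights, bias):
    """Apply one time-delay unit to a sequence of feature frames.

    frames  : list of T frames, each a list of D features
    weights : list of n_taps weight vectors of length D;
              weights[k] multiplies the frame k steps in the past
    bias    : scalar bias
    Returns T - n_taps + 1 sigmoid activations, one per full window.
    """
    n_taps = len(weights)
    outputs = []
    for t in range(n_taps - 1, len(frames)):
        total = bias
        for k in range(n_taps):
            # The current frame and its delayed copies all feed the unit.
            total += sum(w * x for w, x in zip(weights[k], frames[t - k]))
        outputs.append(1.0 / (1.0 + math.exp(-total)))
    return outputs


def band_outputs(frames, band_weights, band_biases):
    """Run one time-delay unit per frequency band (illustrative only)."""
    n_bands = len(band_weights)
    per_band = []
    for b in range(n_bands):
        band_frames = [split_bands(f, n_bands)[b] for f in frames]
        per_band.append(
            time_delay_layer(band_frames, band_weights[b], band_biases[b])
        )
    return per_band
```

Because each output depends on the current frame together with its delayed predecessors, the unit sees a short stretch of the signal's time variation rather than a single frame, which is the property the abstract highlights. How the per-band outputs are combined into a voiced or unvoiced decision is not described in the abstract and is left out of this sketch.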

DOI: https://doi.org/10.1007/978-94-007-6516-0_29
