Skip to main content
Erschienen in: Neural Computing and Applications 7-8/2014

01.06.2014 | Original Article

An efficient hybrid learning algorithm for neural network–based speech recognition systems on FPGA chip

verfasst von: Shing-Tai Pan, Min-Lun Lan

Erschienen in: Neural Computing and Applications | Ausgabe 7-8/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper implemented an artificial neural network (ANN) on a field programmable gate array (FPGA) chip for Mandarin speech measurement and recognition of nonspecific speaker. A three-layer hybrid learning algorithm (HLA), which combines genetic algorithm (GA) and steepest descent method, was proposed to fulfill a faster global search of optimal weights in ANN. Some other popular evolutionary algorithms, such as differential evolution, particle swarm optimization and improve GA, were compared to the proposed HLA. It can be seen that the proposed HLA algorithm outperforms the other algorithms. Finally, the designed system was implemented on an FPGA chip with an SOC architecture to measure and recognize the speech signals.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Sivaram GSVS, Nemala SK, Mesgarani N, Hermansky H (2010) Data-driven and feedback based spectro-temporal features for speech recognition. IEEE Signal Process Lett 17(11):957–960CrossRef Sivaram GSVS, Nemala SK, Mesgarani N, Hermansky H (2010) Data-driven and feedback based spectro-temporal features for speech recognition. IEEE Signal Process Lett 17(11):957–960CrossRef
2.
Zurück zum Zitat Lauria S (2007) Talking to machines: introducing Robot perception to resolve speech recognition uncertainties. Circuits Syst Signal Process 26(4):513–526CrossRefMATH Lauria S (2007) Talking to machines: introducing Robot perception to resolve speech recognition uncertainties. Circuits Syst Signal Process 26(4):513–526CrossRefMATH
3.
Zurück zum Zitat Wan CY, Lee LS (2008) Histogram-based quantization for robust and/or distributed speech recognition. IEEE Trans Audio Speech Lang Processing 16(4):859–873CrossRef Wan CY, Lee LS (2008) Histogram-based quantization for robust and/or distributed speech recognition. IEEE Trans Audio Speech Lang Processing 16(4):859–873CrossRef
4.
Zurück zum Zitat Hagon MT, Demuth HB, Beale M (1996) Neural network design. Thomson Learning, Stamford Hagon MT, Demuth HB, Beale M (1996) Neural network design. Thomson Learning, Stamford
5.
Zurück zum Zitat Kwong S, Chau CW (1997) Analysis of parallel genetic algorithms on HMM based speech recognition system. IEEE Trans Consumer Electron 43(4):1229–1233CrossRef Kwong S, Chau CW (1997) Analysis of parallel genetic algorithms on HMM based speech recognition system. IEEE Trans Consumer Electron 43(4):1229–1233CrossRef
6.
Zurück zum Zitat Shi Y, Liu J, Liu R (2001) Single-chip speech recognition system based on 8051 microcontroller core. IEEE Trans Consumer Electron 47(1):149–153CrossRef Shi Y, Liu J, Liu R (2001) Single-chip speech recognition system based on 8051 microcontroller core. IEEE Trans Consumer Electron 47(1):149–153CrossRef
7.
Zurück zum Zitat Lin FJ, Huang PK, Chou WD (2007) Recurrent-fuzzy-neural-network-controlled linear induction motor servo drive using genetic algorithms. IEEE Trans Ind Electron 54(3):1449–1461CrossRef Lin FJ, Huang PK, Chou WD (2007) Recurrent-fuzzy-neural-network-controlled linear induction motor servo drive using genetic algorithms. IEEE Trans Ind Electron 54(3):1449–1461CrossRef
8.
Zurück zum Zitat Karamalis PD, Kanatas AG, Constantinou P (2009) A genetic algorithm applied for optimization of antenna arrays used in mobile radio channel characterization devices. IEEE Trans Instrum Meas 58:2475–2487CrossRef Karamalis PD, Kanatas AG, Constantinou P (2009) A genetic algorithm applied for optimization of antenna arrays used in mobile radio channel characterization devices. IEEE Trans Instrum Meas 58:2475–2487CrossRef
10.
Zurück zum Zitat Huang X, Acero A, Wuenon H (2005) Spoken language processing a guide to theory algorithm and system development. Pearson, London Huang X, Acero A, Wuenon H (2005) Spoken language processing a guide to theory algorithm and system development. Pearson, London
11.
Zurück zum Zitat Leung HF, Lam HK, Ling SH (2003) Tuning of the structure and parameters of a neural network using an improved genetic algorithm. IEEE Trans Neural Netw 14:79–88CrossRef Leung HF, Lam HK, Ling SH (2003) Tuning of the structure and parameters of a neural network using an improved genetic algorithm. IEEE Trans Neural Netw 14:79–88CrossRef
12.
Zurück zum Zitat Kennedy J, Eberhart RC (1995) “Particle swarm optimization.” In: Proceedings IEEE International Conference on Neural Networks (Perth, Australia), IEEE Service Center, Piscataway, NJ, pp IV:1942–1948, 1995 Kennedy J, Eberhart RC (1995) “Particle swarm optimization.” In: Proceedings IEEE International Conference on Neural Networks (Perth, Australia), IEEE Service Center, Piscataway, NJ, pp IV:1942–1948, 1995
13.
Zurück zum Zitat Storn R, Price K (1997) Differential evolution- A simple and efficient heurist for global optimization over continuous spaces. J. of Global Optimization 11:341–359MathSciNetCrossRefMATH Storn R, Price K (1997) Differential evolution- A simple and efficient heurist for global optimization over continuous spaces. J. of Global Optimization 11:341–359MathSciNetCrossRefMATH
14.
Zurück zum Zitat Runstein F, Violaro F (1995)”An isolated-word speech recognition system using neural networks”.In: Proceeding of the 38th midwest symposium on circuit and systems, Vol 1, pp 550–553, 1995 Runstein F, Violaro F (1995)”An isolated-word speech recognition system using neural networks”.In: Proceeding of the 38th midwest symposium on circuit and systems, Vol 1, pp 550–553, 1995
15.
Zurück zum Zitat Sadaoki F, Dekker M (2001) Digital speech processing, synthesis, and recognition. Marcel Dekker, New York Sadaoki F, Dekker M (2001) Digital speech processing, synthesis, and recognition. Marcel Dekker, New York
Metadaten
Titel
An efficient hybrid learning algorithm for neural network–based speech recognition systems on FPGA chip
verfasst von
Shing-Tai Pan
Min-Lun Lan
Publikationsdatum
01.06.2014
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 7-8/2014
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-013-1428-5

Weitere Artikel der Ausgabe 7-8/2014

Neural Computing and Applications 7-8/2014 Zur Ausgabe