Published in: Neural Computing and Applications 7-8/2014

1 June 2014 | Original Article

A robust recurrent simultaneous perturbation stochastic approximation training algorithm for recurrent neural networks

Written by: Zhao Xu, Qing Song, Danwei Wang

Abstract

Training recurrent neural networks (RNNs) is computationally expensive because it requires gradient evaluations. Achieving fast convergence at low computational cost remains an open problem. In addition, the transient response of the learning process is critical, especially for online applications. Conventional RNN training algorithms such as backpropagation through time and real-time recurrent learning do not adequately meet these requirements because they often converge slowly, and if a large learning rate is chosen to improve performance, training may become unstable and the weights may diverge. In this paper, a novel RNN training algorithm, named robust recurrent simultaneous perturbation stochastic approximation (RRSPSA), is developed with a specially designed recurrent hybrid adaptive parameter and adaptive learning rates. RRSPSA is a twin-engine simultaneous perturbation stochastic approximation (SPSA) type of RNN training algorithm. It uses three specially designed adaptive parameters to maximize training speed for a recurrent training signal while retaining certain weight convergence properties, and it needs only two objective function measurements, as in the original SPSA algorithm. Weight convergence and system stability of RRSPSA are proved in the sense of a Lyapunov function. Computer simulations were carried out to demonstrate the applicability of the theoretical results.
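
For orientation, the "two objective function measurements" property of the original SPSA algorithm can be illustrated with a minimal sketch. This is plain SPSA on a toy quadratic loss, not the RRSPSA algorithm of the paper: the function names (`spsa_minimize`, `quadratic`), the gain constants, and the decay exponents 0.602 and 0.101 are illustrative assumptions, and none of the paper's adaptive parameters or recurrent terms are modeled.

```python
import random


def spsa_minimize(loss, theta, iterations=200, a=0.1, c=0.1, seed=0):
    """Plain SPSA sketch (NOT RRSPSA): two loss measurements per iteration.

    The gains a, c and the decay exponents are assumed values for illustration.
    """
    rng = random.Random(seed)
    theta = list(theta)
    for k in range(1, iterations + 1):
        a_k = a / k ** 0.602  # decaying step size
        c_k = c / k ** 0.101  # decaying perturbation size
        # Perturb all weights simultaneously with random +/-1 signs.
        delta = [rng.choice((-1.0, 1.0)) for _ in theta]
        plus = [t + c_k * d for t, d in zip(theta, delta)]
        minus = [t - c_k * d for t, d in zip(theta, delta)]
        # The only two objective function measurements in this iteration.
        diff = loss(plus) - loss(minus)
        # Gradient estimate for weight i is diff / (2 * c_k * delta_i).
        theta = [t - a_k * diff / (2.0 * c_k * d) for t, d in zip(theta, delta)]
    return theta


def quadratic(w):
    """Toy objective with its minimum at (1, -2); a hypothetical stand-in for a training loss."""
    return sum((wi - ti) ** 2 for wi, ti in zip(w, (1.0, -2.0)))


start = [0.0, 0.0]
result = spsa_minimize(quadratic, start)
print(quadratic(start), quadratic(result))
```

The number of loss evaluations per iteration stays at two regardless of how many weights are perturbed, which is what makes this family of methods attractive when explicit gradient evaluation is costly.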

Metadata
Title
A robust recurrent simultaneous perturbation stochastic approximation training algorithm for recurrent neural networks
Written by
Zhao Xu
Qing Song
Danwei Wang
Publication date
1 June 2014
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 7-8/2014
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-013-1436-5
