Neural Computing and Applications

1 June 2014 | Original Article

A robust recurrent simultaneous perturbation stochastic approximation training algorithm for recurrent neural networks

Authors: Zhao Xu, Qing Song, Danwei Wang

Published in: Neural Computing and Applications | Issue 7-8/2014

Abstract

Training recurrent neural networks (RNNs) is computationally expensive because it requires gradient evaluations. Achieving fast convergence at low computational complexity remains an open problem. The transient response of the learning process is also critical, especially for online applications. Conventional RNN training algorithms such as backpropagation through time and real-time recurrent learning do not adequately meet these requirements because they often converge slowly. If a large learning rate is chosen to speed up training, the process may become unstable and the weights may diverge. This paper develops a novel RNN training algorithm, named robust recurrent simultaneous perturbation stochastic approximation (RRSPSA), with a specially designed recurrent hybrid adaptive parameter and adaptive learning rates. RRSPSA is a twin-engine simultaneous perturbation stochastic approximation (SPSA) type of RNN training algorithm. It uses three specially designed adaptive parameters to maximize training speed for a recurrent training signal while retaining certain weight convergence properties, and, like the original SPSA algorithm, it requires only two objective function measurements. Weight convergence and system stability of RRSPSA are proved in the sense of a Lyapunov function. Computer simulations demonstrate the applicability of the theoretical results.
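The abstract's point that SPSA needs only two objective function measurements can be illustrated with a minimal Python sketch of the basic SPSA gradient estimate. This is not the RRSPSA algorithm: it has no recurrent hybrid adaptive parameter and no adaptive learning rates. The function names, the decaying gain schedule, and all constants (`a`, `c`, `A`, `alpha`, `gamma`) are assumptions chosen for illustration.

```python
import numpy as np

def spsa_gradient(loss, theta, c, rng):
    """Estimate the gradient of `loss` at `theta` from two loss evaluations."""
    # Perturb every parameter simultaneously with a random +1/-1 vector.
    delta = rng.choice([-1.0, 1.0], size=theta.shape)
    loss_plus = loss(theta + c * delta)    # first measurement
    loss_minus = loss(theta - c * delta)   # second measurement
    # The same scalar difference is divided by each component of delta.
    return (loss_plus - loss_minus) / (2.0 * c * delta)

def spsa_minimize(loss, theta0, steps=2000, a=0.1, c=0.1, A=100.0,
                  alpha=0.602, gamma=0.101, seed=0):
    """Plain SPSA descent with decaying gains (illustrative constants)."""
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float).copy()
    for k in range(steps):
        a_k = a / (k + 1 + A) ** alpha   # step size for iteration k
        c_k = c / (k + 1) ** gamma       # perturbation size for iteration k
        theta = theta - a_k * spsa_gradient(loss, theta, c_k, rng)
    return theta

# Hypothetical usage: minimize a simple quadratic instead of an RNN loss.
target = np.array([1.0, -2.0, 0.5])
quadratic = lambda w: float(np.sum((w - target) ** 2))
estimate = spsa_minimize(quadratic, np.zeros(3))
```

Each iteration costs two loss evaluations regardless of the number of weights, whereas a coordinate-wise finite-difference estimate would need two evaluations per weight.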

Title: A robust recurrent simultaneous perturbation stochastic approximation training algorithm for recurrent neural networks
Authors: Zhao Xu, Qing Song, Danwei Wang
Publication date: 1 June 2014
Publisher: Springer London
Published in: Neural Computing and Applications, Issue 7-8/2014
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI: https://doi.org/10.1007/s00521-013-1436-5
