Skip to main content
Erschienen in: Neural Computing and Applications 15/2021

16.01.2021 | Original Article

Full-state neural network observer-based hybrid quantum diagonal recurrent neural network adaptive tracking control

verfasst von: Ahmed Elkenawy, Ahmad M. El-Nagar, Mohammad El-Bardini, Nabila M. El-Rabaie

Erschienen in: Neural Computing and Applications | Ausgabe 15/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This study introduces a neural network (NN) adaptive tracking controller-based reinforcement learning (RL) scheme for unknown nonlinear systems. First, an observer using feed-forward NN (FFNN) is performed to estimate the controlled system states. Second, an adaptive control based on actor-critic RL is developed, in which a quantum diagonal recurrent neural network (QDRNN) is proposed to represent the critic and actor parts. The critic QDRNN is applied to perform the “strategic” utility function, and it is minimized by the actor QDRNN. The proposed adaptive tracking NN control guarantees the faster convergence due to the developed updated algorithm for the controller parameters, which is derived using the Lyapunov function. Simulation and practical results indicate the robustness of the proposed observer-based adaptive control relative to other existing controllers.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Wang CC, Yang GH (2018) Observer-based adaptive prescribed performance tracking control for nonlinear systems with unknown control direction and input saturation. Neurocomputing 284:17–26CrossRef Wang CC, Yang GH (2018) Observer-based adaptive prescribed performance tracking control for nonlinear systems with unknown control direction and input saturation. Neurocomputing 284:17–26CrossRef
2.
Zurück zum Zitat Narendra KS (1996) Neural networks for control theory and practice. Proc IEEE 84(10):1385–1406CrossRef Narendra KS (1996) Neural networks for control theory and practice. Proc IEEE 84(10):1385–1406CrossRef
3.
Zurück zum Zitat Peng J, Dubay R (2011) Identification and adaptive neural network control of a DC motor system with dead-zone characteristics. ISA Trans 50(4):588–598CrossRef Peng J, Dubay R (2011) Identification and adaptive neural network control of a DC motor system with dead-zone characteristics. ISA Trans 50(4):588–598CrossRef
4.
Zurück zum Zitat Lakhal AN, Tlili AS, Braiek NB (2010) Neural network observer for nonlinear systems application to induction motors. Int J Control Autom 3(1):1–16 Lakhal AN, Tlili AS, Braiek NB (2010) Neural network observer for nonlinear systems application to induction motors. Int J Control Autom 3(1):1–16
5.
Zurück zum Zitat Chow TW, Fang Y (1998) A recurrent neural-network-based real-time learning control strategy applying to nonlinear systems with unknown dynamics. IEEE Trans Ind Electr 45(1):151–161CrossRef Chow TW, Fang Y (1998) A recurrent neural-network-based real-time learning control strategy applying to nonlinear systems with unknown dynamics. IEEE Trans Ind Electr 45(1):151–161CrossRef
6.
Zurück zum Zitat Kumar R, Srivastava S, Gupta JRP (2017) Diagonal recurrent neural network based adaptive control of nonlinear dynamical systems using Lyapunov stability criterion. ISA Trans 67:407–427CrossRef Kumar R, Srivastava S, Gupta JRP (2017) Diagonal recurrent neural network based adaptive control of nonlinear dynamical systems using Lyapunov stability criterion. ISA Trans 67:407–427CrossRef
7.
Zurück zum Zitat Kumar R, Srivastava S, Gupta JRP, Mohindru A (2018) Diagonal recurrent neural network based identification of nonlinear dynamical systems with Lyapunov stability based adaptive learning rates. Neurocomputing 287:102–117CrossRef Kumar R, Srivastava S, Gupta JRP, Mohindru A (2018) Diagonal recurrent neural network based identification of nonlinear dynamical systems with Lyapunov stability based adaptive learning rates. Neurocomputing 287:102–117CrossRef
8.
Zurück zum Zitat Krener AJ, Respondek W (1985) Nonlinear observers with linearizable error dynamics. SIAM Journal on Control and Optimization 23(2):197–216MathSciNetMATHCrossRef Krener AJ, Respondek W (1985) Nonlinear observers with linearizable error dynamics. SIAM Journal on Control and Optimization 23(2):197–216MathSciNetMATHCrossRef
9.
Zurück zum Zitat Oh S, Khalil HK (1997) Nonlinear output-feedback tracking using high-gain observer and variable structure control. Automatica 33(10):1845–1856MathSciNetMATHCrossRef Oh S, Khalil HK (1997) Nonlinear output-feedback tracking using high-gain observer and variable structure control. Automatica 33(10):1845–1856MathSciNetMATHCrossRef
10.
Zurück zum Zitat Yan XG, Edwards C (2007) Nonlinear robust fault reconstruction and estimation using a sliding mode observer. Automatica 43(9):1605–1614MathSciNetMATHCrossRef Yan XG, Edwards C (2007) Nonlinear robust fault reconstruction and estimation using a sliding mode observer. Automatica 43(9):1605–1614MathSciNetMATHCrossRef
12.
Zurück zum Zitat Mu C, Zhang Y, Wang K (2018) Observer-based adaptive control of uncertain nonlinear systems via neural networks. IEEE Access 6:42675–42686CrossRef Mu C, Zhang Y, Wang K (2018) Observer-based adaptive control of uncertain nonlinear systems via neural networks. IEEE Access 6:42675–42686CrossRef
13.
Zurück zum Zitat Yang Y, Xu C, Yue D, Xie X (2018) Output feedback tracking control of a class of continuous-time nonlinear systems via adaptive dynamic programming approach. Inf Sci 469:1–13MathSciNetMATHCrossRef Yang Y, Xu C, Yue D, Xie X (2018) Output feedback tracking control of a class of continuous-time nonlinear systems via adaptive dynamic programming approach. Inf Sci 469:1–13MathSciNetMATHCrossRef
14.
Zurück zum Zitat He P, Jagannathan S (2005) Reinforcement learning-based output feedback control of nonlinear systems with input constraints. IEEE Trans Syst Man Cybern Part B Cybern 35(1):150–154CrossRef He P, Jagannathan S (2005) Reinforcement learning-based output feedback control of nonlinear systems with input constraints. IEEE Trans Syst Man Cybern Part B Cybern 35(1):150–154CrossRef
15.
Zurück zum Zitat Rios JD, Alanis AY, Arana-Daniel N, Lopez-Franco C (2017) Recurrent high order neural observer for discrete-time non-linear systems with unknown time-delay. Neural Process Lett 46(2):663–679CrossRef Rios JD, Alanis AY, Arana-Daniel N, Lopez-Franco C (2017) Recurrent high order neural observer for discrete-time non-linear systems with unknown time-delay. Neural Process Lett 46(2):663–679CrossRef
16.
Zurück zum Zitat He P, Jagannathan S (2007) Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints. IEEE Trans Syst Man Cybern Part B Cybern 37(2):425–436CrossRef He P, Jagannathan S (2007) Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints. IEEE Trans Syst Man Cybern Part B Cybern 37(2):425–436CrossRef
17.
Zurück zum Zitat Lewis FL, Liu D (eds) (2013) Reinforcement learning and approximate dynamic programming for feedback control, vol 17. Wiley, New York Lewis FL, Liu D (eds) (2013) Reinforcement learning and approximate dynamic programming for feedback control, vol 17. Wiley, New York
18.
Zurück zum Zitat Yang X, Liu D, Wang D (2013) Observer-based adaptive output feedback control for nonaffine nonlinear discrete-time systems using reinforcement learning. In: International Conference on Neural Information Processing, Springer, Berlin, pp 631–638 Yang X, Liu D, Wang D (2013) Observer-based adaptive output feedback control for nonaffine nonlinear discrete-time systems using reinforcement learning. In: International Conference on Neural Information Processing, Springer, Berlin, pp 631–638
19.
Zurück zum Zitat Barto AG (1992) Reinforcement learning and adaptive critic methods. Handbook of intelligent control Barto AG (1992) Reinforcement learning and adaptive critic methods. Handbook of intelligent control
20.
Zurück zum Zitat Sharma M, Verma A (2013) Wavelet reduced order observer based adaptive tracking control for a class of uncertain nonlinear systems using reinforcement learning. Int J Control Autom Syst 11(3):496–502CrossRef Sharma M, Verma A (2013) Wavelet reduced order observer based adaptive tracking control for a class of uncertain nonlinear systems using reinforcement learning. Int J Control Autom Syst 11(3):496–502CrossRef
21.
Zurück zum Zitat Sharma M, Verma A (2013) Wavelet reduced order observer-based adaptive tracking control for a class of uncertain delayed non-linear systems subjected to actuator saturation using actor-critic architecture. Int J Autom Control 7(4):288–303CrossRef Sharma M, Verma A (2013) Wavelet reduced order observer-based adaptive tracking control for a class of uncertain delayed non-linear systems subjected to actuator saturation using actor-critic architecture. Int J Autom Control 7(4):288–303CrossRef
22.
Zurück zum Zitat Kim EK, Mwasilu F, Choi HH, Jung JW (2014) An observer-based optimal voltage control scheme for three-phase UPS systems. IEEE Trans Ind Electr 62(4):2073–2081CrossRef Kim EK, Mwasilu F, Choi HH, Jung JW (2014) An observer-based optimal voltage control scheme for three-phase UPS systems. IEEE Trans Ind Electr 62(4):2073–2081CrossRef
23.
Zurück zum Zitat Yang X, Liu D, Wang D (2013) Observer-based adaptive output feedback control for nonaffine nonlinear discrete-time systems using reinforcement learning. In: International Conference on Neural Information Processing, Springer, Berlin, pp 631–638 Yang X, Liu D, Wang D (2013) Observer-based adaptive output feedback control for nonaffine nonlinear discrete-time systems using reinforcement learning. In: International Conference on Neural Information Processing, Springer, Berlin, pp 631–638
24.
Zurück zum Zitat Mu D, Guan Z, Zhang H (2013) Learning algorithm and application of quantum neural networks with quantum weights. Int J Comput Theory Eng 5(5):788CrossRef Mu D, Guan Z, Zhang H (2013) Learning algorithm and application of quantum neural networks with quantum weights. Int J Comput Theory Eng 5(5):788CrossRef
25.
Zurück zum Zitat Shang F (2015) Quantum-inspired neural network with quantum weights and real Weights. Open J Appl Sci 5(10):609CrossRef Shang F (2015) Quantum-inspired neural network with quantum weights and real Weights. Open J Appl Sci 5(10):609CrossRef
26.
Zurück zum Zitat Panella, M., & Martinelli, G (2011) Neural networks with quantum architecture and quantum learning. Int J Circuit Theory Appl 39(1):61–77MATHCrossRef Panella, M., & Martinelli, G (2011) Neural networks with quantum architecture and quantum learning. Int J Circuit Theory Appl 39(1):61–77MATHCrossRef
27.
Zurück zum Zitat Sarangapani J (2006) Neural network control of nonlinear discrete-time systems. CRC Press, Boca RatonCrossRef Sarangapani J (2006) Neural network control of nonlinear discrete-time systems. CRC Press, Boca RatonCrossRef
28.
Zurück zum Zitat Liu D, Huang Y, Wang D, Wei Q (2013) Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming. Int J Control 86(9):1554–1566MathSciNetMATHCrossRef Liu D, Huang Y, Wang D, Wei Q (2013) Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming. Int J Control 86(9):1554–1566MathSciNetMATHCrossRef
29.
Zurück zum Zitat Abdollahi F, Talebi HA, Patel RV (2006) A stable neural network-based observer with application to flexible-joint manipulators. IEEE Trans Neural Netw 17(1):118–129CrossRef Abdollahi F, Talebi HA, Patel RV (2006) A stable neural network-based observer with application to flexible-joint manipulators. IEEE Trans Neural Netw 17(1):118–129CrossRef
30.
Zurück zum Zitat Khater AA, El-Nagar AM, El-Bardini M, El-Rabaie NM (2018) Adaptive T-S fuzzy controller using reinforcement learning based on Lyapunov stability. J Franklin Instit 355(14):6390–6415MathSciNetMATHCrossRef Khater AA, El-Nagar AM, El-Bardini M, El-Rabaie NM (2018) Adaptive T-S fuzzy controller using reinforcement learning based on Lyapunov stability. J Franklin Instit 355(14):6390–6415MathSciNetMATHCrossRef
31.
Zurück zum Zitat Khater AA, El-Nagar AM, El-Bardini M, El-Rabaie NM (2020) Online learning based on adaptive learning rate for a class of recurrent fuzzy neural network. Neural Comput Appl 32(12):8691–8710MATHCrossRef Khater AA, El-Nagar AM, El-Bardini M, El-Rabaie NM (2020) Online learning based on adaptive learning rate for a class of recurrent fuzzy neural network. Neural Comput Appl 32(12):8691–8710MATHCrossRef
32.
Zurück zum Zitat Khater AA, El-Nagar AM, El-Bardini M, El-Rabaie N (2019) A novel structure of actor-critic learning based on an interval type-2 TSK fuzzy neural network. IEEE Trans Fuzzy Syst 28:3047–3061MATHCrossRef Khater AA, El-Nagar AM, El-Bardini M, El-Rabaie N (2019) A novel structure of actor-critic learning based on an interval type-2 TSK fuzzy neural network. IEEE Trans Fuzzy Syst 28:3047–3061MATHCrossRef
33.
Zurück zum Zitat Elkenawy A, El-Nagar AM, El-Bardini M, El-Rabaie NM (2020) Diagonal recurrent neural network observer-based adaptive control for unknown nonlinear systems. Trans Instit Meas Control 42(15):2833–2856CrossRef Elkenawy A, El-Nagar AM, El-Bardini M, El-Rabaie NM (2020) Diagonal recurrent neural network observer-based adaptive control for unknown nonlinear systems. Trans Instit Meas Control 42(15):2833–2856CrossRef
34.
Zurück zum Zitat Esfandiari K, Abdollahi F, Talebi HA (2014) Observer-based adaptive neural network control for a class of uncertain nonlinear systems. In: 2014 22nd Iranian Conference on Electrical Engineering (ICEE). IEEE, pp 1354–1359 Esfandiari K, Abdollahi F, Talebi HA (2014) Observer-based adaptive neural network control for a class of uncertain nonlinear systems. In: 2014 22nd Iranian Conference on Electrical Engineering (ICEE). IEEE, pp 1354–1359
Metadaten
Titel
Full-state neural network observer-based hybrid quantum diagonal recurrent neural network adaptive tracking control
verfasst von
Ahmed Elkenawy
Ahmad M. El-Nagar
Mohammad El-Bardini
Nabila M. El-Rabaie
Publikationsdatum
16.01.2021
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 15/2021
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-020-05685-x

Weitere Artikel der Ausgabe 15/2021

Neural Computing and Applications 15/2021 Zur Ausgabe