Skip to main content
Erschienen in: Neural Computing and Applications 15/2021

04.02.2021 | Original Article

Optimal tracking control of switched systems applied in grid-connected hybrid generation using reinforcement learning

verfasst von: Jiayue Sun, Huaguang Zhang, Yingchun Wang, Mingrui Fu

Erschienen in: Neural Computing and Applications | Ausgabe 15/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The paper presents a reinforcement learning approach for optimal tracking control of switched systems with application to a grid-tied hybrid generation system. To enhance interaction with the irregular environment, reference trajectory is learned via controller from states to optimal control. The main issue is to solve the optimal tracking control problem for a hybrid generation system consisting of multiple switched subsystems, and reinforcement learning can seek the globally optimal solution well without knowing accurate system dynamics. The investigated learning algorithm is used to generate an optimum map based on the learned ultimate value without knowledge of system parameters and obtains the optimal control law via deriving of algebraic Riccati equation (ARE) with unnecessary knowing of command generator dynamics. The optimal control solution can converge the online learning algorithm well based on policy iteration as verification in the simulation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Kim S, Jeon J, Cho C, Ahn J, Kwon S (2008) Dynamic modeling and control of a grid-connected hybrid generation system with versatile power transfer. IEEE Trans Ind Electron 55(4):1677–1688CrossRef Kim S, Jeon J, Cho C, Ahn J, Kwon S (2008) Dynamic modeling and control of a grid-connected hybrid generation system with versatile power transfer. IEEE Trans Ind Electron 55(4):1677–1688CrossRef
2.
Zurück zum Zitat Jun Zhao, Dimirovski GM (2004) Quadratic stability of a class of switched nonlinear systems. IEEE Trans Autom Control 49(4):574–578MathSciNetCrossRef Jun Zhao, Dimirovski GM (2004) Quadratic stability of a class of switched nonlinear systems. IEEE Trans Autom Control 49(4):574–578MathSciNetCrossRef
3.
Zurück zum Zitat Aleksandrov AY, Chen Y, Platonov AV, Zhang L (2011) Stability analysis for a class of switched nonlinear systems. Automatica 47(10):2286–2291MathSciNetCrossRef Aleksandrov AY, Chen Y, Platonov AV, Zhang L (2011) Stability analysis for a class of switched nonlinear systems. Automatica 47(10):2286–2291MathSciNetCrossRef
4.
Zurück zum Zitat Valenciaga F, Puleston PF (2005) Supervisor control for a stand-alone hybrid generation system using wind and photovoltaic energy. IEEE Trans Energy Convers 20(2):398–405CrossRef Valenciaga F, Puleston PF (2005) Supervisor control for a stand-alone hybrid generation system using wind and photovoltaic energy. IEEE Trans Energy Convers 20(2):398–405CrossRef
5.
Zurück zum Zitat Heydari A, Balakrishnan SN (2014) Optimal switching and control of nonlinear switching systems using approximate dynamic programming. IEEE Trans Neural Netw Learn Syst 25(6):1106–1117CrossRef Heydari A, Balakrishnan SN (2014) Optimal switching and control of nonlinear switching systems using approximate dynamic programming. IEEE Trans Neural Netw Learn Syst 25(6):1106–1117CrossRef
6.
Zurück zum Zitat Zhang H, Qin C, Luo Y (2014) Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming. IEEE Trans Autom Sci Eng 11(3):839–849CrossRef Zhang H, Qin C, Luo Y (2014) Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming. IEEE Trans Autom Sci Eng 11(3):839–849CrossRef
7.
Zurück zum Zitat Lewis FL, Vrabie D, Vamvoudakis KG (2012) Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers. IEEE Control Syst Mag 32(6):76–105MathSciNetCrossRef Lewis FL, Vrabie D, Vamvoudakis KG (2012) Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers. IEEE Control Syst Mag 32(6):76–105MathSciNetCrossRef
9.
Zurück zum Zitat Wei C, Zhang Z, Qiao W, Qu L (2015) Reinforcement-learning-based intelligent maximum power point tracking control for wind energy conversion systems. IEEE Trans Ind Electron 62(10):6360–6370CrossRef Wei C, Zhang Z, Qiao W, Qu L (2015) Reinforcement-learning-based intelligent maximum power point tracking control for wind energy conversion systems. IEEE Trans Ind Electron 62(10):6360–6370CrossRef
10.
Zurück zum Zitat Mannava A, Balakrishnan SN, Tang L, Landers RG (2012) Optimal tracking control of motion systems. IEEE Trans Control Syst Technol 20(6):1548–1558CrossRef Mannava A, Balakrishnan SN, Tang L, Landers RG (2012) Optimal tracking control of motion systems. IEEE Trans Control Syst Technol 20(6):1548–1558CrossRef
11.
Zurück zum Zitat Zhang H, Cui L, Zhang X, Luo Y (2011) Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Trans Neural Netw 22(12):2226–2236CrossRef Zhang H, Cui L, Zhang X, Luo Y (2011) Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Trans Neural Netw 22(12):2226–2236CrossRef
12.
Zurück zum Zitat Luo B, Yang Y, Liu D, Wu H (2020) Event-triggered optimal control with performance guarantees using adaptive dynamic programming. IEEE Trans Neural Netw Learn Syst 31(1):76–88MathSciNetCrossRef Luo B, Yang Y, Liu D, Wu H (2020) Event-triggered optimal control with performance guarantees using adaptive dynamic programming. IEEE Trans Neural Netw Learn Syst 31(1):76–88MathSciNetCrossRef
13.
Zurück zum Zitat Wang F, Zhang H, Liu D (2009) Adaptive dynamic programming: an introduction. IEEE Comput Intell Mag 4(2):39–47CrossRef Wang F, Zhang H, Liu D (2009) Adaptive dynamic programming: an introduction. IEEE Comput Intell Mag 4(2):39–47CrossRef
14.
Zurück zum Zitat Jiang Y, Jiang ZP (2012) Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics. Automatica 48(10):2699–2704MathSciNetCrossRef Jiang Y, Jiang ZP (2012) Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics. Automatica 48(10):2699–2704MathSciNetCrossRef
15.
Zurück zum Zitat Lewis FL, Vamvoudakis KG (2011) Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data. IEEE Trans Syst Man Cybern Part B (Cybernetics) 41(1):14–25CrossRef Lewis FL, Vamvoudakis KG (2011) Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data. IEEE Trans Syst Man Cybern Part B (Cybernetics) 41(1):14–25CrossRef
16.
Zurück zum Zitat Tong S, Zhang L, Li Y (2016) Observed-based adaptive fuzzy decentralized tracking control for switched uncertain nonlinear large-scale systems with dead zones. IEEE Trans Syst Man Cybern Syst 46(1):37–47CrossRef Tong S, Zhang L, Li Y (2016) Observed-based adaptive fuzzy decentralized tracking control for switched uncertain nonlinear large-scale systems with dead zones. IEEE Trans Syst Man Cybern Syst 46(1):37–47CrossRef
17.
Zurück zum Zitat Hajiahmadi M, De Schutter B, Hellendoorn H (2016) Design of stabilizing switching laws for mixed switched affine systems. IEEE Trans Autom Control 61(6):1676–1681MathSciNetCrossRef Hajiahmadi M, De Schutter B, Hellendoorn H (2016) Design of stabilizing switching laws for mixed switched affine systems. IEEE Trans Autom Control 61(6):1676–1681MathSciNetCrossRef
18.
Zurück zum Zitat Lu W, Zhu P, Ferrari S (2016) A hybrid-adaptive dynamic programming approach for the model-free control of nonlinear switched systems. IEEE Trans Autom Control 61(10):3203–3208MathSciNetCrossRef Lu W, Zhu P, Ferrari S (2016) A hybrid-adaptive dynamic programming approach for the model-free control of nonlinear switched systems. IEEE Trans Autom Control 61(10):3203–3208MathSciNetCrossRef
19.
Zurück zum Zitat Niu B, Ahn CK, Li H, Liu M (2018) Adaptive control for stochastic switched nonlower triangular nonlinear systems and its application to a one-link manipulator. IEEE Trans Syst Man Cybern Syst 48(10):1701–1714CrossRef Niu B, Ahn CK, Li H, Liu M (2018) Adaptive control for stochastic switched nonlower triangular nonlinear systems and its application to a one-link manipulator. IEEE Trans Syst Man Cybern Syst 48(10):1701–1714CrossRef
20.
Zurück zum Zitat Ni Z, He H, Wen J (2013) Adaptive learning in tracking control based on the dual critic network design. IEEE Trans Neural Netw Learn Syst 24(6):913–928CrossRef Ni Z, He H, Wen J (2013) Adaptive learning in tracking control based on the dual critic network design. IEEE Trans Neural Netw Learn Syst 24(6):913–928CrossRef
21.
Zurück zum Zitat Shen H, Huo S, Cao J, Huang T (2019) Generalized state estimation for Markovian coupled networks under round-robin protocol and redundant channels. IEEE Trans Cybern 49(4):1292–1301CrossRef Shen H, Huo S, Cao J, Huang T (2019) Generalized state estimation for Markovian coupled networks under round-robin protocol and redundant channels. IEEE Trans Cybern 49(4):1292–1301CrossRef
24.
Zurück zum Zitat Vamvoudakis KG, Lewis FL (2010) Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 46(5):878–888MathSciNetCrossRef Vamvoudakis KG, Lewis FL (2010) Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 46(5):878–888MathSciNetCrossRef
25.
Zurück zum Zitat Vrabie D, Pastravanu O et al (2009) Adaptive optimal control for continuous-time linear systems based on policy iteration. Automatica 45(2):477–484MathSciNetCrossRef Vrabie D, Pastravanu O et al (2009) Adaptive optimal control for continuous-time linear systems based on policy iteration. Automatica 45(2):477–484MathSciNetCrossRef
26.
Zurück zum Zitat Tran DH, Hamker F, Nassour J (2020) A humanoid robot learns to recover perturbation during swinging motion. IEEE Trans Syst Man Cybern Syst 50(10):3701–3712CrossRef Tran DH, Hamker F, Nassour J (2020) A humanoid robot learns to recover perturbation during swinging motion. IEEE Trans Syst Man Cybern Syst 50(10):3701–3712CrossRef
27.
Zurück zum Zitat Liu M, Wan Y, Li S, Lewis FL, Fu S (2020) Learning and uncertainty-exploited directional antenna control for robust long-distance and broad-band aerial communication. IEEE Trans Veh Technol 69(1):593–606CrossRef Liu M, Wan Y, Li S, Lewis FL, Fu S (2020) Learning and uncertainty-exploited directional antenna control for robust long-distance and broad-band aerial communication. IEEE Trans Veh Technol 69(1):593–606CrossRef
28.
Zurück zum Zitat Corona D, Buisson J, De Schutter B, Giua A (2007) Stabilization of switched affine systems: an application to the buck-boost converter. In: 2007 American control conference, New York, NY, pp 6037–6042 Corona D, Buisson J, De Schutter B, Giua A (2007) Stabilization of switched affine systems: an application to the buck-boost converter. In: 2007 American control conference, New York, NY, pp 6037–6042
29.
Zurück zum Zitat Lee JY, Park JB, Choi YH (2012) Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems. Automatica 48(11):2850–2859MathSciNetCrossRef Lee JY, Park JB, Choi YH (2012) Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems. Automatica 48(11):2850–2859MathSciNetCrossRef
35.
Zurück zum Zitat Sun XM, Liu GP, Rees D, Wang W (2008) Delay-dependent stability for discrete systems with large delay sequence based on switching techniques. Automatica 44(11):2902–2908MathSciNetCrossRef Sun XM, Liu GP, Rees D, Wang W (2008) Delay-dependent stability for discrete systems with large delay sequence based on switching techniques. Automatica 44(11):2902–2908MathSciNetCrossRef
36.
Zurück zum Zitat Xu X, Antsaklis PJ (2002) Optimal control of switched systems via nonlinear optimization based on direct differentiations of value functions. Int J Control 75(16):1406–1426CrossRef Xu X, Antsaklis PJ (2002) Optimal control of switched systems via nonlinear optimization based on direct differentiations of value functions. Int J Control 75(16):1406–1426CrossRef
37.
Zurück zum Zitat Xu X, Antsaklis PJ (2004) Optimal control of switched systems based on parameterization of the switching instants. IEEE Trans Autom Control 49(1):2–16MathSciNetCrossRef Xu X, Antsaklis PJ (2004) Optimal control of switched systems based on parameterization of the switching instants. IEEE Trans Autom Control 49(1):2–16MathSciNetCrossRef
38.
Zurück zum Zitat Seatzu C, Corona D, Giua A, Bemporad A (2006) Optimal control of continuous-time switched affine systems. IEEE Trans Autom Control 51(5):726–741MathSciNetCrossRef Seatzu C, Corona D, Giua A, Bemporad A (2006) Optimal control of continuous-time switched affine systems. IEEE Trans Autom Control 51(5):726–741MathSciNetCrossRef
39.
Zurück zum Zitat Egerstedt M, Wardi Y, Axelsson H (2006) Transition-time optimization for switched-mode dynamical systems. IEEE Trans Autom Control 51(1):110–115MathSciNetCrossRef Egerstedt M, Wardi Y, Axelsson H (2006) Transition-time optimization for switched-mode dynamical systems. IEEE Trans Autom Control 51(1):110–115MathSciNetCrossRef
40.
Zurück zum Zitat Vrabie D, Pastravanu O, Abu-Khalaf M, Lewis FL (2009) Adaptive optimal control for continuous-time linear systems based on policy iteration. Automatica 45(2):477–484MathSciNetCrossRef Vrabie D, Pastravanu O, Abu-Khalaf M, Lewis FL (2009) Adaptive optimal control for continuous-time linear systems based on policy iteration. Automatica 45(2):477–484MathSciNetCrossRef
41.
Zurück zum Zitat Lee JY, Park JB et al (2012) Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems. Automatica 48(11):2850–2859MathSciNetCrossRef Lee JY, Park JB et al (2012) Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems. Automatica 48(11):2850–2859MathSciNetCrossRef
42.
Zurück zum Zitat Lewis FL, Vrabie D, Syrmos V (2012) Optimal control, 3rd edn. Wiley, New YorkCrossRef Lewis FL, Vrabie D, Syrmos V (2012) Optimal control, 3rd edn. Wiley, New YorkCrossRef
43.
Zurück zum Zitat Kleinman D (1968) On an iterative technique for Riccati equation computations. IEEE Trans Autom Control 13(1):114–115CrossRef Kleinman D (1968) On an iterative technique for Riccati equation computations. IEEE Trans Autom Control 13(1):114–115CrossRef
44.
Zurück zum Zitat Liu D, Li H, Wang D (2014) Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics. IEEE Trans Syst Man Cybern Syst 44(8):1015–1027CrossRef Liu D, Li H, Wang D (2014) Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics. IEEE Trans Syst Man Cybern Syst 44(8):1015–1027CrossRef
Metadaten
Titel
Optimal tracking control of switched systems applied in grid-connected hybrid generation using reinforcement learning
verfasst von
Jiayue Sun
Huaguang Zhang
Yingchun Wang
Mingrui Fu
Publikationsdatum
04.02.2021
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 15/2021
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-021-05696-2

Weitere Artikel der Ausgabe 15/2021

Neural Computing and Applications 15/2021 Zur Ausgabe