Skip to main content
Top
Published in: Arabian Journal for Science and Engineering 3/2020

19-09-2019 | Research Article - Electrical Engineering

Genetic Algorithm-Optimized Fuzzy Lyapunov Reinforcement Learning for Nonlinear Systems

Authors: Amit Kukker, Rajneesh Sharma

Published in: Arabian Journal for Science and Engineering | Issue 3/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper aims to introduce nonlinear optimization in the fuzzy reinforcement learning (RL) approach through genetic algorithm (GA)-based minimization. In conventional fuzzy RL, an agent attempts to find most optimal action at each stage by choosing an action having the lowest Q value or the greedy action. However, Q function is an unknown function and an attempt to find minima of such a function based on a limited set of values, in our view, is inaccurate and insufficient. A more rigorous approach would be to employ a nonlinear optimization procedure for finding minima of the Q function. We propose to employ genetic algorithm for finding optimal action value in each iteration of the algorithm rather than plain algebraic minimum. For guaranteed stability of the designed controller, we use Lyapunov theory-based fuzzy RL control with GA optimizer. We validate the performance of our controller on three benchmark nonlinear NL control problems: (1) inverted pendulum swing up, (2) cart pole balance, and (3) rotational/translational proof-mass actuator system. We carry out comparative evaluation of our controller against: (1) hybrid Lyapunov fuzzy RL control and (2) fuzzy Q learning control. Results show that our proposed GA-optimized fuzzy Lyapunov RL controller is able to achieve a high success rate with stable and superior tracking performance.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Russell, S.J.; Norvig, P.: Artificial Intelligence: A Modern Approach. Pearson Education Limited, Kuala Lumpur (2016)MATH Russell, S.J.; Norvig, P.: Artificial Intelligence: A Modern Approach. Pearson Education Limited, Kuala Lumpur (2016)MATH
2.
go back to reference Sammut, C.; Webb, G.I.: Encyclopedia of Machine Learning and Data Mining. Springer, Berlin (2017)CrossRef Sammut, C.; Webb, G.I.: Encyclopedia of Machine Learning and Data Mining. Springer, Berlin (2017)CrossRef
3.
go back to reference Mars, P.: Learning Algorithms: Theory and Applications in Signal Processing, Control and Communications. CRC Press, Boca Raton (2018)CrossRef Mars, P.: Learning Algorithms: Theory and Applications in Signal Processing, Control and Communications. CRC Press, Boca Raton (2018)CrossRef
4.
go back to reference Silver, D.; et al.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587), 484 (2016)CrossRef Silver, D.; et al.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587), 484 (2016)CrossRef
5.
go back to reference Wang, H.; et al.: Adaptive neural tracking control for a class of nonlinear systems with dynamic uncertainties. IEEE Trans. Cybern. 47(10), 3075–3087 (2017)CrossRef Wang, H.; et al.: Adaptive neural tracking control for a class of nonlinear systems with dynamic uncertainties. IEEE Trans. Cybern. 47(10), 3075–3087 (2017)CrossRef
6.
go back to reference Baghya Shree, S.; Kamaraj, N.: Hybrid neuro fuzzy approach for automatic generation control in restructured power system. Int. J. Electr. Power Energy Syst. 74, 274–285 (2016)CrossRef Baghya Shree, S.; Kamaraj, N.: Hybrid neuro fuzzy approach for automatic generation control in restructured power system. Int. J. Electr. Power Energy Syst. 74, 274–285 (2016)CrossRef
7.
go back to reference Mannion, P. et al.: Dynamic economic emissions dispatch optimisation using multi-agent reinforcement learning. In: Proceedings of the Adaptive and Learning Agents Workshop (at AAMAS 2016). (2016) Mannion, P. et al.: Dynamic economic emissions dispatch optimisation using multi-agent reinforcement learning. In: Proceedings of the Adaptive and Learning Agents Workshop (at AAMAS 2016). (2016)
8.
go back to reference Liu, Y.; et al.: Partial-nodes-based state estimation for complex Networks with unbounded distributed delays. IEEE Trans. Neural Netw. Learn. Syst. 29(8), 3906–3912 (2018)MathSciNetCrossRef Liu, Y.; et al.: Partial-nodes-based state estimation for complex Networks with unbounded distributed delays. IEEE Trans. Neural Netw. Learn. Syst. 29(8), 3906–3912 (2018)MathSciNetCrossRef
9.
go back to reference Rubio, J.J.: Modified optimal control with a back propagation network for robotic arms. IET Control Theory Appl. 6(14), 2216–2225 (2012)MathSciNetCrossRef Rubio, J.J.: Modified optimal control with a back propagation network for robotic arms. IET Control Theory Appl. 6(14), 2216–2225 (2012)MathSciNetCrossRef
10.
go back to reference de Jes´us Rubio, J.: SOFMLS: online self-organizing fuzzy modified least-squares network. IEEE Trans. Fuzzy Syst. 17(6), 1296–1309 (2009)CrossRef de Jes´us Rubio, J.: SOFMLS: online self-organizing fuzzy modified least-squares network. IEEE Trans. Fuzzy Syst. 17(6), 1296–1309 (2009)CrossRef
11.
go back to reference Li, X.; et al.: Assessing information security risk for an evolving smart city based on fuzzy and grey FMEA. J Intell Fuzzy Syst. 34(4), 2491–2501 (2018)CrossRef Li, X.; et al.: Assessing information security risk for an evolving smart city based on fuzzy and grey FMEA. J Intell Fuzzy Syst. 34(4), 2491–2501 (2018)CrossRef
12.
go back to reference de Jes´us Rubio, J.; et al.: Neural network updating via argument Kalman filter for modeling of Takagi–Sugeno fuzzy models. J. Intell. Fuzzy Syst. 35(2), 2585–2596 (2018)CrossRef de Jes´us Rubio, J.; et al.: Neural network updating via argument Kalman filter for modeling of Takagi–Sugeno fuzzy models. J. Intell. Fuzzy Syst. 35(2), 2585–2596 (2018)CrossRef
14.
go back to reference Yang, Z.; Ce, L.; Lian, L.: Electricity price forecasting by a hybrid model, combining wavelet transform, ARMA and kernel-based extreme learning machine methods. Appl. Energy 190, 291–305 (2017)CrossRef Yang, Z.; Ce, L.; Lian, L.: Electricity price forecasting by a hybrid model, combining wavelet transform, ARMA and kernel-based extreme learning machine methods. Appl. Energy 190, 291–305 (2017)CrossRef
15.
go back to reference Mohri, M.; Rostamizadeh, A.; Talwalkar, A.: Foundations of Machine Learning. MIT Press, Cambridge (2012)MATH Mohri, M.; Rostamizadeh, A.; Talwalkar, A.: Foundations of Machine Learning. MIT Press, Cambridge (2012)MATH
16.
go back to reference Sharma, R.: Lyapunov theory based stable Markov game fuzzy control for non-linear systems. Eng. Appl. Artif. Intell. 55, 119–127 (2016)CrossRef Sharma, R.: Lyapunov theory based stable Markov game fuzzy control for non-linear systems. Eng. Appl. Artif. Intell. 55, 119–127 (2016)CrossRef
17.
go back to reference Kumar, A.; Sharma, R.: Fuzzy lyapunov reinforcement learning for non linear systems. ISA Trans. 67, 151–159 (2017)CrossRef Kumar, A.; Sharma, R.: Fuzzy lyapunov reinforcement learning for non linear systems. ISA Trans. 67, 151–159 (2017)CrossRef
18.
go back to reference Aguilar-Ibanez, C.: A constructive Lyapunov function for controlling the inverted pendulum. In: American Control Conference, Westin Seattle Hotel, Seattle, Washington, USA, 11–13 June 2008 Aguilar-Ibanez, C.: A constructive Lyapunov function for controlling the inverted pendulum. In: American Control Conference, Westin Seattle Hotel, Seattle, Washington, USA, 11–13 June 2008
19.
go back to reference Zhang, Q.; et al.: Energy-efficient scheduling for real-time systems based on deep Q-learning model. IEEE Trans. Sustain. Comput. 4, 132–141 (2017)CrossRef Zhang, Q.; et al.: Energy-efficient scheduling for real-time systems based on deep Q-learning model. IEEE Trans. Sustain. Comput. 4, 132–141 (2017)CrossRef
20.
go back to reference Deng, Y.; et al.: Deep direct reinforcement learning for financial signal representation and trading. IEEE Trans. Neural Netw. Learn. Syst. 28(3), 653–664 (2017)CrossRef Deng, Y.; et al.: Deep direct reinforcement learning for financial signal representation and trading. IEEE Trans. Neural Netw. Learn. Syst. 28(3), 653–664 (2017)CrossRef
21.
go back to reference Such, F.P., et al.: Deep neuroevolution: genetic algorithms are a competitive alternative for training deep neural networks for reinforcement learning (2017). arXiv preprint arXiv:1712.06567 Such, F.P., et al.: Deep neuroevolution: genetic algorithms are a competitive alternative for training deep neural networks for reinforcement learning (2017). arXiv preprint arXiv:​1712.​06567
Metadata
Title
Genetic Algorithm-Optimized Fuzzy Lyapunov Reinforcement Learning for Nonlinear Systems
Authors
Amit Kukker
Rajneesh Sharma
Publication date
19-09-2019
Publisher
Springer Berlin Heidelberg
Published in
Arabian Journal for Science and Engineering / Issue 3/2020
Print ISSN: 2193-567X
Electronic ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-019-04126-9

Other articles of this Issue 3/2020

Arabian Journal for Science and Engineering 3/2020 Go to the issue

Premium Partners