Skip to main content
Top

2023 | OriginalPaper | Chapter

Reinforcement Control for Planar Robot Based on Neural Network and Extended State Observer

Authors : Duy Nguyen Trung, Thien Nguyen Van, Hai Xuan Le, Dung Do Manh, Duy Hoang

Published in: Intelligent Systems and Networks

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Based on the cooperation of neural network and extended state observer (ESO), in this paper, an approach for reinforcement control will be presented for Planar robot. By using the sliding surface as a state variable, the nominal system in quadratic form will be converted to first-order where the total uncertain component is estimated and remove by ESO. Then, a reinforcement algorithm will be added in collaboration to determine the nearly optimal solution of Hamilton-Jacobi-Bellman (HJB) equation. During the determination of the control signal, only one neural network is applied to reduce the computational complexity while still achieving the desired requirements. The simulation results of the algorithm will be examined on the Planar Robot with two degrees of freedom, thereby confirming the effectiveness of proposed control strategy.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Blatnický, M., Dizo, J., Gerlici, J., Sága, M., Lack, T., Kuba, E.: Design of a robotic manipulator for handling products of automotive industry. Int. J. Adv. Robot. Syst. 17(1) (2020) Blatnický, M., Dizo, J., Gerlici, J., Sága, M., Lack, T., Kuba, E.: Design of a robotic manipulator for handling products of automotive industry. Int. J. Adv. Robot. Syst. 17(1) (2020)
2.
go back to reference Li, H., Liu, C.L., Zhang, Y., Chen, Y.Y.: Adaptive neural networks-based fixed-time fault-tolerant consensus tracking for uncertain multiple Euler-Lagrange systems. ISA Trans. 129, 102–113 (2021) Li, H., Liu, C.L., Zhang, Y., Chen, Y.Y.: Adaptive neural networks-based fixed-time fault-tolerant consensus tracking for uncertain multiple Euler-Lagrange systems. ISA Trans. 129, 102–113 (2021)
4.
go back to reference Shao, K., Tang, R., Xu, F., Wang, X., Zheng, J.: Adaptive sliding mode control for uncertain Euler-Lagrange systems with input saturation. J. Franklin Inst. 358(16), 8356–8376 (2021)MathSciNetCrossRefMATH Shao, K., Tang, R., Xu, F., Wang, X., Zheng, J.: Adaptive sliding mode control for uncertain Euler-Lagrange systems with input saturation. J. Franklin Inst. 358(16), 8356–8376 (2021)MathSciNetCrossRefMATH
5.
go back to reference Ramírez-Neria, M., Madonski, R., Luviano-Juárez, A., Gao, Z., Sira-Ramírez, H.: Design of ADRC for second-order mechanical systems without time-derivatives in the tracking controller. In: 2020 American Control Conference (ACC), (2020) Ramírez-Neria, M., Madonski, R., Luviano-Juárez, A., Gao, Z., Sira-Ramírez, H.: Design of ADRC for second-order mechanical systems without time-derivatives in the tracking controller. In: 2020 American Control Conference (ACC), (2020)
7.
go back to reference Zhang, Z., Leibold, M., Wollherr, D.: Integral sliding-mode observer-based disturbance estimation for Euler-Lagrangian systems. IEEE Trans. Control Syst. Technol. 28(6), 2377–2389 (2019)CrossRef Zhang, Z., Leibold, M., Wollherr, D.: Integral sliding-mode observer-based disturbance estimation for Euler-Lagrangian systems. IEEE Trans. Control Syst. Technol. 28(6), 2377–2389 (2019)CrossRef
8.
go back to reference Zhou, Q., Zha, S., Li, R., Lu, R., Wu, C.: Adaptive neural network tracking control for robotic manipulators with dead zone. IEEE Trans. Neural Netw. Learn. Syst. 30(12), 3611 - 3620 (2018) Zhou, Q., Zha, S., Li, R., Lu, R., Wu, C.: Adaptive neural network tracking control for robotic manipulators with dead zone. IEEE Trans. Neural Netw. Learn. Syst. 30(12), 3611 - 3620 (2018)
9.
go back to reference Jouila, A., Nouri, K.: An adaptive robust nonsingular fast terminal sliding mode controller based on wavelet neural network for a 2-DOF robotic arm. J. Franklin Inst. 357(18), 13259–13282 (2020)MathSciNetCrossRefMATH Jouila, A., Nouri, K.: An adaptive robust nonsingular fast terminal sliding mode controller based on wavelet neural network for a 2-DOF robotic arm. J. Franklin Inst. 357(18), 13259–13282 (2020)MathSciNetCrossRefMATH
10.
go back to reference Ioannou, P., Fidan, B.: Advances in Design and Control, Adaptive Control Tutorizal. SIAM, PA (2006)CrossRefMATH Ioannou, P., Fidan, B.: Advances in Design and Control, Adaptive Control Tutorizal. SIAM, PA (2006)CrossRefMATH
11.
go back to reference Tang, L., Liu, Y.J., Tong, S.: Adaptive neural control using reinforcement learning for a class of robot manipulator. Neural Comput. Appl. 25, 135–141 (2014)CrossRef Tang, L., Liu, Y.J., Tong, S.: Adaptive neural control using reinforcement learning for a class of robot manipulator. Neural Comput. Appl. 25, 135–141 (2014)CrossRef
12.
go back to reference Lewis, F.L., Vrabie, D.: Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits Syst. Mag. 9(3), 32–50 (2009)CrossRef Lewis, F.L., Vrabie, D.: Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits Syst. Mag. 9(3), 32–50 (2009)CrossRef
13.
go back to reference Dupree, K., Patre, P.M., Wilcox, Z.D., Dixon, W.E.: Asymptotic optimal control of uncertain nonlinear Euler-Lagrange systems. Automatica 1, 99–107 (2011)MathSciNetCrossRefMATH Dupree, K., Patre, P.M., Wilcox, Z.D., Dixon, W.E.: Asymptotic optimal control of uncertain nonlinear Euler-Lagrange systems. Automatica 1, 99–107 (2011)MathSciNetCrossRefMATH
14.
15.
go back to reference Vamvoudakis, K.G., Lewis, F.L.: Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica, 46, 878–888, (2010) Vamvoudakis, K.G., Lewis, F.L.: Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica, 46, 878–888, (2010)
17.
go back to reference Luy, N.T.: Reinforecement learning-based optimal tracking control for wheeled mobile robot. In: Proceedings of IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER), pp. 371-376, May 2012 Luy, N.T.: Reinforecement learning-based optimal tracking control for wheeled mobile robot. In: Proceedings of IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER), pp. 371-376, May 2012
Metadata
Title
Reinforcement Control for Planar Robot Based on Neural Network and Extended State Observer
Authors
Duy Nguyen Trung
Thien Nguyen Van
Hai Xuan Le
Dung Do Manh
Duy Hoang
Copyright Year
2023
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-99-4725-6_62

Premium Partner