Skip to main content

2016 | OriginalPaper | Buchkapitel

Distributed Optimal Flocking Design for Multi-agent Two-Player Zero-Sum Games with Unknown System Dynamics and Disturbance

verfasst von : Hao Xu, Luis Rodolfo Garcia Carrillo

Erschienen in: Advances in Visual Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, distributed flocking strategies have been exploited for multi-agent two-player zero-sum games. Two main challenges are addressed, i.e. (a) handling system uncertainties and disturbances, and (b) achieving optimality. Adopting the emerging Approximate Dynamic Programming (ADP) technology, a novel distributed adaptive flocking design is proposed to optimize the multi-agent two-player zero-sum games even when the system dynamics and disturbances are unknown. First, to evaluate the multi-agent flocking performance and effects from disturbances, a novel flocking cost function is developed. Next, an innovative type of online neural network (NN) based identifier is proposed to approximate the multi-agent zero-sum game system dynamics effectively. Subsequently, another novel neural network (NN) is proposed to approximate the optimal flocking cost function by using the Hamilton-Jacobi-Isaacs (HJI) equation in a forward in time manner. Moreover, a novel additional term is designed and included into the NN update law to relax the stringent requirement of initial admissible control. Eventually, the distributed adaptive optimal flocking design is obtained by using the learnt Multi-agent zero-sum games system dynamics and approximated optimal flocking cost function. Simulation results demonstrate the effectiveness of proposed scheme.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Reynolds, C.W.: Flocks, herds, and schools: a distributed behavioral model. Comput. Graph. 21, 25–34 (1986)CrossRef Reynolds, C.W.: Flocks, herds, and schools: a distributed behavioral model. Comput. Graph. 21, 25–34 (1986)CrossRef
2.
Zurück zum Zitat Saber, R.O.: Flocking for multi-agent dynamic systems: algorithms and theory. IEEE Trans. Autom. Control 51, 401–420 (2006)MathSciNetCrossRef Saber, R.O.: Flocking for multi-agent dynamic systems: algorithms and theory. IEEE Trans. Autom. Control 51, 401–420 (2006)MathSciNetCrossRef
3.
Zurück zum Zitat Wang, Q., Fang, H., Chen, J., Mao, Y., Dou, L.: Flocking with obstacle avoidance and connectivity maintenance in multi-agent systems. In: Proceedings of IEEE Control and Decision Conference, pp. 4009–4014 (2012) Wang, Q., Fang, H., Chen, J., Mao, Y., Dou, L.: Flocking with obstacle avoidance and connectivity maintenance in multi-agent systems. In: Proceedings of IEEE Control and Decision Conference, pp. 4009–4014 (2012)
4.
Zurück zum Zitat Dragan, V., Morozan, R.: Global solution to game-theoretic riccati equation of stochastic control. J. Diff. Equ. 138, 328–350 (1997)MathSciNetCrossRefMATH Dragan, V., Morozan, R.: Global solution to game-theoretic riccati equation of stochastic control. J. Diff. Equ. 138, 328–350 (1997)MathSciNetCrossRefMATH
5.
Zurück zum Zitat Wang, J., Xin, M.: Integrated optimal formation control of multiple unmanned aerial vehicles. IEEE Trans. Control Syst. Tech. 21, 1731–1744 (2013)CrossRef Wang, J., Xin, M.: Integrated optimal formation control of multiple unmanned aerial vehicles. IEEE Trans. Control Syst. Tech. 21, 1731–1744 (2013)CrossRef
6.
Zurück zum Zitat Lewis, F.L., Vrabie, D., Syrmos, V.L.: Optimal Control, 3rd edn. Wiley, New York (2012)CrossRefMATH Lewis, F.L., Vrabie, D., Syrmos, V.L.: Optimal Control, 3rd edn. Wiley, New York (2012)CrossRefMATH
7.
Zurück zum Zitat Bertsekas, D.P., Tsitsiklis, J.: Neuro-Dynamic Programming. Athena Scientific, CA (1996)MATH Bertsekas, D.P., Tsitsiklis, J.: Neuro-Dynamic Programming. Athena Scientific, CA (1996)MATH
8.
Zurück zum Zitat Al-Tamimi, A., Lewis, F.L., Abu-Khalaf, M.: Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control. Automatica 3, 471–481 (2007)MathSciNetMATH Al-Tamimi, A., Lewis, F.L., Abu-Khalaf, M.: Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control. Automatica 3, 471–481 (2007)MathSciNetMATH
9.
Zurück zum Zitat Dierks, T., Jagannathan, S.: Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update. IEEE Trans. Neural Netw. Learn. Syst. 23, 1118–1129 (2012)CrossRef Dierks, T., Jagannathan, S.: Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update. IEEE Trans. Neural Netw. Learn. Syst. 23, 1118–1129 (2012)CrossRef
10.
Zurück zum Zitat Diestel, R.: Graph Theory. Graduate Texts in Mathematics, vol. 184. Springer, Heidelberg (2000)MATH Diestel, R.: Graph Theory. Graduate Texts in Mathematics, vol. 184. Springer, Heidelberg (2000)MATH
11.
Zurück zum Zitat Jagannathan, S.: Neural Network Control of Nonlinear Discrete-Time Systems. CRC Press, FL (2006)MATH Jagannathan, S.: Neural Network Control of Nonlinear Discrete-Time Systems. CRC Press, FL (2006)MATH
Metadaten
Titel
Distributed Optimal Flocking Design for Multi-agent Two-Player Zero-Sum Games with Unknown System Dynamics and Disturbance
verfasst von
Hao Xu
Luis Rodolfo Garcia Carrillo
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-50835-1_54