Published in: Journal of Economic Interaction and Coordination, Issue 2/2019

20 October 2017 | Regular Article

Emergence of anti-coordination through reinforcement learning in generalized minority games

Authors: Anindya S. Chakrabarti, Diptesh Ghosh

Abstract

In this paper we propose adaptive strategies to resolve coordination failures in a prototype generalized minority game model with a multi-agent, multi-choice environment. We illustrate the model with an application to large-scale distributed processing systems with many agents and servers. In our setup, agents are responsible for completing tasks that each require unit time, and they request servers to process these tasks. A server can process only one task at a time. Agents must choose servers independently and simultaneously, and each agent has access only to the outcomes of its own past requests. A coordination failure occurs when more than one agent requests the same server at the same time while other servers remain idle. Since agents are independent, this leads to multiple coordination failures. We propose strategies based on reinforcement learning that minimize such coordination failures. We also prove a null result: a large category of probabilistic strategies that attempt to combine information about other agents' strategies converges asymptotically to uniformly random choices over the servers.
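The setup described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' strategy: it uses a simple "stay if served, otherwise pick a random server" rule as a stand-in for reinforcement learning, and it assumes that when several agents request the same server, one of them is served at random. The function names (uniform_baseline, simulate) and all parameters are hypothetical. Each agent sees only the outcome of its own previous request, as in the abstract.

```python
import random


def uniform_baseline(n):
    """Expected fraction of busy servers when n agents each pick one of
    n servers uniformly at random: a given server stays idle with
    probability (1 - 1/n) ** n."""
    return 1.0 - (1.0 - 1.0 / n) ** n


def simulate(n, rounds, seed=0):
    """Simulate n agents and n servers for a number of rounds.

    Assumed rule (illustrative only): an agent that was served keeps its
    server; an agent that was not served picks a new server uniformly at
    random. Assumed tie-break: a server with several requests serves one
    requester chosen at random.
    Returns the fraction of busy servers in each round.
    """
    rng = random.Random(seed)
    choice = [rng.randrange(n) for _ in range(n)]  # initial random picks
    history = []
    for _ in range(rounds):
        # Group agents by the server they requested this round.
        requests = {}
        for agent, server in enumerate(choice):
            requests.setdefault(server, []).append(agent)
        # Each requested server serves exactly one of its requesters.
        served = {rng.choice(agents) for agents in requests.values()}
        history.append(len(served) / n)
        # Agents update using only their own outcome.
        choice = [
            choice[a] if a in served else rng.randrange(n)
            for a in range(n)
        ]
    return history


if __name__ == "__main__":
    n = 20
    utilization = simulate(n, rounds=2000)
    print("uniform random baseline:", round(uniform_baseline(n), 3))
    print("first round:", utilization[0], "last round:", utilization[-1])
```

Under these assumptions the fraction of busy servers never decreases, because every requested server keeps the agent it served. The uniformly random baseline, by contrast, leaves roughly a fraction 1/e of the servers idle when the number of agents and servers is large.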

Metadata
Title
Emergence of anti-coordination through reinforcement learning in generalized minority games
Authors
Anindya S. Chakrabarti
Diptesh Ghosh
Publication date
20 October 2017
Publisher
Springer Berlin Heidelberg
Published in
Journal of Economic Interaction and Coordination / Issue 2/2019
Print ISSN: 1860-711X
Electronic ISSN: 1860-7128
DOI
https://doi.org/10.1007/s11403-017-0204-5
