Journal of Economic Interaction and Coordination

20.10.2017 | Regular Article

Emergence of anti-coordination through reinforcement learning in generalized minority games

Authors: Anindya S. Chakrabarti, Diptesh Ghosh

Published in: Journal of Economic Interaction and Coordination | Issue 2/2019

Abstract

In this paper, we propose adaptive strategies to resolve coordination failures in a prototype generalized minority game with a multi-agent, multi-choice environment. We illustrate the model with an application to large-scale distributed processing systems with many agents and servers. In our setup, agents are responsible for completing tasks that each require unit time, and they request servers to process these tasks. A server can process only one task at a time. Agents choose servers independently and simultaneously, and each agent has access only to the outcomes of its own past requests. A coordination failure occurs when more than one agent requests the same server at the same time while other servers remain idle. Because agents act independently, this leads to multiple coordination failures. We propose strategies based on reinforcement learning that minimize such failures. We also prove a null result: a large category of probabilistic strategies that attempt to combine information about other agents’ strategies converges asymptotically to uniformly random choices over the servers.
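The setup described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' strategy, which the abstract does not specify. It assumes a simple win-stay/lose-shift rule as a stand-in for learning from one's own past outcomes: an agent whose request was served returns to the same server, and an agent whose request was rejected picks a new server uniformly at random. The function names, the random tie-breaking at a contested server, and the parameter values are all hypothetical. For comparison, it also computes the expected fraction of busy servers under uniformly random choice, which is 1 - (1 - 1/M)^N for N agents and M servers.

```python
import random


def expected_random_utilization(n_agents, n_servers):
    """Expected fraction of busy servers when every agent picks uniformly at random."""
    # A given server stays idle only if all agents pick some other server.
    return 1.0 - (1.0 - 1.0 / n_servers) ** n_agents


def simulate_win_stay_lose_shift(n_agents, n_servers, rounds, seed=0):
    """Return the fraction of busy servers in each round (illustrative rule only)."""
    rng = random.Random(seed)
    # Initial requests are uniformly random; agents know nothing about each other.
    choices = [rng.randrange(n_servers) for _ in range(n_agents)]
    utilization = []
    for _ in range(rounds):
        # Group this round's simultaneous requests by server.
        requests = {}
        for agent, server in enumerate(choices):
            requests.setdefault(server, []).append(agent)
        utilization.append(len(requests) / n_servers)
        for server, agents in requests.items():
            # Assumption: a contested server serves one requester chosen at random.
            winner = rng.choice(agents)
            for agent in agents:
                if agent != winner:
                    # Rejected agents use only their own outcome: shift at random.
                    choices[agent] = rng.randrange(n_servers)
                # The served agent keeps its choice for the next round.
    return utilization


if __name__ == "__main__":
    history = simulate_win_stay_lose_shift(50, 50, 2000, seed=1)
    print("random baseline:", expected_random_utilization(50, 50))
    print("first round:", history[0], "last round:", history[-1])
```

Under this rule every busy server keeps one returning agent, so the number of busy servers never decreases from one round to the next. This is only meant to show how feedback limited to an agent's own requests can reduce collisions relative to the uniformly random baseline.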

Title: Emergence of anti-coordination through reinforcement learning in generalized minority games
Authors: Anindya S. Chakrabarti, Diptesh Ghosh
Publication date: 20.10.2017
Publisher: Springer Berlin Heidelberg
Published in: Journal of Economic Interaction and Coordination / Issue 2/2019
Print ISSN: 1860-711X
Electronic ISSN: 1860-7128
DOI: https://doi.org/10.1007/s11403-017-0204-5
