Skip to main content
Erschienen in: Group Decision and Negotiation 2/2021

13.01.2020

Game Adaptation by Using Reinforcement Learning Over Meta Games

verfasst von: Simão Reis, Luís Paulo Reis, Nuno Lau

Erschienen in: Group Decision and Negotiation | Ausgabe 2/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this work, we propose a Dynamic Difficulty Adjustment methodology to achieve automatic video game balance. The balance task is modeled as a meta game, a game where actions change the rules of another base game. Based on the model of Reinforcement Learning (RL), an agent assumes the role of a game master and learns its optimal policy by playing the meta game. In this new methodology we extend traditional RL by adding the existence of a meta environment whose state transition depends on the evolution of a base environment. In addition, we propose a Multi Agent System training model for the game master agent, where it plays against multiple agent opponents, each with a distinct behavior and proficiency level while playing the base game. Our experiment is conducted on an adaptive grid-world environment in singleplayer and multiplayer scenarios. Our results are expressed in twofold: (i) the resulting decision making by the game master through gameplay, which must comply in accordance to an established balance objective by the game designer; (ii) the initial conception of a framework for automatic game balance, where the balance task design is reduced to the modulation of a reward function (balance reward), an action space (balance strategies) and the definition of a balance space state.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Altimira D, Mueller FF, Lee G, Clarke J, Billinghurst M (2014) Towards understanding balancing in exertion games. In: Proceedings of the 11th conference on advances in computer entertainment technology, ACM, New York, NY, USA, ACE ’14, pp 10:1–10:8. https://doi.org/10.1145/2663806.2663838 Altimira D, Mueller FF, Lee G, Clarke J, Billinghurst M (2014) Towards understanding balancing in exertion games. In: Proceedings of the 11th conference on advances in computer entertainment technology, ACM, New York, NY, USA, ACE ’14, pp 10:1–10:8. https://​doi.​org/​10.​1145/​2663806.​2663838
Zurück zum Zitat Andrade G, Ramalho G, Santana H, Corruble V (2005) Extending reinforcement learning to provide dynamic game balancing, pp 7–12 Andrade G, Ramalho G, Santana H, Corruble V (2005) Extending reinforcement learning to provide dynamic game balancing, pp 7–12
Zurück zum Zitat Burke JW, McNeill MDJ, Charles DK, Morrow PJ, Crosbie JH, McDonough SM (2010) Augmented reality games for upper-limb stroke rehabilitation. In: 2010 second international conference on games and virtual worlds for serious applications, pp 75–78. https://doi.org/10.1109/VS-GAMES.2010.21 Burke JW, McNeill MDJ, Charles DK, Morrow PJ, Crosbie JH, McDonough SM (2010) Augmented reality games for upper-limb stroke rehabilitation. In: 2010 second international conference on games and virtual worlds for serious applications, pp 75–78. https://​doi.​org/​10.​1109/​VS-GAMES.​2010.​21
Zurück zum Zitat Cechanowicz JE, Gutwin C, Bateman S, Mandryk R, Stavness I (2014) Improving player balancing in racing games. In: Proceedings of the first ACM SIGCHI annual symposium on computer–human interaction in play, ACM, New York, NY, USA, CHI PLAY ’14, pp 47–56. https://doi.org/10.1145/2658537.2658701 Cechanowicz JE, Gutwin C, Bateman S, Mandryk R, Stavness I (2014) Improving player balancing in racing games. In: Proceedings of the first ACM SIGCHI annual symposium on computer–human interaction in play, ACM, New York, NY, USA, CHI PLAY ’14, pp 47–56. https://​doi.​org/​10.​1145/​2658537.​2658701
Zurück zum Zitat Hunicke R (2005) The case for dynamic difficulty adjustment in games. In: Proceedings of the 2005 ACM SIGCHI international conference on advances in computer entertainment technology, ACM, New York, NY, USA, ACE ’05, pp 429–433. https://doi.org/10.1145/1178477.1178573 Hunicke R (2005) The case for dynamic difficulty adjustment in games. In: Proceedings of the 2005 ACM SIGCHI international conference on advances in computer entertainment technology, ACM, New York, NY, USA, ACE ’05, pp 429–433. https://​doi.​org/​10.​1145/​1178477.​1178573
Zurück zum Zitat Jaderberg M, Czarnecki WM, Dunning I, Marris L, Lever G, Castañeda AG, Beattie C, Rabinowitz NC, Morcos AS, Ruderman A, Sonnerat N, Green T, Deason L, Leibo JZ, Silver D, Hassabis D, Kavukcuoglu K, Graepel T (2018) Human-level performance in first-person multiplayer games with population-based deep reinforcement learning. CoRR arXiv:1807.01281 Jaderberg M, Czarnecki WM, Dunning I, Marris L, Lever G, Castañeda AG, Beattie C, Rabinowitz NC, Morcos AS, Ruderman A, Sonnerat N, Green T, Deason L, Leibo JZ, Silver D, Hassabis D, Kavukcuoglu K, Graepel T (2018) Human-level performance in first-person multiplayer games with population-based deep reinforcement learning. CoRR arXiv:​1807.​01281
Zurück zum Zitat Missura O, Gaertner T (2008) Online adaptive agent for connect four. In: Proceedings of the 4th international conference on games research and development cybergames, pp 1–8 Missura O, Gaertner T (2008) Online adaptive agent for connect four. In: Proceedings of the 4th international conference on games research and development cybergames, pp 1–8
Zurück zum Zitat Missura O, Gärtner T (2009) Player modeling for intelligent difficulty adjustment. In: Gama J, Costa VS, Jorge AM, Brazdil PB (eds) Discovery science. Springer, Berlin, pp 197–211CrossRef Missura O, Gärtner T (2009) Player modeling for intelligent difficulty adjustment. In: Gama J, Costa VS, Jorge AM, Brazdil PB (eds) Discovery science. Springer, Berlin, pp 197–211CrossRef
Zurück zum Zitat Mnih V, Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, Riedmiller MA (2013) Playing atari with deep reinforcement learning. CoRR arXiv:1312.5602 Mnih V, Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, Riedmiller MA (2013) Playing atari with deep reinforcement learning. CoRR arXiv:​1312.​5602
Zurück zum Zitat Mueller F, Vetere F, Gibbs M, Edge D, Agamanolis S, Sheridan J, Heer J (2012) Balancing exertion experiences. In: Proceedings of the SIGCHI conference on human factors in computing systems, ACM, New York, NY, USA, CHI ’12, pp 1853–1862. https://doi.org/10.1145/2207676.2208322 Mueller F, Vetere F, Gibbs M, Edge D, Agamanolis S, Sheridan J, Heer J (2012) Balancing exertion experiences. In: Proceedings of the SIGCHI conference on human factors in computing systems, ACM, New York, NY, USA, CHI ’12, pp 1853–1862. https://​doi.​org/​10.​1145/​2207676.​2208322
Zurück zum Zitat Pratama H, Krisnadhi A (2019) Representing dynamic difficulty in turn-based role playing games using monte carlo tree search. In: 2018 international conference on advanced computer science and information systems, ICACSIS 2018, Institute of Electrical and Electronics Engineers Inc., United States, 2018 international conference on advanced computer science and information systems, ICACSIS 2018, pp 207–212. https://doi.org/10.1109/ICACSIS.2018.8618167 Pratama H, Krisnadhi A (2019) Representing dynamic difficulty in turn-based role playing games using monte carlo tree search. In: 2018 international conference on advanced computer science and information systems, ICACSIS 2018, Institute of Electrical and Electronics Engineers Inc., United States, 2018 international conference on advanced computer science and information systems, ICACSIS 2018, pp 207–212. https://​doi.​org/​10.​1109/​ICACSIS.​2018.​8618167
Zurück zum Zitat Rego PA, Moreira PM, Reis LP (2011) Natural user interfaces in serious games for rehabilitation. In: 6th Iberian conference on information systems and technologies (CISTI 2011), pp 1–4 Rego PA, Moreira PM, Reis LP (2011) Natural user interfaces in serious games for rehabilitation. In: 6th Iberian conference on information systems and technologies (CISTI 2011), pp 1–4
Zurück zum Zitat Reis S, Reis LP, Lau N (2019) Automatic generation of a sub-optimal agent population with learning. In: Rocha Á, Adeli H, Reis LP, Costanzo S (eds) New knowledge in information systems and technologies. Springer, Cham, pp 65–74CrossRef Reis S, Reis LP, Lau N (2019) Automatic generation of a sub-optimal agent population with learning. In: Rocha Á, Adeli H, Reis LP, Costanzo S (eds) New knowledge in information systems and technologies. Springer, Cham, pp 65–74CrossRef
Zurück zum Zitat Silver D, Hubert T, Schrittwieser J, Antonoglou I, Lai M, Guez A, Lanctot M, Sifre L, Kumaran D, Graepel T, et al. (2017) Mastering chess and shogi by self-play with a general reinforcement learning algorithm. Preprint arXiv:171201815 Silver D, Hubert T, Schrittwieser J, Antonoglou I, Lai M, Guez A, Lanctot M, Sifre L, Kumaran D, Graepel T, et al. (2017) Mastering chess and shogi by self-play with a general reinforcement learning algorithm. Preprint arXiv:​171201815
Zurück zum Zitat Simões D, Lau N, Reis LP (2018a) Adjusted bounded weighted policy learner. In: RoboCup international symposium 2018 Simões D, Lau N, Reis LP (2018a) Adjusted bounded weighted policy learner. In: RoboCup international symposium 2018
Zurück zum Zitat Simões D, Lau N, Reis LP (2018b) Mixed-policy asynchronous deep q-learning. In: Ollero A, Sanfeliu A, Montano L, Lau N, Cardeira C (eds) ROBOT 2017: third iberian robotics conference. Springer, Cham, pp 129–140CrossRef Simões D, Lau N, Reis LP (2018b) Mixed-policy asynchronous deep q-learning. In: Ollero A, Sanfeliu A, Montano L, Lau N, Cardeira C (eds) ROBOT 2017: third iberian robotics conference. Springer, Cham, pp 129–140CrossRef
Zurück zum Zitat Tuyls K, Weiss G (2012) Multiagent learning: basics, challenges, and prospects. AI Mag 33(3):41–52 Tuyls K, Weiss G (2012) Multiagent learning: basics, challenges, and prospects. AI Mag 33(3):41–52
Zurück zum Zitat Zhang H, Wang J, Zhou Z, Zhang W, Wen Y, Yu Y, Li W (2017) Learning to design games: strategic environments in deep reinforcement learning. CoRR arXiv:1707.01310 Zhang H, Wang J, Zhou Z, Zhang W, Wen Y, Yu Y, Li W (2017) Learning to design games: strategic environments in deep reinforcement learning. CoRR arXiv:​1707.​01310
Metadaten
Titel
Game Adaptation by Using Reinforcement Learning Over Meta Games
verfasst von
Simão Reis
Luís Paulo Reis
Nuno Lau
Publikationsdatum
13.01.2020
Verlag
Springer Netherlands
Erschienen in
Group Decision and Negotiation / Ausgabe 2/2021
Print ISSN: 0926-2644
Elektronische ISSN: 1572-9907
DOI
https://doi.org/10.1007/s10726-020-09652-8

Weitere Artikel der Ausgabe 2/2021

Group Decision and Negotiation 2/2021 Zur Ausgabe

Premium Partner