Top

KI - Künstliche Intelligenz

Published in:

01-03-2011 | Fachbeitrag

CadiaPlayer: Search-Control Techniques

Authors: Hilmar Finnsson, Yngvi Björnsson

Published in: KI - Künstliche Intelligenz | Issue 1/2011

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Effective search control is one of the key components of any successful simulation-based game-playing program. In General Game Playing (GGP), learning of useful search-control knowledge is a particularly challenging task because it must be done in real-time during online play. In here we describe the search-control techniques used in the 2010 version of the GGP agent CadiaPlayer, and show how they have evolved over the years to become increasingly effective and robust across a wide range of games. In particular, we present a new combined search-control scheme (RAVE/MAST/FAST) for biasing action selection. The scheme proves quite effective on a wide range of games including chess-like games, which have up until now proved quite challenging for simulation-based GGP agents.

previous article Special Issue on General Game Playing

next article Centurio, a General Game Player: Parallel, Java- and ASP-based

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

KI - Künstliche Intelligenz

The Scientific journal "KI – Künstliche Intelligenz" is the official journal of the division for artificial intelligence within the "Gesellschaft für Informatik e.V." (GI) – the German Informatics Society - with constributions from troughout the field of artificial intelligence.

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Skirmish is a chess-like game. We used the variation played in the 2007 GGP finals where each player has two knights, two bishops, two rooks, and four pawns (can only move by capturing). The objective is to capture as many of the opponent’s pieces as possible. Breakthrough is played on a chess board where the players, armed with two ranks of pawns, try to be the first to break through to reach the opponent’s back rank. The pawns can move forward and diagonally, but can only capture diagonally. Chinook is a variant of Breakthrough played with checkers pawns and the added twist that two independent games are played simultaneously, one on the white squares and another on the black ones. 3D Tic-Tac-Toe and Nine Board Tic-Tac-Toe are variants of Tic-Tac-Toe; the former is played on a 4×4×4 cube, and in the latter 9 Tic-Tac-Toe boards are placed in a 3×3 grid formation, and on each turn a player can play only on the board having the same coordinate as the last cell marked by the opponent (e.g., if the opponent marked a center cell in one of the boards, the player must on his turn play in the Tic-Tac-Toe board in the center). TTCC4 is a hybrid of several games where each player has a chess pawn, a chess knight, and a checkers king—these pieces respawn on their start square if captured. Instead of moving a piece a player can choose to drop a disk onto the board like in Connect 4 (captured disks do not respawn). The goal is to form a 3-in-a-row formation with your pieces anywhere on the 3×3 center squares of the table. Full game descriptions can be found on the Dresden GGP server (http://euklid.inf.tu-dresden.de:8180/ggpserver).

Banerjee B, Stone P (2007) General game learning using knowledge transfer. In: Veloso MM (ed) 20th IJCAI, pp 672–677

Björnsson Y, Finnsson H (2009) CADIAPlayer: a simulation-based general game player. IEEE Trans Comput Intell AI in Games 1(1):4–15 CrossRef

Chaslot G, Winands M, van den Herik JH, Uiterwijk J, Bouzy B (2007) Progressive strategies for Monte-Carlo tree search. In: 10th JCIS, heuristic search and computer game playing session

Clune J (2007) Heuristic evaluation functions for general game playing. In: Holte RC, Howe A (eds) 22nd AAAI. AAAI Press, Menlo Park, pp 1134–1139

Coulom R (2006) Efficient selectivity and backup operators in Monte-Carlo tree search. In: van den Herik HJ, Ciancarini P, Donkers HHLM (eds) CG2006. Springer, Berlin, pp 72–83

Cox E, Schkufza E, Madsen Ryan MG (2009) Factoring general games using propositional automata. In: GIGA’09 the IJCAI workshop on general game playing

Enzenberger M, Müller M (2009) Fuego—an open-source framework for board games and Go engine based on Monte-Carlo tree search. Tech Rep 09-08, Dept of Computing Science, University of Alberta

Finnsson H, Björnsson Y (2008) Simulation-based approach to general game playing. In: Fox D, Gomes CP (eds) 23rd AAAI. AAAI Press, Menlo Park, pp 259–264

Finnsson H, Björnsson Y (2010) Learning simulation-control in general game-playing agents. In: Fox M, Poole D (eds) 24th AAAI. AAAI Press, Menlo Park, pp 954–959

10.

Gelly S, Silver D (2007) Combining online and offline knowledge in UCT. In: Ghahramani Z (ed) 24th ICML, vol 227. ACM, New York, pp 273–280 CrossRef

11.

Gelly S, Wang Y, Munos R, Teytaud O (2006) Modification of UCT with patterns in Monte-Carlo Go. Technical Report 6062, INRIA

12.

Genesereth MR, Love N, Pell B (2005) General game playing: overview of the AAAI competition. AI Mag 26(2):62–72

13.

Günther M, Schiffel S, Thielscher M (2009) Factoring general games. In: GIGA’09 the IJCAI workshop on general game playing

14.

Kirci M, Schaeffer J, Sturtevant N (2009) Feature learning using state differences. In: GIGA’09 the IJCAI workshop on general game playing

15.

Kocsis L, Szepesvári C (2006) Bandit based Monte-Carlo planning. In: ECML, pp 282–293

16.

Kuhlmann G, Dresner K, Stone P (2006) Automatic heuristic construction in a complete general game player. In: 21st AAAI, pp 1457–1462

17.

Kuhlmann G, Stone P (2007) Graph-based domain mapping for transfer learning in general games. In: 18th ECML

18.

Lorentz RJ (2008) Amazons discover Monte-Carlo. In: van den Herik HJ, Xu X, Ma Z, Winands MHM (eds) CG2008. Lecture notes in computer science, vol 5131. Springer, Berlin, pp 13–24

19.

Love N, Hinrichs T, Genesereth M (2006) General game playing: game description language specification. Technical Report, Stanford University, 4 April 2006

20.

Pell B (1996) A strategic metagame player for general chess-like games. Comput Intell 12:177–198 CrossRef

21.

Schiffel S (2010) Symmetry detection in general game playing. In: Fox M, Poole D (eds) 24th AAAI, pp 980–985. AAAI Press, Menlo Park

22.

Schiffel S, Thielscher M (2007) Automatic construction of a heuristic search function for general game playing. In: Veloso MM (ed) 7th IJCAI workshop on nonmontonic reasoning, action and change (NRAC07)

23.

Schiffel S, Thielscher M (2007) Fluxplayer: a successful general game player. In: Holte RC, Howe A (eds) 22nd AAAI, AAAI Press, Menlo Park, pp 1191–1196

24.

Schiffel S, Thielscher M (2009) Automated theorem proving for general game playing. In: Boutilier C (ed) 21st IJCAI. Morgan Kaufmann, San Francisco, pp 911–916

25.

Sharma S, Kobti Z, Goodwin S (2008) Knowledge generation for improving simulations in UCT for general game playing. In: AI 2008: advances in artificial intelligence. Springer, Berlin, pp 49–55 CrossRef

26.

Sutton RS (1988) Learning to predict by the methods of temporal differences. Mach Learn 3:9–44

27.

Thielscher M (2010) A general game description language for incomplete information games. In: Fox M, Poole D (eds) 24th AAAI. AAAI Press, Menlo Park, pp 994–999

28.

Thielscher M, Voigt S (2010) A temporal proof system for general game playing. In: Fox M, Poole D (eds) 24th AAAI. AAAI Press, Menlo Park, pp 1000–1005

29.

Waugh K (2009) Faster state manipulation in general games using generated code. In: GIGA’09 the IJCAI workshop on general game playing

Title: CadiaPlayer: Search-Control Techniques
Authors: Hilmar Finnsson
Yngvi Björnsson
Publication date: 01-03-2011
Publisher: Springer-Verlag
Published in: KI - Künstliche Intelligenz / Issue 1/2011
Print ISSN: 0933-1875
Electronic ISSN: 1610-1987
DOI: https://doi.org/10.1007/s13218-010-0080-9

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

KI - Künstliche Intelligenz

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Other articles of this Issue 1/2011

Anchoring Symbols to Percepts in the Fluent Calculus

Explizites Wissen versus Black Box Ansätze in der KI

Reasoning about Time, Action and Knowledge in Multi-Agent Systems

A Parallel General Game Player

Fusing DL Reasoning with HTN Planning

GDL-II

Premium Partner