Skip to main content
Top

2021 | OriginalPaper | Chapter

An Evolutionary Approach to Combinatorial Gameplaying Using Extended Classifier Systems

Authors : Karmanya Oberoi, Sarthak Tandon, Abhishek Das, Swati Aggarwal

Published in: Applications of Artificial Intelligence and Machine Learning

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Extended classifier system (XCS) is an extension of a popular online rule-based machine learning technique, learning classifier system (LCS), in which a classifier’s fitness is based on its accuracy instead of the prediction itself, and a genetic algorithm (GA) and reinforcement learning (RL) component is utilized for exploratory and learning purposes, respectively. With the emergence of increasingly intricate rule-based learning techniques, there is a need to examine feasible methods of learning that can overcome the challenges posed by complex scenarios while supporting online performance. Checkers is a strategic, combinatorial game having a high branching factor and a complex state space that provides a promising avenue for scrutinizing novel approaches. This paper presents a preliminary investigation into feasibility of XCS in such complex avenues by taking 6 × 6 checkers as a specific case of study. The XCS agent was adapted to this problem, trained with random agent and was able to perform well against the alpha–beta pruning algorithm of various depths as well as human agents of different skill levels (beginner, intermediate and advanced).

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bull L (2004) Learning classifier systems: a brief introduction. In: Applications of learning classifier systems, pp 1–12 Bull L (2004) Learning classifier systems: a brief introduction. In: Applications of learning classifier systems, pp 1–12
2.
go back to reference Snell F (1967) Progress in theoretical biology. Academic Press, New York Snell F (1967) Progress in theoretical biology. Academic Press, New York
3.
go back to reference Holland J (2001) Adaptation in natural and artificial systems. MIT Press, Cambridge Holland J (2001) Adaptation in natural and artificial systems. MIT Press, Cambridge
4.
go back to reference Holland J, Holyoak K, Nisbett R, Thagard P, Smoliar S (1987) Induction: processes of inference, learning, and discovery. IEEE Expert 2:92–93CrossRef Holland J, Holyoak K, Nisbett R, Thagard P, Smoliar S (1987) Induction: processes of inference, learning, and discovery. IEEE Expert 2:92–93CrossRef
5.
go back to reference Wilson S (1995) Classifier fitness based on accuracy. Evol Comput 3:149–175CrossRef Wilson S (1995) Classifier fitness based on accuracy. Evol Comput 3:149–175CrossRef
6.
go back to reference Silver D, Huang A, Maddison C, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, Hassabis D (2016) Mastering the game of Go with deep neural networks and tree search. Nature 529:484–489CrossRef Silver D, Huang A, Maddison C, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, Hassabis D (2016) Mastering the game of Go with deep neural networks and tree search. Nature 529:484–489CrossRef
7.
go back to reference Silver D, Hubert T, Schrittwieser J, Antonoglou I, Lai M, Guez A, Lanctot M, Sifre L, Kumaran D, Graepel T, Lillicrap T, Simonyan K, Hassabis D (2018) A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 362:1140–1144MathSciNetCrossRef Silver D, Hubert T, Schrittwieser J, Antonoglou I, Lai M, Guez A, Lanctot M, Sifre L, Kumaran D, Graepel T, Lillicrap T, Simonyan K, Hassabis D (2018) A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 362:1140–1144MathSciNetCrossRef
8.
go back to reference Dubel ICL, Lefakis L (2006) Checkers reinforcement learning project: AI checkers player Dubel ICL, Lefakis L (2006) Checkers reinforcement learning project: AI checkers player
9.
go back to reference Kwasnicka H, Spirydowicz A (2019) Checkers: TD (λ) learning applied for deterministic game Kwasnicka H, Spirydowicz A (2019) Checkers: TD (λ) learning applied for deterministic game
10.
go back to reference Tesauro G (1995) Temporal difference learning and TD-Gammon. Commun ACM 38:58–68CrossRef Tesauro G (1995) Temporal difference learning and TD-Gammon. Commun ACM 38:58–68CrossRef
11.
go back to reference Eck N, Wezel M (2005) Reinforcement learning and its application to Othello Eck N, Wezel M (2005) Reinforcement learning and its application to Othello
12.
go back to reference Van der Ree M, Wiering M (2013) Reinforcement learning in the game of Othello: learning against a fixed opponent and learning from self-play. In: 2013 IEEE symposium on adaptive dynamic programming and reinforcement learning (ADPRL) Van der Ree M, Wiering M (2013) Reinforcement learning in the game of Othello: learning against a fixed opponent and learning from self-play. In: 2013 IEEE symposium on adaptive dynamic programming and reinforcement learning (ADPRL)
14.
go back to reference Castillo C, Lurgi M, Martinez I (2003) Chimps: an evolutionary reinforcement learning approach for soccer agents. In: SMC’03 conference proceedings. 2003 IEEE international conference on systems, man and cybernetics. Conference theme—system security and assurance (Cat. No. 03CH37483) Castillo C, Lurgi M, Martinez I (2003) Chimps: an evolutionary reinforcement learning approach for soccer agents. In: SMC’03 conference proceedings. 2003 IEEE international conference on systems, man and cybernetics. Conference theme—system security and assurance (Cat. No. 03CH37483)
15.
go back to reference Rudolph S, von Mammen S, Jungbluth J, Hähner J (2016) Design and evaluation of an extended learning classifier-based StarCraft micro AI. In: Applications of evolutionary computation, pp 669–681 Rudolph S, von Mammen S, Jungbluth J, Hähner J (2016) Design and evaluation of an extended learning classifier-based StarCraft micro AI. In: Applications of evolutionary computation, pp 669–681
16.
go back to reference Shafi K, Abbass H (2017) A survey of learning classifier systems in games [review article]. IEEE Comput Intell Mag 12:42–55CrossRef Shafi K, Abbass H (2017) A survey of learning classifier systems in games [review article]. IEEE Comput Intell Mag 12:42–55CrossRef
17.
go back to reference Schaeffer J, Culberson J, Treloar N, Knight B, Lu P, Szafron D (1992) A world championship caliber checkers program. Artif Intell 53:273–289CrossRef Schaeffer J, Culberson J, Treloar N, Knight B, Lu P, Szafron D (1992) A world championship caliber checkers program. Artif Intell 53:273–289CrossRef
18.
go back to reference Schaeffer J, Burch N, Bjornsson Y, Kishimoto A, Muller M, Lake R, Lu P, Sutphen S (2007) Checkers is solved. Science 317:1518–1522MathSciNetCrossRef Schaeffer J, Burch N, Bjornsson Y, Kishimoto A, Muller M, Lake R, Lu P, Sutphen S (2007) Checkers is solved. Science 317:1518–1522MathSciNetCrossRef
19.
go back to reference Browne W, Scott D (2005) An abstraction algorithm for genetics-based reinforcement learning. In: Proceedings of the 2005 conference on genetic and evolutionary computation—GECCO’05 Browne W, Scott D (2005) An abstraction algorithm for genetics-based reinforcement learning. In: Proceedings of the 2005 conference on genetic and evolutionary computation—GECCO’05
20.
go back to reference Jain S, Verma S, Kumar S, Aggarwal S (2018) An evolutionary learning approach to play Othello using XCS. In: 2018 IEEE congress on evolutionary computation (CEC) Jain S, Verma S, Kumar S, Aggarwal S (2018) An evolutionary learning approach to play Othello using XCS. In: 2018 IEEE congress on evolutionary computation (CEC)
21.
go back to reference Clementis L (2013) Model driven classifier evaluation in rule-based system. In: Advances in intelligent systems and computing, pp 267–276 Clementis L (2013) Model driven classifier evaluation in rule-based system. In: Advances in intelligent systems and computing, pp 267–276
22.
go back to reference Urbanowicz R, Moore J (2009) Learning classifier systems: a complete introduction, review, and roadmap. J Artif Evol Appl 2009:1–25 Urbanowicz R, Moore J (2009) Learning classifier systems: a complete introduction, review, and roadmap. J Artif Evol Appl 2009:1–25
23.
go back to reference Butz M, Wilson S (2001) An algorithmic description of XCS. In: Advances in learning classifier systems, pp 253–272 Butz M, Wilson S (2001) An algorithmic description of XCS. In: Advances in learning classifier systems, pp 253–272
24.
go back to reference Wilson S (2000) Get real! XCS with continuous-valued inputs. In: Lecture notes in computer science, pp 209–219 Wilson S (2000) Get real! XCS with continuous-valued inputs. In: Lecture notes in computer science, pp 209–219
25.
go back to reference Browne W (2004) The development of an industrial learning classifier system for data-mining in a steel hop strip mill. In: Applications of learning classifier systems, pp 223–259 Browne W (2004) The development of an industrial learning classifier system for data-mining in a steel hop strip mill. In: Applications of learning classifier systems, pp 223–259
26.
go back to reference Lanzi P (2002) Learning classifier systems from a reinforcement learning perspective. Soft Comput 6:162–170CrossRef Lanzi P (2002) Learning classifier systems from a reinforcement learning perspective. Soft Comput 6:162–170CrossRef
27.
go back to reference Zhong J, Hu X, Zhang J, Gu M (2005) Comparison of performance between different selection strategies on simple genetic algorithms. In: International conference on computational intelligence for modelling, control and automation and international conference on intelligent agents, web technologies and internet commerce (CIMCA-IAWTIC’06) Zhong J, Hu X, Zhang J, Gu M (2005) Comparison of performance between different selection strategies on simple genetic algorithms. In: International conference on computational intelligence for modelling, control and automation and international conference on intelligent agents, web technologies and internet commerce (CIMCA-IAWTIC’06)
28.
go back to reference Shuqin L, Weiming X, Xiaohua Y (2015) Study on the evaluation function parameters of the checkers game program on Weka platform. Int J New Technol Res 1(7) Shuqin L, Weiming X, Xiaohua Y (2015) Study on the evaluation function parameters of the checkers game program on Weka platform. Int J New Technol Res 1(7)
29.
go back to reference Su Z, Li S, Jia Y, Zheng L, Fan S (2013) The realization of genetic algorithm in terms of checkers evaluation function. Appl Mech Mater 411–414:1979–1985CrossRef Su Z, Li S, Jia Y, Zheng L, Fan S (2013) The realization of genetic algorithm in terms of checkers evaluation function. Appl Mech Mater 411–414:1979–1985CrossRef
30.
go back to reference Tang K, Jarvis R (2005) Is XCS suitable for problems with temporal rewards? In: International conference on computational intelligence for modelling, control and automation and international conference on intelligent agents, web technologies and internet commerce (CIMCA-IAWTIC’06) Tang K, Jarvis R (2005) Is XCS suitable for problems with temporal rewards? In: International conference on computational intelligence for modelling, control and automation and international conference on intelligent agents, web technologies and internet commerce (CIMCA-IAWTIC’06)
31.
go back to reference Barry A (2003) Limits in long path learning with XCS. In: Genetic and evolutionary computation—GECCO 2003, pp 1832–1843 Barry A (2003) Limits in long path learning with XCS. In: Genetic and evolutionary computation—GECCO 2003, pp 1832–1843
32.
go back to reference Lanzi P, Loiacono D (2007) Classifier systems that compute action mappings. In: Proceedings of the 9th annual conference on genetic and evolutionary computation—GECCO’07 Lanzi P, Loiacono D (2007) Classifier systems that compute action mappings. In: Proceedings of the 9th annual conference on genetic and evolutionary computation—GECCO’07
33.
go back to reference Butz M, Goldberg D, Lanzi P (2004) Gradient-based learning updates improve XCS performance in multistep problems. In: Genetic and evolutionary computation—GECCO 2004, pp 751–762 Butz M, Goldberg D, Lanzi P (2004) Gradient-based learning updates improve XCS performance in multistep problems. In: Genetic and evolutionary computation—GECCO 2004, pp 751–762
34.
go back to reference Hiniker A, Daniels J, Williamson H (2013) Go go games. In: Proceedings of the 12th international conference on interaction design and children—IDC’13 Hiniker A, Daniels J, Williamson H (2013) Go go games. In: Proceedings of the 12th international conference on interaction design and children—IDC’13
35.
go back to reference Li X, Yang G (2016) Transferable XCS. In: Proceedings of the 2016 on genetic and evolutionary computation conference—GECCO’16 Li X, Yang G (2016) Transferable XCS. In: Proceedings of the 2016 on genetic and evolutionary computation conference—GECCO’16
Metadata
Title
An Evolutionary Approach to Combinatorial Gameplaying Using Extended Classifier Systems
Authors
Karmanya Oberoi
Sarthak Tandon
Abhishek Das
Swati Aggarwal
Copyright Year
2021
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-16-3067-5_54

Premium Partner