research-article

HyperNEAT-GGP: a hyperNEAT-based atari general game player

Authors:
Matthew Hausknecht

University of Texas at Austin, Austin, TX, USA

University of Texas at Austin, Austin, TX, USA
View Profile

,
Piyush Khandelwal

University of Texas at Austin, Austin, TX, USA

University of Texas at Austin, Austin, TX, USA
View Profile

,
Risto Miikkulainen

University of Texas at Austin, Austin, USA

University of Texas at Austin, Austin, USA
View Profile

,
Peter Stone

University of Texas at Austin, Austin, USA

University of Texas at Austin, Austin, USA
View Profile

GECCO '12: Proceedings of the 14th annual conference on Genetic and evolutionary computationJuly 2012Pages 217–224https://doi.org/10.1145/2330163.2330195

Published:07 July 2012Publication History

GECCO '12: Proceedings of the 14th annual conference on Genetic and evolutionary computation

Pages 217–224

ABSTRACT

This paper considers the challenge of enabling agents to learn with as little domain-specific knowledge as possible. The main contribution is HyperNEAT-GGP, a HyperNEAT-based General Game Playing approach to Atari games. By leveraging the geometric regularities present in the Atari game screen, HyperNEAT effectively evolves policies for playing two different Atari games, Asterix and Freeway. Results show that HyperNEAT-GGP outperforms existing benchmarks on these games. HyperNEAT-GGP represents a step towards the ambitious goal of creating an agent capable of learning and seamlessly transitioning between many different tasks.

References

M. Campbell, A. J. H. Jr., and F. hsiung Hsu. Deep blue. Artif. Intell., 134(1-2):57--83, 2002. Google ScholarDigital Library
J. Clune, B. E. Beckmann, C. Ofria, and R. T. Pennock. Evolving coordinated quadruped gaits with the hyperneat generative encoding. In Proceedings of the Eleventh conference on Congress on Evolutionary Computation, CEC'09, pages 2764--2771, Piscataway, NJ, USA, 2009. IEEE Press. Google ScholarDigital Library
D. B. D'Ambrosio and K. O. Stanley. Generative encoding for multiagent learning. In GECCO '08: Proceedings of the 10th annual conference on Genetic and evolutionary computation, pages 819--826, New York, NY, USA, 2008. ACM. Google ScholarDigital Library
C. Diuk, A. Cohen, and M. L. Littman. An object-oriented representation for efficient reinforcement learning. In Proceedings of 25th International Conference on Machine Learning (ICML), pages 240--247, 2008. Google ScholarDigital Library
G. Edgers. Atari and the deep history of video games. http://www.boston.com/bostonglobe/ideas/articles/2009/03/08/a_talk_with_nick_montfort/.Google Scholar
D. Ferrucci, E. Brown, J. Chu-Carroll, J. Fan, D. Gondek, A. A. Kalyanpur, A. Lally, J. W. Murdock, E. Nyberg, J. Prager, N. Schlaefer, and C. Welty. Building Watson: An Overview of the DeepQA Project. AI Magazine, 31(3), 2010.Google Scholar
J. Gauci and K. O. Stanley. A case study on the critical role of geometric regularity in machine learning. In Proceedings of the 23rd National Conference on Artificial Intelligence (AAAI), 2008. Google ScholarDigital Library
M. Genesereth and N. Love. General game playing: Overview of the aaai competition. AI Magazine, 26:62--72, 2005.Google ScholarDigital Library
S. M. Lucas. Ms pac-man competition (screen capture mode). http://dces.essex.ac.uk/staff/sml/pacman/CIG2011Results.html.Google Scholar
S. M. Lucas. Ms pac-man competition. SIGEVOlution, 2(4):37--38, 2007. Google ScholarDigital Library
Y. Naddaf. Game-independent ai agents for playing atari 2600 console games. Master's thesis, University of Alberta, 2010.Google Scholar
M. Parker and B. Bryant. Backpropagation without human supervision for visual control in quake ii. Proceedings of the 2009 IEEE Symposium on Computational Intelligence and Games (CIG'09), pages 287--293, 2009. Google ScholarDigital Library
K. O. Stanley and R. Miikkulainen. Evolving neural networks through augmenting topologies. Evolutionary Computation, 10(2):99--127, 2002. Google ScholarDigital Library
P. Stone and R. S. Sutton. Scaling reinforcement learning toward RoboCup soccer. In Proceedings of the Eighteenth International Conference on Machine Learning, pages 537--544. Morgan Kaufmann, San Francisco, CA, 2001. Google ScholarDigital Library
P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165--188, 2005.Google ScholarCross Ref
R. S. Sutton and A. G. Barto. Reinforcement learning: An introduction. IEEE Transactions on Neural Networks, 9(5):1054--1054, 1998. Google ScholarDigital Library
G. Tesauro. Td-gammon, a self-teaching backgammon program, achieves master-level play. Neural Comput., 6:215--219, March 1994. Google ScholarDigital Library
P. Verbancsics and K. O. Stanley. Evolving static representations for task transfer. J. Mach. Learn. Res., 11:1737--1769, August 2010. Google ScholarDigital Library

Index Terms

HyperNEAT-GGP: a hyperNEAT-based atari general game player
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Neural networks

Recommendations

Investigating whether hyperNEAT produces modular neural networks
GECCO '10: Proceedings of the 12th annual conference on Genetic and evolutionary computation

HyperNEAT represents a class of neuroevolutionary algorithms that captures some of the power of natural development with a computationally efficient high-level abstraction of development. This class of algorithms is intended to provide many of the ...
Read More
Evolving the placement and density of neurons in the hyperneat substrate
GECCO '10: Proceedings of the 12th annual conference on Genetic and evolutionary computation

The Hypercube-based NeuroEvolution of Augmenting Topologies (HyperNEAT) approach demonstrated that the pattern of weights across the connectivity of an artificial neural network (ANN) can be generated as a function of its geometry, thereby allowing ...
Read More
Enhancing es-hyperneat to evolve more complex regular neural networks
GECCO '11: Proceedings of the 13th annual conference on Genetic and evolutionary computation

The recently-introduced evolvable-substrate HyperNEAT algorithm (ES-HyperNEAT) demonstrated that the placement and density of hidden nodes in an artificial neural network can be determined based on implicit information in an infinite-resolution pattern ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
GECCO '12: Proceedings of the 14th annual conference on Genetic and evolutionary computation
July 2012
1396 pages
ISBN:9781450311779
DOI:10.1145/2330163
Editor:
Terence Soule
University of Idaho, USA
,
General Chair:
Jason H. Moore
Dartmouth College, USA
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 7 July 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
atari
general game playing
hyperNEAT
learning agents
neuroevolution
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,669of4,410submissions,38%
Upcoming Conference
GECCO '24

Sponsor:

sigevo

Genetic and Evolutionary Computation Conference

July 14 - 18, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 33
  Total Citations
  View Citations
- 453
  Total Downloads
- Downloads (Last 12 months)14
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HyperNEAT-GGP: a hyperNEAT-based atari general game player

GECCO '12: Proceedings of the 14th annual conference on Genetic and evolutionary computation

ABSTRACT

References

Cited By

Index Terms

Recommendations

Investigating whether hyperNEAT produces modular neural networks

Evolving the placement and density of neurons in the hyperneat substrate

Enhancing es-hyperneat to evolve more complex regular neural networks

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

HyperNEAT-GGP: a hyperNEAT-based atari general game player

GECCO '12: Proceedings of the 14th annual conference on Genetic and evolutionary computation

ABSTRACT

References

Cited By

Index Terms

Recommendations

Investigating whether hyperNEAT produces modular neural networks

Evolving the placement and density of neurons in the hyperneat substrate

Enhancing es-hyperneat to evolve more complex regular neural networks

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media