research-article

How novelty search escapes the deceptive trap of learning to learn

Authors:
Sebastian Risi

University of Central Florida, Orlando, FL, USA

University of Central Florida, Orlando, FL, USA
View Profile

,
Sandy D. Vanderbleek

University of Central Florida, Orlando, FL, USA

University of Central Florida, Orlando, FL, USA
View Profile

,
Charles E. Hughes

University of Central Florida, Orlando, FL, USA

University of Central Florida, Orlando, FL, USA
View Profile

,
Kenneth O. Stanley

University of Central Florida, Orlando, FL, USA

University of Central Florida, Orlando, FL, USA
View Profile

GECCO '09: Proceedings of the 11th Annual conference on Genetic and evolutionary computationJuly 2009Pages 153–160https://doi.org/10.1145/1569901.1569923

Published:08 July 2009Publication History

GECCO '09: Proceedings of the 11th Annual conference on Genetic and evolutionary computation

Pages 153–160

ABSTRACT

A major goal for researchers in neuroevolution is to evolve artificial neural networks (ANNs) that can learn during their lifetime. Such networks can adapt to changes in their environment that evolution on its own cannot anticipate. However, a profound problem with evolving adaptive systems is that if the impact of learning on the fitness of the agent is only marginal, then evolution is likely to produce individuals that do not exhibit the desired adaptive behavior. Instead, because it is easier at first to improve fitness without evolving the ability to learn, they are likely to exploit domain-dependent static (i.e. non-adaptive) heuristics. This paper proposes a way to escape the deceptive trap of static policies based on the novelty search algorithm, which opens up a new avenue in the evolution of adaptive systems because it can exploit the behavioral difference between learning and non-learning individuals. The main idea in novelty search is to abandon objective-based fitness and instead simply search only for novel behavior, which avoids deception entirely and has shown prior promising results in other domains. This paper shows that novelty search significantly outperforms fitness-based search in a tunably deceptive T-Maze navigation domain because it fosters the emergence of adaptive behavior.

References

T. Aaltonen et al. Measurement of the top quark mass with dilepton events selected using neuroevolution at CDF. Physical Review Letters, 2009. To appear.Google ScholarCross Ref
J. Blynel and D. Floreano. Exploring the T-Maze: Evolving Learning-Like Robot Behaviors using CTRNNs. In 2nd European Workshop on Evolutionary Robotics (EvoRob'2003), Lecture Notes in Computer Science, 2003.Google ScholarDigital Library
T. Carew, E. Walters, and E. Kandel. Classical conditioning in a simple withdrawal reflex in Aplysia californica. The Journal of Neuroscience, 1(12):1426--1437, 1981.Google ScholarCross Ref
P. Darwen and Y. Yao. Every niching method has its niche: Fitness sharing and implicit sharing compared. Parallel Problem Solving from Nature (PPSN IV), pages 398--407, 1996. Google ScholarDigital Library
D. Floreano and J. Urzelai. Evolutionary robots with online self-organization and behavioral fitness. Neural Networks, 13:431--443, 2000. Google ScholarDigital Library
D. E. Goldberg and J. Richardson. Genetic algorithms with sharing for multimodal function optimization. In Proceedings of the Second International Conference on Genetic Algorithms on Genetic algorithms and their application, pages 41--49, Hillsdale, NJ, USA, 1987. L. Erlbaum Associates Inc. Google ScholarDigital Library
G. E. Hinton and S. J. Nowlan. How learning can guide evolution. Complex Systems, 1, 1987.Google Scholar
G. S. Hornby. Alps: the age-layered population structure for reducing the problem of premature convergence. In GECCO '06: Proceedings of the 8th annual conference on Genetic and evolutionary computation, pages 815--822, New York, NY, USA, 2006. ACM. Google ScholarDigital Library
J. Hu, E. Goodman, K. Seo, Z. Fan, and R. Rosenberg. The hierarchical fair competition (hfc) framework for sustainable evolutionary algorithms. Evolutionary Computation, 13(2):241--277, 2005. PMID: 15969902. Google ScholarDigital Library
M. Hutter and S. Legg. Fitness uniform optimization. IEEE Transactions on Evolutionary Computation, 10:568--589, 2006. Google ScholarDigital Library
E. D. Jong. The incremental pareto-coevolution archive. In Proceedings of the Genetic and Evolutionary Computation Conference, (GECCO--2004), Berlin, 2004. Springer.Google ScholarCross Ref
J. Lehman and K. O. Stanley. Exploiting open endedness to solve problems through the search for novelty. In Proceedings of the Eleventh International Conference on Artificial Life, Cambridge, MA, 2008. MIT Press.Google Scholar
S. W. Mahfoud. Niching methods for genetic algorithms. PhD thesis, Champaign, IL, USA, 1995. Google ScholarDigital Library
G. Mayley. Guiding or hiding: Explorations into the effects of learning on the rate of evolution. In Fourth European Conference on Artificial Life, pages 135--144. MIT Press, 1997.Google Scholar
Y. Niv, D. Joel, I. Meilijson, and E. Ruppin. Evolution of reinforcement learning in uncertain environments: A simple explanation for complex foraging behaviors. Adaptive Behavior, 10(1):5--24, 2002. Google ScholarDigital Library
S. Nolfi and D. Floreano. Learning and evolution. Autonomous Robots, 7(1):89--113, July 1999. Google ScholarDigital Library
S. Nolfi, D. Parisi, and J. L. Elman. Learning and evolution in neural networks. Adaptive Behavior, 3:5--28, 1994. Google ScholarDigital Library
A. Soltoggio, J. A. Bullinaria, C. Mattiussi, P. Dürr, and D. Floreano. Evolutionary Advantages of Neuromodulated Plasticity in Dynamic, Reward-based Scenarios. In Artificial Life XI, pages 569--576, Cambridge, MA, 2008. MIT Press.Google Scholar
A. Soltoggio, P. Dürr, C. Mattiussi, and D. Floreano. Evolving neuromodulatory topologies for reinforcement learning-like problems. In Proceedings of the IEEE Congress on Evolutionary Computation, 2007.Google ScholarCross Ref
K. O. Stanley. rtNEAT C++ software homepage: www.cs.utexas.edu/users/nn/keyword?rtneat. 2006--2008.Google Scholar
K. O. Stanley, B. D. Bryant, and R. Miikkulainen. Evolving adaptive neural networks with and without adaptive synapses. In Proceedings of the 2003 IEEE Congress on Evolutionary Computation (CEC--2003). Canberra, Australia: IEEE Press, 2003.Google ScholarCross Ref
K. O. Stanley and R. Miikkulainen. Evolving neural networks through augmenting topologies. Evolutionary Computation, 10:99--127, 2002. Google ScholarDigital Library

Index Terms

How novelty search escapes the deceptive trap of learning to learn
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Neural networks

Recommendations

Devising Effective Novelty Search Algorithms: A Comprehensive Empirical Study
GECCO '15: Proceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation

Novelty search is a state-of-the-art evolutionary approach that promotes behavioural novelty instead of pursuing a static objective. Along with a large number of successful applications, many different variants of novelty search have been proposed. It ...
Read More
Critical factors in the performance of novelty search
GECCO '11: Proceedings of the 13th annual conference on Genetic and evolutionary computation

Novelty search is a recently proposed method for evolutionary computation designed to avoid the problem of deception, in which the fitness function guides the search process away from global optima. Novelty search replaces fitness-based selection with ...
Read More
Evolving plastic neural networks with novelty search

Biological brains can adapt and learn from past experience. Yet neuroevolution, that is, automatically creating artificial neural networks (ANNs) through evolutionary algorithms, has sometimes focused on static ANNs that cannot change their weights ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
GECCO '09: Proceedings of the 11th Annual conference on Genetic and evolutionary computation
July 2009
2036 pages
ISBN:9781605583259
DOI:10.1145/1569901
General Chair:
Franz Rothlauf
University of Mainz, Germany
Copyright © 2009 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 8 July 2009
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
adaptation
learning
neat
neural networks
neuroevolution
neuromodulation
novelty search
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,669of4,410submissions,38%
Upcoming Conference
GECCO '24

Sponsor:

sigevo

Genetic and Evolutionary Computation Conference

July 14 - 18, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 49
  Total Citations
  View Citations
- 400
  Total Downloads
- Downloads (Last 12 months)29
- Downloads (Last 6 weeks)6
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

How novelty search escapes the deceptive trap of learning to learn

GECCO '09: Proceedings of the 11th Annual conference on Genetic and evolutionary computation

ABSTRACT

References

Cited By

Index Terms

Recommendations

Devising Effective Novelty Search Algorithms: A Comprehensive Empirical Study

Critical factors in the performance of novelty search

Evolving plastic neural networks with novelty search