skip to main content
10.1145/1569901.1569923acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

How novelty search escapes the deceptive trap of learning to learn

Published:08 July 2009Publication History

ABSTRACT

A major goal for researchers in neuroevolution is to evolve artificial neural networks (ANNs) that can learn during their lifetime. Such networks can adapt to changes in their environment that evolution on its own cannot anticipate. However, a profound problem with evolving adaptive systems is that if the impact of learning on the fitness of the agent is only marginal, then evolution is likely to produce individuals that do not exhibit the desired adaptive behavior. Instead, because it is easier at first to improve fitness without evolving the ability to learn, they are likely to exploit domain-dependent static (i.e. non-adaptive) heuristics. This paper proposes a way to escape the deceptive trap of static policies based on the novelty search algorithm, which opens up a new avenue in the evolution of adaptive systems because it can exploit the behavioral difference between learning and non-learning individuals. The main idea in novelty search is to abandon objective-based fitness and instead simply search only for novel behavior, which avoids deception entirely and has shown prior promising results in other domains. This paper shows that novelty search significantly outperforms fitness-based search in a tunably deceptive T-Maze navigation domain because it fosters the emergence of adaptive behavior.

References

  1. T. Aaltonen et al. Measurement of the top quark mass with dilepton events selected using neuroevolution at CDF. Physical Review Letters, 2009. To appear.Google ScholarGoogle ScholarCross RefCross Ref
  2. J. Blynel and D. Floreano. Exploring the T-Maze: Evolving Learning-Like Robot Behaviors using CTRNNs. In 2nd European Workshop on Evolutionary Robotics (EvoRob'2003), Lecture Notes in Computer Science, 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Carew, E. Walters, and E. Kandel. Classical conditioning in a simple withdrawal reflex in Aplysia californica. The Journal of Neuroscience, 1(12):1426--1437, 1981.Google ScholarGoogle ScholarCross RefCross Ref
  4. P. Darwen and Y. Yao. Every niching method has its niche: Fitness sharing and implicit sharing compared. Parallel Problem Solving from Nature (PPSN IV), pages 398--407, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. D. Floreano and J. Urzelai. Evolutionary robots with online self-organization and behavioral fitness. Neural Networks, 13:431--443, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. E. Goldberg and J. Richardson. Genetic algorithms with sharing for multimodal function optimization. In Proceedings of the Second International Conference on Genetic Algorithms on Genetic algorithms and their application, pages 41--49, Hillsdale, NJ, USA, 1987. L. Erlbaum Associates Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. G. E. Hinton and S. J. Nowlan. How learning can guide evolution. Complex Systems, 1, 1987.Google ScholarGoogle Scholar
  8. G. S. Hornby. Alps: the age-layered population structure for reducing the problem of premature convergence. In GECCO '06: Proceedings of the 8th annual conference on Genetic and evolutionary computation, pages 815--822, New York, NY, USA, 2006. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Hu, E. Goodman, K. Seo, Z. Fan, and R. Rosenberg. The hierarchical fair competition (hfc) framework for sustainable evolutionary algorithms. Evolutionary Computation, 13(2):241--277, 2005. PMID: 15969902. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Hutter and S. Legg. Fitness uniform optimization. IEEE Transactions on Evolutionary Computation, 10:568--589, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. E. D. Jong. The incremental pareto-coevolution archive. In Proceedings of the Genetic and Evolutionary Computation Conference, (GECCO--2004), Berlin, 2004. Springer.Google ScholarGoogle ScholarCross RefCross Ref
  12. J. Lehman and K. O. Stanley. Exploiting open endedness to solve problems through the search for novelty. In Proceedings of the Eleventh International Conference on Artificial Life, Cambridge, MA, 2008. MIT Press.Google ScholarGoogle Scholar
  13. S. W. Mahfoud. Niching methods for genetic algorithms. PhD thesis, Champaign, IL, USA, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. G. Mayley. Guiding or hiding: Explorations into the effects of learning on the rate of evolution. In Fourth European Conference on Artificial Life, pages 135--144. MIT Press, 1997.Google ScholarGoogle Scholar
  15. Y. Niv, D. Joel, I. Meilijson, and E. Ruppin. Evolution of reinforcement learning in uncertain environments: A simple explanation for complex foraging behaviors. Adaptive Behavior, 10(1):5--24, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Nolfi and D. Floreano. Learning and evolution. Autonomous Robots, 7(1):89--113, July 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Nolfi, D. Parisi, and J. L. Elman. Learning and evolution in neural networks. Adaptive Behavior, 3:5--28, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Soltoggio, J. A. Bullinaria, C. Mattiussi, P. Dürr, and D. Floreano. Evolutionary Advantages of Neuromodulated Plasticity in Dynamic, Reward-based Scenarios. In Artificial Life XI, pages 569--576, Cambridge, MA, 2008. MIT Press.Google ScholarGoogle Scholar
  19. A. Soltoggio, P. Dürr, C. Mattiussi, and D. Floreano. Evolving neuromodulatory topologies for reinforcement learning-like problems. In Proceedings of the IEEE Congress on Evolutionary Computation, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  20. K. O. Stanley. rtNEAT C++ software homepage: www.cs.utexas.edu/users/nn/keyword?rtneat. 2006--2008.Google ScholarGoogle Scholar
  21. K. O. Stanley, B. D. Bryant, and R. Miikkulainen. Evolving adaptive neural networks with and without adaptive synapses. In Proceedings of the 2003 IEEE Congress on Evolutionary Computation (CEC--2003). Canberra, Australia: IEEE Press, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  22. K. O. Stanley and R. Miikkulainen. Evolving neural networks through augmenting topologies. Evolutionary Computation, 10:99--127, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. How novelty search escapes the deceptive trap of learning to learn

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GECCO '09: Proceedings of the 11th Annual conference on Genetic and evolutionary computation
      July 2009
      2036 pages
      ISBN:9781605583259
      DOI:10.1145/1569901

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 July 2009

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,669of4,410submissions,38%

      Upcoming Conference

      GECCO '24
      Genetic and Evolutionary Computation Conference
      July 14 - 18, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader