Article

Policy transfer with a relational learning classifier system

Author:
Drew Mellor

The University of Newcastle, Callaghan, Australia

The University of Newcastle, Callaghan, Australia
View Profile

GECCO '05: Proceedings of the 7th annual workshop on Genetic and evolutionary computationJune 2005Pages 82–84https://doi.org/10.1145/1102256.1102273

Published:25 June 2005Publication History

GECCO '05: Proceedings of the 7th annual workshop on Genetic and evolutionary computation

Pages 82–84

ABSTRACT

Policy transfer occurs when a system transfers a policy learnt for one task to another task with little or no retraining, and allows a system to perform robustly and learn efficiently, especially when the new task is more complex than the original task. In this paper we report on work in progress into policy transfer using a relational learning classifier system. The system, FOX-cs, uses a high level relational language (a subset first order logic) in combination with a P-learning technique adapted for Xcs and its derivatives. FOX-CS achieved successful policy transfer in two blocks world tasks, stacking and onab, by learning a policy that was independent of the number of blocks, thus avoiding the prohibitive training times that would normally arise due to the exponential explosion in the number of states as the number of blocks increases.

References

Joshua Cole, John Lloyd, and Kee Siong Ng. Symbolic learning for adaptive agents. In Proceedings of the Annual Partner Conference, Smart Internet Technology Cooperative Research Centre, 2003. www.smartinternet.com.au/SITWEB/publication/publications.jsp.Google Scholar
Sašo Džeroski, Luc De Raedt, and Kurt Driessens. Relational reinforcement learning. Machine Learning, 43(1-2):7--52, 2001. Google ScholarDigital Library
Pier Luca Lanzi. Mining interesting knowledge from data with the XCS classifier system. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001), pages 958--965, San Francisco, California, USA, 2001. Morgan Kaufmann.Google Scholar
Drew Mellor. A first order logic classifier system. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2005), To Appear, 2005. Google ScholarDigital Library
John Slaney and Sylvie Thiébaux. Blocks World revisited. Artificial Intelligence, 125:119--153, 2001. Google ScholarDigital Library
Stewart W. Wilson. Generalization in the XCS classifier system. In Genetic Programming 1998: Proceedings of the Third Annual Conference, pages 665--674, University of Wisconsin, Madison, Wisconsin, USA, 1998. Morgan Kaufmann.Google Scholar

Index Terms

Policy transfer with a relational learning classifier system
1. Computing methodologies
  1. Artificial intelligence
    1. Knowledge representation and reasoning
2. Theory of computation
  1. Logic

Recommendations

Efficient Deep Reinforcement Learning through Policy Transfer
AAMAS '20: Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems

Transfer Learning (TL) has shown great potential to accelerate Reinforcement Learning (RL) by leveraging prior knowledge from past learned policies of relevant tasks. Existing TL approaches either explicitly computes the similarity between tasks or ...
Read More
Transferable XCS
GECCO '16: Proceedings of the Genetic and Evolutionary Computation Conference 2016

Traditional accuracy-based XCS classifier system generally learns and evolves classifiers from scratch when facing each particular problem. Inspired by humans with the ability to learn new skills by inducing knowledge from related problems, transfer ...
Read More
Learning classifier system with average reward reinforcement learning

In the family of Learning Classifier Systems, the classifier system XCS is most widely used and investigated. However, the standard XCS has difficulties solving large multi-step problems, where long action chains are needed to get delayed rewards. Up to ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
GECCO '05: Proceedings of the 7th annual workshop on Genetic and evolutionary computation
June 2005
431 pages
ISBN:9781450378000
DOI:10.1145/1102256
Conference Chair:
Franz Rothlauf
University of Mannheim, Germany
Copyright © 2005 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 25 June 2005
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
XCS
blocks world
first order logic
learning classifier system
policy transfer
relational learning
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate1,669of4,410submissions,38%
Upcoming Conference
GECCO '24

Sponsor:

sigevo

Genetic and Evolutionary Computation Conference

July 14 - 18, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 162
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Policy transfer with a relational learning classifier system

GECCO '05: Proceedings of the 7th annual workshop on Genetic and evolutionary computation

ABSTRACT

References

Cited By

Index Terms

Recommendations

Efficient Deep Reinforcement Learning through Policy Transfer

Transferable XCS

Learning classifier system with average reward reinforcement learning