2008 | OriginalPaper | Chapter
Proposal of Exploitation-Oriented Learning PS-r#
Authors : Kazuteru Miyazaki, Shigenobu Kobayashi
Published in: Intelligent Data Engineering and Automated Learning – IDEAL 2008
Publisher: Springer Berlin Heidelberg
Exploitation-oriented Learning (XoL) is a novel approach to goal-directed learning from interaction. Whereas reinforcement learning focuses more on learning and can guarantee optimality in Markov Decision Process (MDP) environments, XoL aims to learn a rational policy, whose expected reward per action is larger than zero, very quickly. PS-r*, one of the XoL methods, can learn a useful rational policy that is not inferior to a random walk in Partially Observable Markov Decision Process (POMDP) environments where there is only one type of reward. However, PS-r* requires O(MN²) memory, where N and M are the numbers of types of sensory input and action, respectively. In this paper, we propose PS-r#, which can learn a useful rational policy in POMDP environments with O(MN) memory. We confirm the effectiveness of PS-r# in numerical examples.
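The memory bounds stated above can be illustrated with a minimal sketch. This is a hypothetical table-size comparison only, not the authors' algorithm: we assume, for illustration, that an O(MN²) method indexes statistics by (observation, action, next observation), while an O(MN) method keeps one entry per (observation, action) pair.

```python
# Hypothetical illustration of the abstract's memory-footprint claim.
# Assumption (not from the paper's text): the O(MN^2) table is indexed by
# (observation, action, next observation), and the O(MN) table by
# (observation, action).

N = 10  # number of types of sensory input (observations)
M = 4   # number of types of action

o_mn2_entries = N * M * N  # O(MN^2) memory, as required by PS-r*
o_mn_entries = N * M       # O(MN) memory, as claimed for PS-r#

print(o_mn2_entries)  # 400
print(o_mn_entries)   # 40
```

For even modest N, the quadratic dependence on the number of sensory-input types dominates, which is why reducing the bound from O(MN²) to O(MN) matters in POMDP settings with large observation spaces.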