Abstract
In the previous chapters, we considered a planning setting in which the algorithm is given a model of the MODP and produces a coverage set. But what if the model is unknown at the time a coverage set needs to be produced? In that case, the algorithm must learn about the MODP from interaction with the environment. This is called the reinforcement learning or simply learning setting [Sutton and Barto, 1998, Wiering and Van Otterlo, 2012].
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Roijers, D.M., Whiteson, S. (2017). Learning. In: Multi-Objective Decision Making. Synthesis Lectures on Artificial Intelligence and Machine Learning. Springer, Cham. https://doi.org/10.1007/978-3-031-01576-2_6
Download citation
DOI: https://doi.org/10.1007/978-3-031-01576-2_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-00448-3
Online ISBN: 978-3-031-01576-2
eBook Packages: Synthesis Collection of Technology (R0)eBColl Synthesis Collection 7