
Algorithms for Reinforcement Learning

  • Book
  • © 2010



About this book

Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that the learner receives only partial feedback about its predictions. Further, the predictions may have long-term effects by influencing the future state of the controlled system; thus, time plays a special role. The goal of reinforcement learning is to develop efficient learning algorithms and to understand their merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications it can address, ranging from problems in artificial intelligence to operations research and control engineering.

In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, present a large number of state-of-the-art algorithms, and discuss their theoretical properties and limitations.
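To give a flavor of the dynamic-programming-based algorithms the book covers, here is a minimal sketch of tabular TD(0) value prediction, one of the value prediction methods in this family. The two-state Markov reward process, its transition probabilities and rewards, and the step-size and discount values are illustrative assumptions for this sketch and are not taken from the book.

```python
import random

# Minimal sketch of tabular TD(0) value prediction.
# The two-state Markov reward process below is a made-up toy example;
# all numbers (transitions, rewards, GAMMA, ALPHA) are illustrative.

# Transition model: state -> list of (probability, next_state).
P = {0: [(0.7, 0), (0.3, 1)],
     1: [(0.3, 0), (0.7, 1)]}
R = {0: 0.0, 1: 1.0}   # reward received when leaving each state
GAMMA = 0.9            # discount factor
ALPHA = 0.05           # step size

def step(state):
    """Sample the next state from the toy transition model."""
    u, cum = random.random(), 0.0
    for prob, nxt in P[state]:
        cum += prob
        if u < cum:
            return nxt
    return P[state][-1][1]

V = {0: 0.0, 1: 0.0}   # value-function estimates
state = 0
for _ in range(100_000):
    nxt = step(state)
    # TD(0) update: move V[state] toward the bootstrapped target r + GAMMA * V[next].
    td_target = R[state] + GAMMA * V[nxt]
    V[state] += ALPHA * (td_target - V[state])
    state = nxt

print(V)  # estimates of the discounted long-term reward from each state
```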


Table of contents (4 chapters)

  • Markov Decision Processes
  • Value Prediction Problems
  • Control
  • For Further Exploration

Authors and Affiliations

  • University of Alberta, Canada

    Csaba Szepesvári

About the author

Csaba Szepesvári received his PhD in 1999 from "Jozsef Attila" University, Szeged, Hungary. He is currently an Associate Professor in the Department of Computing Science at the University of Alberta and a principal investigator of the Alberta Ingenuity Centre for Machine Learning. Previously, he held a senior researcher position at the Computer and Automation Research Institute of the Hungarian Academy of Sciences, where he headed the Machine Learning Group. Before that, he spent 5 years in the software industry: in 1998 he became Research Director of Mindmaker, Ltd., working on natural language processing and speech products, and from 2000 he served as Vice President of Research at the Silicon Valley company Mindmaker Inc. He is the coauthor of a book on nonlinear approximate adaptive controllers, has published over 80 journal and conference papers, serves as an Associate Editor of IEEE Transactions on Adaptive Control and AI Communications, sits on the editorial boards of the Journal of Machine Learning Research and the Machine Learning journal, and is a regular member of the program committees of various machine learning and AI conferences. His areas of expertise include statistical learning theory, reinforcement learning, and nonlinear adaptive control.
