Skip to main content

2023 | Buch

An Introduction to Optimal Control Theory

The Dynamic Programming Approach

verfasst von: Onésimo Hernández-Lerma, Leonardo R. Laura-Guarachi, Saul Mendoza-Palacios, David González-Sánchez

Verlag: Springer International Publishing

Buchreihe : Texts in Applied Mathematics

insite
SUCHEN

Über dieses Buch

This book introduces optimal control problems for large families of deterministic and stochastic systems with discrete or continuous time parameter. These families include most of the systems studied in many disciplines, including Economics, Engineering, Operations Research, and Management Science, among many others.

The main objective is to give a concise, systematic, and reasonably self contained presentation of some key topics in optimal control theory. To this end, most of the analyses are based on the dynamic programming (DP) technique. This technique is applicable to almost all control problems that appear in theory and applications. They include, for instance, finite and infinite horizon control problems in which the underlying dynamic system follows either a deterministic or stochastic difference or differential equation. In the infinite horizon case, it also uses DP to study undiscounted problems, such as the ergodic or long-run average cost.

After a general introduction to control problems, the book covers the topic dividing into four parts with different dynamical systems: control of discrete-time deterministic systems, discrete-time stochastic systems, ordinary differential equations, and finally a general continuous-time MCP with applications for stochastic differential equations.

The first and second part should be accessible to undergraduate students with some knowledge of elementary calculus, linear algebra, and some concepts from probability theory (random variables, expectations, and so forth). Whereas the third and fourth part would be appropriate for advanced undergraduates or graduate students who have a working knowledge of mathematical analysis (derivatives, integrals, ...) and stochastic processes.

Inhaltsverzeichnis

Frontmatter
Chapter 1. Introduction: Optimal Control Problems
Abstract
In a few words, in an optimal control problem (OCP) we are given a dynamical system that is “controllable” in the sense that its behavior depends on some parameters or components that we can choose within certain ranges. These components are called control actions. When we look at these control actions throught the whole period of time in which the system is functioning, then they form control policies or strategies. On the other hand, we are also given an objective function or performance index that somehow measures the system’s response to each control policy. The OCP is then to find a control policy that optimizes the given objective function.
Onésimo Hernández-Lerma, Leonardo R. Laura-Guarachi, Saul Mendoza-Palacios, David González-Sánchez
Chapter 2. Discrete–Time Deterministic Systems
Abstract
In  this chapter we consider the discrete–time system (1.​1) in the so–called deterministic case (see Remark 1.2(a)), so that the disturbances \(\xi _t\) are supposed to be given constants in some space S. Since this information is irrelevant for our present purposes, we will omit the notation \(\xi _t\) so (1.​1) becomes.
Onésimo Hernández-Lerma, Leonardo R. Laura-Guarachi, Saul Mendoza-Palacios, David González-Sánchez
Chapter 3. Discrete–Time Stochastic Control Systems
Abstract
For the discrete–time deterministic systems studied in Chap. 2 there is essentially a unique dynamic model.
Onésimo Hernández-Lerma, Leonardo R. Laura-Guarachi, Saul Mendoza-Palacios, David González-Sánchez
Chapter 4. Continuous–Time Deterministic Systems
Abstract
We now consider a deterministic continuous–time optimal control problem (OCP) in which the state process \(x(\cdot )\) evolves in the state space \(X:=\mathbb {R}^n\) according to an ordinary differential equation.
Onésimo Hernández-Lerma, Leonardo R. Laura-Guarachi, Saul Mendoza-Palacios, David González-Sánchez
Chapter 5. Continuous–Time Markov Control ProcessesMarkov control processcontinuous–time Optimal control problem (OCP)continuous–time
Abstract
As noted in Remark 4.7(b), the solution \(x(\cdot )\) of the (deterministic) ordinary differential equation (4.0.1) can be interpreted as a Markov control process (MCP), also known as a controlled Markov process. In this chapter we introduce some facts on general continuous–time MCPs, which allows us to make a unified presentation of related control problems. We will begin below with some comments on (noncontrolled) continuous–time Markov processes. (We only wish to motivate some concepts, so our presentation is not very precise. For further details, see the bibliographical notes at the end of this chapter.)
Onésimo Hernández-Lerma, Leonardo R. Laura-Guarachi, Saul Mendoza-Palacios, David González-Sánchez
Chapter 6. Controlled Diffusion Processes
Abstract
In the remainder of these notes we consider a class of \(\mathbb {R}^d\)–valued Markov processes \(\{x(t),t\ge 0\}\) called (Markov) diffusion processes. These are processes that are characterized in a suitable sense by a function \(b:[0,\infty )\times \mathbb {R}^d\rightarrow \mathbb {R}^d\) called the drift vector, and a \(d\times d\) matrix D on \([0,\infty )\times \mathbb {R}^d\) called the diffusion matrix, which is assumed to be symmetric and nonnegative definite. In the extreme case in which \(D\equiv 0\), the zero matrix, the process \(x(\cdot )\) is the solution of an ordinary differential equation \(\dot{x}(t)=b(t,x(t)), t\ge 0\). At the other extreme, if \(b\equiv 0\) and \(D\equiv I\) the identity matrix, then \(x(\cdot )\) is a Markov process called Wiener process or Brownian motion. (See Example 5.​3, above, or any introductory book on stochastic analysis or stochastic differential equations, for instance, Arnold (1974), Evans (2013), Mikosch (1998), Øksendal (2003),...)
Onésimo Hernández-Lerma, Leonardo R. Laura-Guarachi, Saul Mendoza-Palacios, David González-Sánchez
Backmatter
Metadaten
Titel
An Introduction to Optimal Control Theory
verfasst von
Onésimo Hernández-Lerma
Leonardo R. Laura-Guarachi
Saul Mendoza-Palacios
David González-Sánchez
Copyright-Jahr
2023
Electronic ISBN
978-3-031-21139-3
Print ISBN
978-3-031-21138-6
DOI
https://doi.org/10.1007/978-3-031-21139-3