nach oben

Erschienen in:

2021 | OriginalPaper | Buchkapitel

On Finite Approximations to Markov Decision Processes with Recursive and Nonlinear Discounting

verfasst von : Fan Deng, Xin Guo, Yi Zhang

Erschienen in: Modern Trends in Controlled Stochastic Processes:

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

In this paper, finite approximation schemes are justified for Markov decision processes in Borel spaces with recursive and nonlinear discounting. Explicit error bounds are obtained in terms of the system primitives. This allows one to solve the original problem approximately up to any given accuracy, by solving a sequence of problems in finite spaces.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Full Gradient DQN Reinforcement Learning: A Provably Convergent Scheme

Nächstes Kapitel Locks, Bombs and Testing: The Case of Independent Locks

Aliprantis, C., Border, K.: Infinite Dimensional Analysis. Springer, Heidelberg (2006)MATH

Altman, E.: Constrained Markov Decision Processes. Chapman and Hall/CRC, Boca Raton (1999)MATH

Bäuerle, N., Jaśkiewicz, A., Nowak, A.: Stochastic dynamic programming with non-linear discounting. Appl. Math. Optim. (2020). https://doi.org/10.1007/s00245-020-09731-x

Bertsekas, D.: Convergence of discretization procedures in dynamic programming. IEEE Trans. Autom. Control 20, 415–419 (1975)MathSciNetCrossRef

Cioletti, L., Oliveira, E.: Applications of variable discounting dynamic programming to iterated function systems and related problems. Nonlinearity 32, 853–883 (2019)MathSciNetCrossRef

Dufour, F., Prieto-Rumeau, T.: Approximation of infinite horizon discounted cost Markov decision processes. In: Hernández-Hernández, D., Minjárez-Sosa, J. (eds.) Optimization, Control, and Applications of Stochastic Systems, pp. 59–76. Birkhäuser, Boston (2012)CrossRef

Dufour, F., Prieto-Rumeau, T.: Approximation of Markov decision processes with general state space. J. Math. Anal. Appl. 388, 1254–1267 (2012)MathSciNetCrossRef

Feinberg, E.A., Kasyanov, P., Zadoianchuk, N.: Berge’s theorem for noncompact image sets. J. Math. Anal. Appl. 397, 255–259 (2013)MathSciNetCrossRef

Hernández-Lerma, O., Lasserre, J.: Discrete-Time Markov Control Processes. Springer, New York (1996)CrossRef

10.

Jaśkiewicz, A., Matkowski, J., Nowak, A.: Persistently optimal policies in stochastic dynamic programming with generalized discounting. Math. Oper. Res. 38, 108–121 (2013)MathSciNetCrossRef

11.

Jaśkiewicz, A., Matkowski, J., Nowak, A.: On variable discounting in dynamic programming: applications to resource extraction and other economic models. Ann. Oper. Res. 220, 263–278 (2014)MathSciNetCrossRef

12.

Jaśkiewicz, A., Matkowski, J., Nowak, A.: Generalized discounting in dynamic programming with unbounded returns. Oper. Res. Lett. 42, 231–233 (2014)

13.

Kuntz, J., Thomas, P., Stan, G., Barahona, M.: Approximations of countably-infinite linear programs over bounded measure spaces. SIAM J. Optim. (2020). Preprint available via arXiv:1810.03658v3

14.

Piunovskiy, A., Zhang, Y.: Continuous-Time Markov Decision Processes. Springer, Cham (2020)CrossRef

15.

Puterman, M.: Markov Decision Processes. Wiley, New York (1994)CrossRef

16.

Saldi, N., Linder, T., Yüksel, S.: Finite Approximations in Discrete-Time Stochastic Control. Springer, Cham (2018)CrossRef

17.

Sennott, L.: Stochastic Dynamic Programming and the Control of Queueing Systems. Wiley, New York (1999)MATH

Titel: On Finite Approximations to Markov Decision Processes with Recursive and Nonlinear Discounting
verfasst von: Fan Deng
Xin Guo
Yi Zhang
Verlag: Springer International Publishing
Buch: Modern Trends in Controlled Stochastic Processes:
Print ISBN: 978-3-030-76927-7

Electronic ISBN: 978-3-030-76928-4

Copyright-Jahr: 2021
DOI: https://doi.org/10.1007/978-3-030-76928-4_11

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"