Skip to main content

2018 | OriginalPaper | Buchkapitel

6. Approximations for Constrained Markov Decision Problems

verfasst von : Naci Saldi, Tamás Linder, Serdar Yüksel

Erschienen in: Finite Approximations in Discrete-Time Stochastic Control

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This chapter studies the finite-state approximation of a discrete-time constrained Markov decision process with compact state space, under the discounted and average cost criteria. Using the linear programming formulation of the constrained discounted problem, we prove the convergence of the optimal value function of the finite-state model to the optimal value function of the original model. Under further continuity conditions on the transition probability of the original discounted model, we also establish a method to compute approximately optimal policies. For the average cost criterion, instead of using the finite-state linear programming approximation method, we use a direct method to establish analogous results under drift and minorization conditions which guarantee the geometric ergodicity of Markov chains induced by stationary policies.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat C.D. Aliprantis, K.C. Border, Infinite Dimensional Analysis (Springer, Berlin, 2006) C.D. Aliprantis, K.C. Border, Infinite Dimensional Analysis (Springer, Berlin, 2006)
11.
Zurück zum Zitat A. Barnivok, A Course in Convexity (American Mathematical Society, Providence, 2002) A. Barnivok, A Course in Convexity (American Mathematical Society, Providence, 2002)
41.
Zurück zum Zitat F. Dufour, T. Prieto-Rumeau, Finite linear programming approximations of constrained discounted Markov decision processes. SIAM J. Control Optim. 51(2), 1298–1324 (2013) F. Dufour, T. Prieto-Rumeau, Finite linear programming approximations of constrained discounted Markov decision processes. SIAM J. Control Optim. 51(2), 1298–1324 (2013)
43.
Zurück zum Zitat F. Dufour, T. Prieto-Rumeau, Stochastic approximations of constrained discounted Markov decision processes. J. Math. Anal. Appl. 413, 856–879 (2014) F. Dufour, T. Prieto-Rumeau, Stochastic approximations of constrained discounted Markov decision processes. J. Math. Anal. Appl. 413, 856–879 (2014)
45.
Zurück zum Zitat P.M. Esfahani, T. Sutter, D. Kuhn, J. Lygeros, From infinite to finite programs: explicit error bounds with applications to approximate dynamic programming. arXiv:1701.06379 (2017) P.M. Esfahani, T. Sutter, D. Kuhn, J. Lygeros, From infinite to finite programs: explicit error bounds with applications to approximate dynamic programming. arXiv:1701.06379 (2017)
67.
Zurück zum Zitat O. Hernandez-Lerma, J. Gonzalez-Hernandez, Constrained Markov control processes in Borel spaces:the discounted case. Math. Methods Oper. Res. 52, 271–285 (2000) O. Hernandez-Lerma, J. Gonzalez-Hernandez, Constrained Markov control processes in Borel spaces:the discounted case. Math. Methods Oper. Res. 52, 271–285 (2000)
68.
Zurück zum Zitat O. Hernández-Lerma, J.B. Lasserre, Discrete-Time Markov Control Processes: Basic Optimality Criteria (Springer, Berlin, 1996) O. Hernández-Lerma, J.B. Lasserre, Discrete-Time Markov Control Processes: Basic Optimality Criteria (Springer, Berlin, 1996)
73.
Zurück zum Zitat O. Hernandez-Lerma, J. Gonzalez-Hernandez, R.R. Lopez-Martinez, Constrained average cost Markov control processes in Borel spaces. SIAM J. Control Optim. 42(2), 442–468 (2003) O. Hernandez-Lerma, J. Gonzalez-Hernandez, R.R. Lopez-Martinez, Constrained average cost Markov control processes in Borel spaces. SIAM J. Control Optim. 42(2), 442–468 (2003)
117.
Zurück zum Zitat N. Saldi, S. Yüksel, T. Linder, Finite-state approximations to constrained Markov decision processes with Borel spaces, in 52nd Annual Allerton Conference on Communication, Control, and Computing, Monticello, IL (2015), pp. 567–572 N. Saldi, S. Yüksel, T. Linder, Finite-state approximations to constrained Markov decision processes with Borel spaces, in 52nd Annual Allerton Conference on Communication, Control, and Computing, Monticello, IL (2015), pp. 567–572
Metadaten
Titel
Approximations for Constrained Markov Decision Problems
verfasst von
Naci Saldi
Tamás Linder
Serdar Yüksel
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-79033-6_6