Skip to main content
Erschienen in:
Buchtitelbild

2013 | OriginalPaper | Buchkapitel

1. Relative Value Iteration for Stochastic Differential Games

verfasst von : Ari Arapostathis, Vivek S. Borkar, K. Suresh Kumar

Erschienen in: Advances in Dynamic Games

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We study zero-sum stochastic differential games with player dynamics governed by a nondegenerate controlled diffusion process. Under the assumption of uniform stability, we establish the existence of a solution to the Isaac’s equation for the ergodic game and characterize the optimal stationary strategies. The data is not assumed to be bounded, nor do we assume geometric ergodicity. Thus our results extend previous work in the literature. We also study a relative value iteration scheme that takes the form of a parabolic Isaac’s equation. Under the hypothesis of geometric ergodicity we show that the relative value iteration converges to the elliptic Isaac’s equation as time goes to infinity. We use these results to establish convergence of the relative value iteration for risk-sensitive control problems under an asymptotic flatness assumption.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Arapostathis A, Borkar VS (2012) A relative value iteration algorithm for nondegenerate controlled diffusions. SIAM J Control Optim 50(4):1886–1902MathSciNetCrossRefMATH Arapostathis A, Borkar VS (2012) A relative value iteration algorithm for nondegenerate controlled diffusions. SIAM J Control Optim 50(4):1886–1902MathSciNetCrossRefMATH
Zurück zum Zitat Arapostathis A, Borkar VS, Ghosh MK (2011) Ergodic control of diffusion processes. Encyclopedia of mathematics and its applications, vol 143. Cambridge University Press, CambridgeCrossRef Arapostathis A, Borkar VS, Ghosh MK (2011) Ergodic control of diffusion processes. Encyclopedia of mathematics and its applications, vol 143. Cambridge University Press, CambridgeCrossRef
Zurück zum Zitat Beneš VE (1970) Existence of optimal strategies based on specified information, for a class of stochastic decision problems. SIAM J Control 8:179–188MathSciNetCrossRefMATH Beneš VE (1970) Existence of optimal strategies based on specified information, for a class of stochastic decision problems. SIAM J Control 8:179–188MathSciNetCrossRefMATH
Zurück zum Zitat Borkar VS, Ghosh MK (1992) Stochastic differential games: occupation measure based approach. J Optim Theory Appl 73(2):359–385MathSciNetCrossRefMATH Borkar VS, Ghosh MK (1992) Stochastic differential games: occupation measure based approach. J Optim Theory Appl 73(2):359–385MathSciNetCrossRefMATH
Zurück zum Zitat Borkar VS, Suresh Kumar K (2010) Singular perturbations in risk-sensitive stochastic control. SIAM J Control Optim 48(6):3675–3697MathSciNetCrossRefMATH Borkar VS, Suresh Kumar K (2010) Singular perturbations in risk-sensitive stochastic control. SIAM J Control Optim 48(6):3675–3697MathSciNetCrossRefMATH
Zurück zum Zitat Gilbarg D, Trudinger NS (1983) Elliptic partial differential equations of second order, 2nd edn. Grundlehren der Mathematischen Wissenschaften, vol 224. Springer, Berlin Gilbarg D, Trudinger NS (1983) Elliptic partial differential equations of second order, 2nd edn. Grundlehren der Mathematischen Wissenschaften, vol 224. Springer, Berlin
Zurück zum Zitat Gruber M (1984) Harnack inequalities for solutions of general second order parabolic equations and estimates of their Hölder constants. Math Z 185(1):23–43MathSciNetCrossRefMATH Gruber M (1984) Harnack inequalities for solutions of general second order parabolic equations and estimates of their Hölder constants. Math Z 185(1):23–43MathSciNetCrossRefMATH
Zurück zum Zitat Ladyženskaja OA, Solonnikov VA, Ural′ceva NN (1967) Linear and quasilinear equations of parabolic type. Translated from the Russian by S. Smith. Translations of Mathematical Monographs, Vol. 23. American Mathematical Society, Providence, R.I. Ladyženskaja OA, Solonnikov VA, Ural′ceva NN (1967) Linear and quasilinear equations of parabolic type. Translated from the Russian by S. Smith. Translations of Mathematical Monographs, Vol. 23. American Mathematical Society, Providence, R.I.
Zurück zum Zitat Meyn SP, Tweedie RL (1993) Stability of Markovian processes. III. Foster-Lyapunov criteria for continuous-time processes. Adv Appl Probab 25(3):518–548MathSciNetMATH Meyn SP, Tweedie RL (1993) Stability of Markovian processes. III. Foster-Lyapunov criteria for continuous-time processes. Adv Appl Probab 25(3):518–548MathSciNetMATH
Zurück zum Zitat White DJ (1963) Dynamic programming, Markov chains, and the method of successive approximations. J Math Anal Appl 6:373–376MathSciNetCrossRefMATH White DJ (1963) Dynamic programming, Markov chains, and the method of successive approximations. J Math Anal Appl 6:373–376MathSciNetCrossRefMATH
Zurück zum Zitat Whittle P (1990) Risk-sensitive optimal control. Wiley-Interscience Series in Systems and Optimization. Wiley, Chichester Whittle P (1990) Risk-sensitive optimal control. Wiley-Interscience Series in Systems and Optimization. Wiley, Chichester
Metadaten
Titel
Relative Value Iteration for Stochastic Differential Games
verfasst von
Ari Arapostathis
Vivek S. Borkar
K. Suresh Kumar
Copyright-Jahr
2013
DOI
https://doi.org/10.1007/978-3-319-02690-9_1