Top

Dynamic Games and Applications

Published in:

07-06-2019

Nonzero-Sum Stochastic Games with Probability Criteria

Authors: Xiangxiang Huang, Xianping Guo

Published in: Dynamic Games and Applications | Issue 2/2020

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

In this paper, we consider two-person nonzero-sum discrete-time stochastic games under the probability criterion. Taking \(\lambda \) for player 1 and \(\mu \) for player 2 as their profit goal, the two players are concerned with the probabilities that the rewards they earn before the first passage to some target state set are more than \(\lambda \) and \(\mu \), respectively. We firstly give a characterization of the probabilities, and then, under a mild condition, we show that the optimal value function for each player is the unique solution to the corresponding optimality equation by an iterative approximation, and then establish the existence of Nash equilibria. Finally, a queueing system is provided to show the application of our main result.

previous article Evolution of Behavior When Duopolists Choose Prices and Quantities

next article Evolution of a Collusive Price in a Networked Market

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Altman E (1994) Flow control using the theory of zero-sum Markov games. IEEE Trans Autom Control 39:814–818MathSciNetCrossRef

Bouakiz M, Kebir Y (1995) Target-level criterion in Markov decision processes. J Optim Theory Appl 86:1–15MathSciNetCrossRef

Boda K, Filar JA, Lin YL, Spanjers L (2004) Stochastic target hitting time and the problem of early retirement. IEEE Trans Autom Control 49:409–419MathSciNetCrossRef

Cao XR (2003) Semi-Markov decision problems and performance sensitivity analysis. IEEE Trans Autom Control 48:758–769MathSciNetCrossRef

Fan K (1952) Fixed-point and minimax theorems in locally convex topological linear spaces. Proc Nat Acad Sci 38:121–126MathSciNetCrossRef

Guo XP, Hernández-Lerma O (2003) Zero-sum games for continuous-time Markov chains with unbounded transition and average payoff rates. J Appl Probab 40:327–345MathSciNetCrossRef

Guo XP, Hernández-Lerma O (2005) Nonzero-sum games for continuous-time Markov chains with unbounded discounted payoffs. J Appl Probab 42:303–320MathSciNetCrossRef

Guo XP, Vykertas M, Zhang Y (2013) Absorbing continuous-time Markov decision processes with total cost criteria. Adv Appl Probab 45:490–519MathSciNetCrossRef

Ghosh MK, Kumar KS, Pal C (2016) Zero-sum risk-sensitive stochastic games for continuous time Markov chains. Stoch Anal Appl 34:835–851MathSciNetCrossRef

10.

Huang YH, Guo XP, Song XY (2011) Performance analysis for controlled semi-Markov systems with application to maintenance. J Optim Theory Appl 150:395–415MathSciNetCrossRef

11.

Huo HF, Zou XL, Guo XP (2017) The risk probability criterion for discounted continuous-time Markov decision processes. Discrete Event Dyn Syst 27:675–699MathSciNetCrossRef

12.

Hernández-Lerma O, Lasserre JB (2001) Zero-sum stochastic games in Borel spaces: average payoff criterion. SIAM J Control Optim 39:1520–1539CrossRef

13.

Hernández-Lerma O, Lasserre JB (1996) Discrete-time Markov control processes. Springer, New YorkCrossRef

14.

Hernández-Lerma O, Lasserre JB (1999) Further topics on discrete-time Markov control processes. Springer, New YorkCrossRef

15.

Huang XX, Guo XP, Peng JP (2017) A probability criterion for zero-sum stochastic games. J Dyn Games 4:369–383MathSciNetCrossRef

16.

Kira A, Ueno T, Fujita T (2012) Threshold probability of non-terminal type in finite horizon Markov decision processes. J Math Anal Appl 386:461–472MathSciNetCrossRef

17.

Liu QL, Huang XX (2017) Discrete-time zero-sum Markov games with first passage criteria. Optimization 66:571–587MathSciNetCrossRef

18.

Shapley LS (1953) Stochastic games. Proc Nat Acad Sci 39:1095–1100MathSciNetCrossRef

19.

Sennott LI (1994) Zero-sum stochastic games with unbounded costs: discounted and average cost cases. Zeitschrift für Oper Res 39:209–225MathSciNetMATH

20.

Sakaguchi M, Ohtsubo Y (2013) Markov decision processes associated with two threshold probability criteria. J Control Appl 11:548–557MathSciNetCrossRef

21.

Sakaguchi M, Ohtsubo Y (2010) Optimal threshold probability and expectation in semi-Markov decision processes. Appl Math Comput 216:2947–2958MathSciNetMATH

22.

Wu CB, Lin YL (1999) Minimizing risk models in Markov decision processes with policies depending on target values. J Math Anal Appl 231:47–67MathSciNetCrossRef

23.

White DJ (1993) Minimizing a threshold probability in discounted Markov decision processes. J Math Anal Appl 173:634–646MathSciNetCrossRef

24.

Wei QD, Chen X (2016) Stochastic games for continuous-time jump processes under finite-horizon payoff criterion. Appl Math Optim 74:273–301MathSciNetCrossRef

25.

Zhang WZ, Wang BF, Chen DW (2018) Continuous-time constrained stochastic games with average criteria. Oper Res Lett. 46:109–115MathSciNetCrossRef

Title: Nonzero-Sum Stochastic Games with Probability Criteria
Authors: Xiangxiang Huang
Xianping Guo
Publication date: 07-06-2019
Publisher: Springer US
Published in: Dynamic Games and Applications / Issue 2/2020
Print ISSN: 2153-0785
Electronic ISSN: 2153-0793
DOI: https://doi.org/10.1007/s13235-019-00317-z

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Other articles of this Issue 2/2020

A Differential Game with Exit Costs

Self-consistent Feedback Stackelberg Equilibria for Infinite Horizon Stochastic Games

Limit Optimal Trajectories in Zero-Sum Stochastic Games

Long-Time Behavior of First-Order Mean Field Games on Euclidean Space

Evasive Path Planning Under Surveillance Uncertainty

Optimality, Equilibrium, and Curb Sets in Decision Problems Without Commitment

Premium Partner