Skip to main content
Top
Published in: Dynamic Games and Applications 2/2020

07-06-2019

Nonzero-Sum Stochastic Games with Probability Criteria

Authors: Xiangxiang Huang, Xianping Guo

Published in: Dynamic Games and Applications | Issue 2/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, we consider two-person nonzero-sum discrete-time stochastic games under the probability criterion. Taking \(\lambda \) for player 1 and \(\mu \) for player 2 as their profit goal, the two players are concerned with the probabilities that the rewards they earn before the first passage to some target state set are more than \(\lambda \) and \(\mu \), respectively. We firstly give a characterization of the probabilities, and then, under a mild condition, we show that the optimal value function for each player is the unique solution to the corresponding optimality equation by an iterative approximation, and then establish the existence of Nash equilibria. Finally, a queueing system is provided to show the application of our main result.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
2.
3.
go back to reference Boda K, Filar JA, Lin YL, Spanjers L (2004) Stochastic target hitting time and the problem of early retirement. IEEE Trans Autom Control 49:409–419MathSciNetCrossRef Boda K, Filar JA, Lin YL, Spanjers L (2004) Stochastic target hitting time and the problem of early retirement. IEEE Trans Autom Control 49:409–419MathSciNetCrossRef
4.
go back to reference Cao XR (2003) Semi-Markov decision problems and performance sensitivity analysis. IEEE Trans Autom Control 48:758–769MathSciNetCrossRef Cao XR (2003) Semi-Markov decision problems and performance sensitivity analysis. IEEE Trans Autom Control 48:758–769MathSciNetCrossRef
5.
go back to reference Fan K (1952) Fixed-point and minimax theorems in locally convex topological linear spaces. Proc Nat Acad Sci 38:121–126MathSciNetCrossRef Fan K (1952) Fixed-point and minimax theorems in locally convex topological linear spaces. Proc Nat Acad Sci 38:121–126MathSciNetCrossRef
6.
go back to reference Guo XP, Hernández-Lerma O (2003) Zero-sum games for continuous-time Markov chains with unbounded transition and average payoff rates. J Appl Probab 40:327–345MathSciNetCrossRef Guo XP, Hernández-Lerma O (2003) Zero-sum games for continuous-time Markov chains with unbounded transition and average payoff rates. J Appl Probab 40:327–345MathSciNetCrossRef
7.
go back to reference Guo XP, Hernández-Lerma O (2005) Nonzero-sum games for continuous-time Markov chains with unbounded discounted payoffs. J Appl Probab 42:303–320MathSciNetCrossRef Guo XP, Hernández-Lerma O (2005) Nonzero-sum games for continuous-time Markov chains with unbounded discounted payoffs. J Appl Probab 42:303–320MathSciNetCrossRef
8.
go back to reference Guo XP, Vykertas M, Zhang Y (2013) Absorbing continuous-time Markov decision processes with total cost criteria. Adv Appl Probab 45:490–519MathSciNetCrossRef Guo XP, Vykertas M, Zhang Y (2013) Absorbing continuous-time Markov decision processes with total cost criteria. Adv Appl Probab 45:490–519MathSciNetCrossRef
9.
go back to reference Ghosh MK, Kumar KS, Pal C (2016) Zero-sum risk-sensitive stochastic games for continuous time Markov chains. Stoch Anal Appl 34:835–851MathSciNetCrossRef Ghosh MK, Kumar KS, Pal C (2016) Zero-sum risk-sensitive stochastic games for continuous time Markov chains. Stoch Anal Appl 34:835–851MathSciNetCrossRef
10.
go back to reference Huang YH, Guo XP, Song XY (2011) Performance analysis for controlled semi-Markov systems with application to maintenance. J Optim Theory Appl 150:395–415MathSciNetCrossRef Huang YH, Guo XP, Song XY (2011) Performance analysis for controlled semi-Markov systems with application to maintenance. J Optim Theory Appl 150:395–415MathSciNetCrossRef
11.
go back to reference Huo HF, Zou XL, Guo XP (2017) The risk probability criterion for discounted continuous-time Markov decision processes. Discrete Event Dyn Syst 27:675–699MathSciNetCrossRef Huo HF, Zou XL, Guo XP (2017) The risk probability criterion for discounted continuous-time Markov decision processes. Discrete Event Dyn Syst 27:675–699MathSciNetCrossRef
12.
go back to reference Hernández-Lerma O, Lasserre JB (2001) Zero-sum stochastic games in Borel spaces: average payoff criterion. SIAM J Control Optim 39:1520–1539CrossRef Hernández-Lerma O, Lasserre JB (2001) Zero-sum stochastic games in Borel spaces: average payoff criterion. SIAM J Control Optim 39:1520–1539CrossRef
13.
go back to reference Hernández-Lerma O, Lasserre JB (1996) Discrete-time Markov control processes. Springer, New YorkCrossRef Hernández-Lerma O, Lasserre JB (1996) Discrete-time Markov control processes. Springer, New YorkCrossRef
14.
go back to reference Hernández-Lerma O, Lasserre JB (1999) Further topics on discrete-time Markov control processes. Springer, New YorkCrossRef Hernández-Lerma O, Lasserre JB (1999) Further topics on discrete-time Markov control processes. Springer, New YorkCrossRef
15.
16.
go back to reference Kira A, Ueno T, Fujita T (2012) Threshold probability of non-terminal type in finite horizon Markov decision processes. J Math Anal Appl 386:461–472MathSciNetCrossRef Kira A, Ueno T, Fujita T (2012) Threshold probability of non-terminal type in finite horizon Markov decision processes. J Math Anal Appl 386:461–472MathSciNetCrossRef
17.
19.
go back to reference Sennott LI (1994) Zero-sum stochastic games with unbounded costs: discounted and average cost cases. Zeitschrift für Oper Res 39:209–225MathSciNetMATH Sennott LI (1994) Zero-sum stochastic games with unbounded costs: discounted and average cost cases. Zeitschrift für Oper Res 39:209–225MathSciNetMATH
20.
go back to reference Sakaguchi M, Ohtsubo Y (2013) Markov decision processes associated with two threshold probability criteria. J Control Appl 11:548–557MathSciNetCrossRef Sakaguchi M, Ohtsubo Y (2013) Markov decision processes associated with two threshold probability criteria. J Control Appl 11:548–557MathSciNetCrossRef
21.
go back to reference Sakaguchi M, Ohtsubo Y (2010) Optimal threshold probability and expectation in semi-Markov decision processes. Appl Math Comput 216:2947–2958MathSciNetMATH Sakaguchi M, Ohtsubo Y (2010) Optimal threshold probability and expectation in semi-Markov decision processes. Appl Math Comput 216:2947–2958MathSciNetMATH
22.
go back to reference Wu CB, Lin YL (1999) Minimizing risk models in Markov decision processes with policies depending on target values. J Math Anal Appl 231:47–67MathSciNetCrossRef Wu CB, Lin YL (1999) Minimizing risk models in Markov decision processes with policies depending on target values. J Math Anal Appl 231:47–67MathSciNetCrossRef
23.
go back to reference White DJ (1993) Minimizing a threshold probability in discounted Markov decision processes. J Math Anal Appl 173:634–646MathSciNetCrossRef White DJ (1993) Minimizing a threshold probability in discounted Markov decision processes. J Math Anal Appl 173:634–646MathSciNetCrossRef
24.
go back to reference Wei QD, Chen X (2016) Stochastic games for continuous-time jump processes under finite-horizon payoff criterion. Appl Math Optim 74:273–301MathSciNetCrossRef Wei QD, Chen X (2016) Stochastic games for continuous-time jump processes under finite-horizon payoff criterion. Appl Math Optim 74:273–301MathSciNetCrossRef
25.
go back to reference Zhang WZ, Wang BF, Chen DW (2018) Continuous-time constrained stochastic games with average criteria. Oper Res Lett. 46:109–115MathSciNetCrossRef Zhang WZ, Wang BF, Chen DW (2018) Continuous-time constrained stochastic games with average criteria. Oper Res Lett. 46:109–115MathSciNetCrossRef
Metadata
Title
Nonzero-Sum Stochastic Games with Probability Criteria
Authors
Xiangxiang Huang
Xianping Guo
Publication date
07-06-2019
Publisher
Springer US
Published in
Dynamic Games and Applications / Issue 2/2020
Print ISSN: 2153-0785
Electronic ISSN: 2153-0793
DOI
https://doi.org/10.1007/s13235-019-00317-z

Other articles of this Issue 2/2020

Dynamic Games and Applications 2/2020 Go to the issue

Premium Partner