Skip to main content
Erschienen in: Acta Informatica 1/2017

27.04.2016 | Original Article

Reactive synthesis without regret

verfasst von: Paul Hunter, Guillermo A. Pérez, Jean-François Raskin

Erschienen in: Acta Informatica | Ausgabe 1/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Two-player zero-sum games of infinite duration and their quantitative versions are used in verification to model the interaction between a controller (Eve) and its environment (Adam). The question usually addressed is that of the existence (and computability) of a strategy for Eve that can maximize her payoff against any strategy of Adam. In this work, we are interested in strategies of Eve that minimize her regret, i.e. strategies that minimize the difference between her actual payoff and the payoff she could have achieved if she had known the strategy of Adam in advance. We give algorithms to compute the strategies of Eve that ensure minimal regret against an adversary whose choice of strategy is (1) unrestricted, (2) limited to positional strategies, or (3) limited to word strategies, and show that the two last cases have natural modelling applications. These results apply for quantitative games defined with the classical payoff functions \(\mathsf {Inf}\), \(\mathsf {Sup}\), \({\mathsf {LimInf}}\), \(\mathsf {LimSup}\), and mean-payoff. We also show that our notion of regret minimization in which Adam is limited to word strategies generalizes the notion of good for games introduced by Henzinger and Piterman, and is related to the notion of determinization by pruning due to Aminof, Kupferman and Lampert.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
W.l.o.g. G is assumed to be total: for each \(v \in V\), there exists \(v' \in V\) such that \((v,v') \in E\).
 
2
The values of all functions are not infinite, and therefore in \(\mathbb {R}\) since we deal with finite graphs only.
 
3
Since \(\delta _i\) is deterministic, we sometimes write \(\delta _i(p,a)\) to denote the unique \(q \in Q_i\) such that \((p,a,q) \in \delta _i\).
 
4
The metric used in [1] is the ratio measure.
 
Literatur
1.
Zurück zum Zitat Aminof, B., Kupferman, O., Lampert, R.: Reasoning about online algorithms with weighted automata. ACM Transactions on Algorithms (2010) Aminof, B., Kupferman, O., Lampert, R.: Reasoning about online algorithms with weighted automata. ACM Transactions on Algorithms (2010)
2.
Zurück zum Zitat Aminof, B., Rubin, S.: First cycle games. In: SR, pp. 83–90 (2014) Aminof, B., Rubin, S.: First cycle games. In: SR, pp. 83–90 (2014)
3.
Zurück zum Zitat Bell, D.E.: Regret in decision making under uncertainty. Oper. Res. 30(5), 961–981 (1982)CrossRefMATH Bell, D.E.: Regret in decision making under uncertainty. Oper. Res. 30(5), 961–981 (1982)CrossRefMATH
4.
Zurück zum Zitat Bloem, R., Chatterjee, K., Greimel, K., Henzinger, T.A., Hofferek, G., Jobstmann, B., Könighofer, B., Könighofer, R.: Synthesizing robust systems. Acta Inf. 51(3–4), 193–220 (2014)MathSciNetCrossRefMATH Bloem, R., Chatterjee, K., Greimel, K., Henzinger, T.A., Hofferek, G., Jobstmann, B., Könighofer, B., Könighofer, R.: Synthesizing robust systems. Acta Inf. 51(3–4), 193–220 (2014)MathSciNetCrossRefMATH
6.
Zurück zum Zitat Brim, L., Chaloupka, J., Doyen, L., Gentilini, R., Raskin, J.-F.: Faster algorithms for mean-payoff games. Form. Methods Syst. Des. 38(2), 97–118 (2011)CrossRefMATH Brim, L., Chaloupka, J., Doyen, L., Gentilini, R., Raskin, J.-F.: Faster algorithms for mean-payoff games. Form. Methods Syst. Des. 38(2), 97–118 (2011)CrossRefMATH
7.
Zurück zum Zitat Chakrabarti, A., de Alfaro, L., Henzinger, T.A., Stoelinga, M.: Resource interfaces. In: EMSOFT, volume 2855 of LNCS, pp. 117–133. Springer (2003) Chakrabarti, A., de Alfaro, L., Henzinger, T.A., Stoelinga, M.: Resource interfaces. In: EMSOFT, volume 2855 of LNCS, pp. 117–133. Springer (2003)
8.
Zurück zum Zitat Chatterjee, K., Doyen, L., Filiot, E., Raskin, JF.: Doomsday equilibria for omega-regular games. In: VMCAI, vol. 8318, pp. 78–97. Springer (2014) Chatterjee, K., Doyen, L., Filiot, E., Raskin, JF.: Doomsday equilibria for omega-regular games. In: VMCAI, vol. 8318, pp. 78–97. Springer (2014)
9.
Zurück zum Zitat Chatterjee, K., Doyen, L., Henzinger, TA.: Quantitative languages. ACM Transactions on Computational Logic 11(4), 1–38 (2010) Chatterjee, K., Doyen, L., Henzinger, TA.: Quantitative languages. ACM Transactions on Computational Logic 11(4), 1–38 (2010)
10.
Zurück zum Zitat Chatterjee, K., Doyen, L., Henzinger, T.A., Raskin, J.-F.: Generalized mean-payoff and energy games. In: FSTTCS, pp. 505–516 (2010) Chatterjee, K., Doyen, L., Henzinger, T.A., Raskin, J.-F.: Generalized mean-payoff and energy games. In: FSTTCS, pp. 505–516 (2010)
11.
Zurück zum Zitat Damm, W., Finkbeiner, B.: Does it pay to extend the perimeter of a world model? In: FM, volume 6664 of LNCS, pp. 12–26. Springer (2011) Damm, W., Finkbeiner, B.: Does it pay to extend the perimeter of a world model? In: FM, volume 6664 of LNCS, pp. 12–26. Springer (2011)
12.
Zurück zum Zitat Degorre, A., Doyen, L., Gentilini, R., Raskin, J.-F., Toruńczyk, S.: Energy and mean-payoff games with imperfect information. In: CSL, pp. 260–274 (2010) Degorre, A., Doyen, L., Gentilini, R., Raskin, J.-F., Toruńczyk, S.: Energy and mean-payoff games with imperfect information. In: CSL, pp. 260–274 (2010)
13.
Zurück zum Zitat Dziembowski, S., Jurdziński, M., Walukiewicz, I.: How much memory is needed to win infinite games? In: LICS of IEEE computer society, pp. 99–110 (1997) Dziembowski, S., Jurdziński, M., Walukiewicz, I.: How much memory is needed to win infinite games? In: LICS of IEEE computer society, pp. 99–110 (1997)
14.
16.
Zurück zum Zitat Filiot, E., Le Gall, T., Raskin, J.-F.: Iterated regret minimization in game graphs. In: MFCS, volume 6281 of LNCS, pp. 342–354. Springer (2010) Filiot, E., Le Gall, T., Raskin, J.-F.: Iterated regret minimization in game graphs. In: MFCS, volume 6281 of LNCS, pp. 342–354. Springer (2010)
17.
Zurück zum Zitat Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman and Company, New York (1979)MATH Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman and Company, New York (1979)MATH
18.
19.
Zurück zum Zitat Henzinger, T.A., Piterman, N.: Solving games without determinization. In CSL, pp. 395–410 (2006) Henzinger, T.A., Piterman, N.: Solving games without determinization. In CSL, pp. 395–410 (2006)
20.
Zurück zum Zitat Hunter, P., Pérez G.A., Raskin, J.-F.: Mean-payoff games with partial-observation-(extended abstract). In: Reachability Problems, pp. 163–175 (2014) Hunter, P., Pérez G.A., Raskin, J.-F.: Mean-payoff games with partial-observation-(extended abstract). In: Reachability Problems, pp. 163–175 (2014)
21.
Zurück zum Zitat Jurdziński, M.: Deciding the winner in parity games is in \({\sf UP} \cup {\sf coUP}\). IPL 68(3), 119–124 (1998)CrossRefMATH Jurdziński, M.: Deciding the winner in parity games is in \({\sf UP} \cup {\sf coUP}\). IPL 68(3), 119–124 (1998)CrossRefMATH
25.
Zurück zum Zitat Piterman, N., Pnueli, A.: Faster solutions of Rabin and Streett games. In: LICS, pp. 275–284 (2006) Piterman, N., Pnueli, A.: Faster solutions of Rabin and Streett games. In: LICS, pp. 275–284 (2006)
26.
Zurück zum Zitat Pnueli, A., Rosner, R.: On the synthesis of a reactive module. In: POPL, pp. 179–190. ACM Press (1989) Pnueli, A., Rosner, R.: On the synthesis of a reactive module. In: POPL, pp. 179–190. ACM Press (1989)
27.
Zurück zum Zitat Wen, M., Ehlers, R., Topcu, U.: Correct-by-synthesis reinforcement learning with temporal logic constraints. In: IEEE of IROS, pp. 4983–4990 (2015) Wen, M., Ehlers, R., Topcu, U.: Correct-by-synthesis reinforcement learning with temporal logic constraints. In: IEEE of IROS, pp. 4983–4990 (2015)
28.
Zurück zum Zitat Zinkevich, M., Johanson, M., Bowling, M., Piccione, C.: Regret minimization in games with incomplete information. In: NIPS, pp. 905–912 (2008) Zinkevich, M., Johanson, M., Bowling, M., Piccione, C.: Regret minimization in games with incomplete information. In: NIPS, pp. 905–912 (2008)
Metadaten
Titel
Reactive synthesis without regret
verfasst von
Paul Hunter
Guillermo A. Pérez
Jean-François Raskin
Publikationsdatum
27.04.2016
Verlag
Springer Berlin Heidelberg
Erschienen in
Acta Informatica / Ausgabe 1/2017
Print ISSN: 0001-5903
Elektronische ISSN: 1432-0525
DOI
https://doi.org/10.1007/s00236-016-0268-z