nach oben

Acta Informatica

Erschienen in:

27.04.2016 | Original Article

Reactive synthesis without regret

verfasst von: Paul Hunter, Guillermo A. Pérez, Jean-François Raskin

Erschienen in: Acta Informatica | Ausgabe 1/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Two-player zero-sum games of infinite duration and their quantitative versions are used in verification to model the interaction between a controller (Eve) and its environment (Adam). The question usually addressed is that of the existence (and computability) of a strategy for Eve that can maximize her payoff against any strategy of Adam. In this work, we are interested in strategies of Eve that minimize her regret, i.e. strategies that minimize the difference between her actual payoff and the payoff she could have achieved if she had known the strategy of Adam in advance. We give algorithms to compute the strategies of Eve that ensure minimal regret against an adversary whose choice of strategy is (1) unrestricted, (2) limited to positional strategies, or (3) limited to word strategies, and show that the two last cases have natural modelling applications. These results apply for quantitative games defined with the classical payoff functions \(\mathsf {Inf}\), \(\mathsf {Sup}\), \({\mathsf {LimInf}}\), \(\mathsf {LimSup}\), and mean-payoff. We also show that our notion of regret minimization in which Adam is limited to word strategies generalizes the notion of good for games introduced by Henzinger and Piterman, and is related to the notion of determinization by pruning due to Aminof, Kupferman and Lampert.

Vorheriger Artikel Special issue: Selected papers from the 26th International Conference on Concurrency Theory (CONCUR 2015)

Nächster Artikel Assume-admissible synthesis

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

W.l.o.g. G is assumed to be total: for each \(v \in V\), there exists \(v' \in V\) such that \((v,v') \in E\).

The values of all functions are not infinite, and therefore in \(\mathbb {R}\) since we deal with finite graphs only.

Since \(\delta _i\) is deterministic, we sometimes write \(\delta _i(p,a)\) to denote the unique \(q \in Q_i\) such that \((p,a,q) \in \delta _i\).

The metric used in [1] is the ratio measure.

Aminof, B., Kupferman, O., Lampert, R.: Reasoning about online algorithms with weighted automata. ACM Transactions on Algorithms (2010)

Aminof, B., Rubin, S.: First cycle games. In: SR, pp. 83–90 (2014)

Bell, D.E.: Regret in decision making under uncertainty. Oper. Res. 30(5), 961–981 (1982)CrossRefMATH

Bloem, R., Chatterjee, K., Greimel, K., Henzinger, T.A., Hofferek, G., Jobstmann, B., Könighofer, B., Könighofer, R.: Synthesizing robust systems. Acta Inf. 51(3–4), 193–220 (2014)MathSciNetCrossRefMATH

Boker, U., Henzinger, TA.: Exact and approximate determinization of discounted-sum automata. LMCS 10(1) (2014). doi:10.2168/LMCS-10(1:10)2014

Brim, L., Chaloupka, J., Doyen, L., Gentilini, R., Raskin, J.-F.: Faster algorithms for mean-payoff games. Form. Methods Syst. Des. 38(2), 97–118 (2011)CrossRefMATH

Chakrabarti, A., de Alfaro, L., Henzinger, T.A., Stoelinga, M.: Resource interfaces. In: EMSOFT, volume 2855 of LNCS, pp. 117–133. Springer (2003)

Chatterjee, K., Doyen, L., Filiot, E., Raskin, JF.: Doomsday equilibria for omega-regular games. In: VMCAI, vol. 8318, pp. 78–97. Springer (2014)

Chatterjee, K., Doyen, L., Henzinger, TA.: Quantitative languages. ACM Transactions on Computational Logic 11(4), 1–38 (2010)

10.

Chatterjee, K., Doyen, L., Henzinger, T.A., Raskin, J.-F.: Generalized mean-payoff and energy games. In: FSTTCS, pp. 505–516 (2010)

11.

Damm, W., Finkbeiner, B.: Does it pay to extend the perimeter of a world model? In: FM, volume 6664 of LNCS, pp. 12–26. Springer (2011)

12.

Degorre, A., Doyen, L., Gentilini, R., Raskin, J.-F., Toruńczyk, S.: Energy and mean-payoff games with imperfect information. In: CSL, pp. 260–274 (2010)

13.

Dziembowski, S., Jurdziński, M., Walukiewicz, I.: How much memory is needed to win infinite games? In: LICS of IEEE computer society, pp. 99–110 (1997)

14.

Ehrenfeucht, A., Mycielski, J.: Positional strategies for mean payoff games. Int. J. Game Theory 8, 109–113 (1979)MathSciNetCrossRefMATH

15.

Eilam-Tzoreff, T.: The disjoint shortest paths problem. Discrete Appl. Math. 85(2), 113–138 (1998)MathSciNetCrossRefMATH

16.

Filiot, E., Le Gall, T., Raskin, J.-F.: Iterated regret minimization in game graphs. In: MFCS, volume 6281 of LNCS, pp. 342–354. Springer (2010)

17.

Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman and Company, New York (1979)MATH

18.

Halpern, J.Y., Pass, R.: Iterated regret minimization: a new solution concept. Games Econ. Behav. 74(1), 184–207 (2012)MathSciNetCrossRefMATH

19.

Henzinger, T.A., Piterman, N.: Solving games without determinization. In CSL, pp. 395–410 (2006)

20.

Hunter, P., Pérez G.A., Raskin, J.-F.: Mean-payoff games with partial-observation-(extended abstract). In: Reachability Problems, pp. 163–175 (2014)

21.

Jurdziński, M.: Deciding the winner in parity games is in \({\sf UP} \cup {\sf coUP}\). IPL 68(3), 119–124 (1998)CrossRefMATH

22.

Jurdzinski, M., Sproston, J., Laroussinie, F.: Model checking probabilistic timed automata with one or two clocks. LMCS 4(3) (2008). doi:10.2168/LMCS-4(3:12)2008

23.

Papadimitriou, C.H., Yannakakis, M.: Shortest paths without a map. TCS 84(1), 127–150 (1991)MathSciNetCrossRefMATH

24.

Piterman, N.: From nondeterministic Büchi and Streett automata to deterministic parity automata. LMCS 3(3) (2007). doi:10.2168/LMCS-3(3:5)2007

25.

Piterman, N., Pnueli, A.: Faster solutions of Rabin and Streett games. In: LICS, pp. 275–284 (2006)

26.

Pnueli, A., Rosner, R.: On the synthesis of a reactive module. In: POPL, pp. 179–190. ACM Press (1989)

27.

Wen, M., Ehlers, R., Topcu, U.: Correct-by-synthesis reinforcement learning with temporal logic constraints. In: IEEE of IROS, pp. 4983–4990 (2015)

28.

Zinkevich, M., Johanson, M., Bowling, M., Piccione, C.: Regret minimization in games with incomplete information. In: NIPS, pp. 905–912 (2008)

29.

Zwick, U., Paterson, M.: The complexity of mean payoff games on graphs. TCS 158(1), 343–359 (1996)MathSciNetCrossRefMATH

Titel: Reactive synthesis without regret
verfasst von: Paul Hunter
Guillermo A. Pérez
Jean-François Raskin
Publikationsdatum: 27.04.2016
Verlag: Springer Berlin Heidelberg
Erschienen in: Acta Informatica / Ausgabe 1/2017
Print ISSN: 0001-5903
Elektronische ISSN: 1432-0525
DOI: https://doi.org/10.1007/s00236-016-0268-z

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Springer Professional "Technik"