Abstract
Reinforcement Learning (RL) gained a huge amount of popularity in computer science; applied in fields such as gaming, intelligent robots, remote sensing, and so on. The objective of reinforcement learning is to generate the optimal policy. The main problem of that optimal policy is that it is not fully guaranteed to be satisfied all the system specifications. Model checking is a technique to verify the system to meet the system specifications. PRISM-games is one of the model-checking tools that is used to verify the probabilistic system with competitive or collaborative behavior. Safe Reinforcement Learning via Shielding is a method that uses shield to restrict the action of the RL agent if it violates the specification using temporal logic. This paper presents to compare the winning strategies between three agents; Monte-Carlo Tree Search agent (MCTS), RL agent and shielded RL agent (SRL) which uses PRISM-games to restrict the action based on Tic-Tac-Toe game. Over thousand times of simulations has been made, the experiments show that MCTS agent has the highest win rate compared to other agents, but the losing rate of the shielded agent is reduced by using PRISM-games.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alshiekh, M., Bloem, R., Ehlers, R., Kὄnighofer, B., Niekum, S., Topcu, U.: Safe reinforcement learning via shielding. In: The Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Chen, P., Doan, J., Xu, E.: AI Agents for Ultimate Tic-Tac-Toe, 30 December 2018
Kwiatkowska, M., Norman, G., Parker, D.: PRISM: 4.0: verification of probabilistic real-time systems. In: 23rd International Conference on Computer Aided Verification (2011)
Chen, T., Forejt, V., Kwiakowskam, M., Parker, D., Simaitis, A.: PRISM-games: a model checker for stochastic multiplayer games. In: 19th International Conference on Tools and Algorithms for the Construction and Analysis of Systems (2013)
Ahantab, A., Filip, R.: Formal verification of RL-based approaches
Baier, H., Winands, M.H.M.: Monte-Carlo Tree Search and minimax hybrids
Jamieson, K.: Lecture 19: Monte Carlo Tree Search. CSE599i: Online and Adaptive Machine Learning, Winter (2018)
Mason, G., Calinescu, R., Kudenko, D., Banks, A.: Assured reinforcement learning with formally verified abstract policies. In: 9th International Conference on Agents and Artificial Intelligence (ICAART) (2017)
Mason, G., Calinescu, R., Banks, A.: Assured reinforcement learning for safety-critical applications. In: 10th International Conference on Agents and Artificial Intelligence (ICAART) (2017)
Kwiatkowska, M., Parker, D., Wiltsche, C.: PRISM-games: verification and strategy synthesis for stochastic multi-player games with multiple objectives. Int. J. Softw. Tools Technol. Transf. 20, 195–210 (2018)
Basset, N., Kwiatkowska, M., Wiltsche, C.: Compositional strategy synthesis for stochastic games with multiple objectives
Amrani, M., Lucio, L., Bibal, A.: A survey on the application of machine learning to formal verification
PRISM website. www.prismmodelchecker.org/
PRISM-games website. www.prismmodelchecker.org/games/
Acknowledgments
Foremost, I would like to express my sincere gratitude to my supervisor, Dr. Swe Zin Hlaing for the continuous support of my research, for her patience, motivation, enthusiasm, and immense knowledge. And I would also like to thank all the experts who were involved in developing PRISM and its extension PRISM-Games.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Zaw, H.H., Hlaing, S.Z. (2020). Verifying the Gaming Strategy of Self-learning Game by Using PRISM-Games. In: Vasant, P., Zelinka, I., Weber, GW. (eds) Intelligent Computing and Optimization. ICO 2019. Advances in Intelligent Systems and Computing, vol 1072. Springer, Cham. https://doi.org/10.1007/978-3-030-33585-4_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-33585-4_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33584-7
Online ISBN: 978-3-030-33585-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)