Skip to main content
Top

2021 | OriginalPaper | Chapter

Stabilized Nested Rollout Policy Adaptation

Authors : Tristan Cazenave, Jean-Baptiste Sevestre, Matthieu Toulemont

Published in: Monte Carlo Search

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games. In this paper we propose to modify NRPA in order to improve the stability of the algorithm. Experiments show it improves the algorithm for different application domains: SameGame, Traveling Salesman with Time Windows and Expression Discovery.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bouzy, B.: Monte-Carlo fork search for cooperative path-finding. In: Computer Games - Workshop on Computer Games, CGW 2013, Held in Conjunction with the 23rd International Conference on Artificial Intelligence, IJCAI 2013, Beijing, China, 3 August 2013, Revised Selected Papers, pp. 1–15 (2013) Bouzy, B.: Monte-Carlo fork search for cooperative path-finding. In: Computer Games - Workshop on Computer Games, CGW 2013, Held in Conjunction with the 23rd International Conference on Artificial Intelligence, IJCAI 2013, Beijing, China, 3 August 2013, Revised Selected Papers, pp. 1–15 (2013)
2.
go back to reference Bouzy, B.: Burnt pancake problem: new lower bounds on the diameter and new experimental optimality ratios. In: Proceedings of the Ninth Annual Symposium on Combinatorial Search, SOCS 2016, Tarrytown, NY, USA, 6–8 July 2016, pp. 119–120 (2016) Bouzy, B.: Burnt pancake problem: new lower bounds on the diameter and new experimental optimality ratios. In: Proceedings of the Ninth Annual Symposium on Combinatorial Search, SOCS 2016, Tarrytown, NY, USA, 6–8 July 2016, pp. 119–120 (2016)
4.
go back to reference Cazenave, T.: Nested Monte-Carlo search. In: Boutilier, C. (ed.) IJCAI, pp. 456–461 (2009) Cazenave, T.: Nested Monte-Carlo search. In: Boutilier, C. (ed.) IJCAI, pp. 456–461 (2009)
7.
go back to reference Cazenave, T.: Nested rollout policy adaptation with selective policies. In: CGW at IJCAI 2016 (2016) Cazenave, T.: Nested rollout policy adaptation with selective policies. In: CGW at IJCAI 2016 (2016)
8.
go back to reference Cazenave, T., Hamida, S.B.: Forecasting financial volatility using nested Monte Carlo expression discovery. In: IEEE Symposium Series on Computational Intelligence, SSCI 2015, Cape Town, South Africa, 7–10 December 2015, pp. 726–733 (2015). https://doi.org/10.1109/SSCI.2015.110 Cazenave, T., Hamida, S.B.: Forecasting financial volatility using nested Monte Carlo expression discovery. In: IEEE Symposium Series on Computational Intelligence, SSCI 2015, Cape Town, South Africa, 7–10 December 2015, pp. 726–733 (2015). https://​doi.​org/​10.​1109/​SSCI.​2015.​110
12.
go back to reference Edelkamp, S., Gath, M., Cazenave, T., Teytaud, F.: Algorithm and knowledge engineering for the TSPTW problem. In: 2013 IEEE Symposium on Computational Intelligence in Scheduling (SCIS), pp. 44–51. IEEE (2013) Edelkamp, S., Gath, M., Cazenave, T., Teytaud, F.: Algorithm and knowledge engineering for the TSPTW problem. In: 2013 IEEE Symposium on Computational Intelligence in Scheduling (SCIS), pp. 44–51. IEEE (2013)
15.
go back to reference Edelkamp, S., Greulich, C.: Solving physical traveling salesman problems with policy adaptation. In: 2014 IEEE Conference on Computational Intelligence and Games (CIG), pp. 1–8. IEEE (2014) Edelkamp, S., Greulich, C.: Solving physical traveling salesman problems with policy adaptation. In: 2014 IEEE Conference on Computational Intelligence and Games (CIG), pp. 1–8. IEEE (2014)
16.
go back to reference Edelkamp, S., Tang, Z.: Monte-Carlo tree search for the multiple sequence alignment problem. In: Eighth Annual Symposium on Combinatorial Search (2015) Edelkamp, S., Tang, Z.: Monte-Carlo tree search for the multiple sequence alignment problem. In: Eighth Annual Symposium on Combinatorial Search (2015)
18.
go back to reference Koza, J.R., et al.: Genetic programming II, vol. 17. MIT Press, Cambridge (1994) Koza, J.R., et al.: Genetic programming II, vol. 17. MIT Press, Cambridge (1994)
19.
go back to reference Langdon, W.B., Poli, R., et al.: An analysis of the max problem in genetic programming. Genet. Program. 1(997), 222–230 (1997) Langdon, W.B., Poli, R., et al.: An analysis of the max problem in genetic programming. Genet. Program. 1(997), 222–230 (1997)
20.
go back to reference Méhat, J., Cazenave, T.: Combining UCT and nested Monte Carlo search for single-player general game playing. IEEE Trans. Comput. Intell. AI Games 2(4), 271–277 (2010)CrossRef Méhat, J., Cazenave, T.: Combining UCT and nested Monte Carlo search for single-player general game playing. IEEE Trans. Comput. Intell. AI Games 2(4), 271–277 (2010)CrossRef
21.
go back to reference Nagorko, A.: Parallel nested rollout policy adaptation. In: IEEE Conference on Games (2019) Nagorko, A.: Parallel nested rollout policy adaptation. In: IEEE Conference on Games (2019)
22.
go back to reference Négrevergne, B., Cazenave, T.: Distributed nested rollout policy for samegame. In: Computer Games - 6th Workshop, CGW 2017, Held in Conjunction with the 26th International Conference on Artificial Intelligence, IJCAI 2017, Melbourne, VIC, Australia, 20 August 2017, Revised Selected Papers, pp. 108–120 (2017). https://doi.org/10.1007/978-3-319-75931-9_8 Négrevergne, B., Cazenave, T.: Distributed nested rollout policy for samegame. In: Computer Games - 6th Workshop, CGW 2017, Held in Conjunction with the 26th International Conference on Artificial Intelligence, IJCAI 2017, Melbourne, VIC, Australia, 20 August 2017, Revised Selected Papers, pp. 108–120 (2017). https://​doi.​org/​10.​1007/​978-3-319-75931-9_​8
23.
go back to reference Portela, F.: An unexpectedly effective Monte Carlo technique for the RNA inverse folding problem. bioRxiv, p. 345587 (2018) Portela, F.: An unexpectedly effective Monte Carlo technique for the RNA inverse folding problem. bioRxiv, p. 345587 (2018)
24.
go back to reference Potvin, J.Y., Bengio, S.: The vehicle routing problem with time windows part II: genetic search. INFORMS J. Comput. 8(2), 165–172 (1996)CrossRef Potvin, J.Y., Bengio, S.: The vehicle routing problem with time windows part II: genetic search. INFORMS J. Comput. 8(2), 165–172 (1996)CrossRef
25.
go back to reference Poulding, S.M., Feldt, R.: Generating structured test data with specific properties using nested Monte-Carlo search. In: Genetic and Evolutionary Computation Conference, GECCO 2014, Vancouver, BC, Canada, 12–16 July 2014, pp. 1279–1286 (2014) Poulding, S.M., Feldt, R.: Generating structured test data with specific properties using nested Monte-Carlo search. In: Genetic and Evolutionary Computation Conference, GECCO 2014, Vancouver, BC, Canada, 12–16 July 2014, pp. 1279–1286 (2014)
26.
go back to reference Poulding, S.M., Feldt, R.: Heuristic model checking using a Monte-Carlo tree search algorithm. In: Proceedings of the Genetic and Evolutionary Computation Conference, GECCO 2015, Madrid, Spain, 11–15 July 2015, pp. 1359–1366 (2015) Poulding, S.M., Feldt, R.: Heuristic model checking using a Monte-Carlo tree search algorithm. In: Proceedings of the Genetic and Evolutionary Computation Conference, GECCO 2015, Madrid, Spain, 11–15 July 2015, pp. 1359–1366 (2015)
28.
go back to reference Rosin, C.D.: Nested rollout policy adaptation for Monte Carlo tree search. In: IJCAI, pp. 649–654 (2011) Rosin, C.D.: Nested rollout policy adaptation for Monte Carlo tree search. In: IJCAI, pp. 649–654 (2011)
Metadata
Title
Stabilized Nested Rollout Policy Adaptation
Authors
Tristan Cazenave
Jean-Baptiste Sevestre
Matthieu Toulemont
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-89453-5_2

Premium Partner