Skip to main content

2015 | OriginalPaper | Buchkapitel

19. Implementation and Performance Evaluation of the Fully Enclosed Region Upper Confidence Bound Applied to Trees Algorithm

verfasst von : Lin Wu, Ying Li, Chao Deng, Lei Chen, Meiyu Yuan, Hong Jiang

Erschienen in: Proceedings of the 4th International Conference on Computer Engineering and Networks

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

While the playing performance of UCT (upper confidence bound applied to trees) algorithm is nearly the same as that of top professionals on a 9 × 9 Go board, its performance on a 19 × 19 board still needs great improvements. One possible way is to use multiple local UCT searches in parallel by the same computing resources to reach deeper depth. This paper tries to do some tentative work in this regard. After modifying Fuego’s implementation of global UCT search algorithm, we have implemented a fully enclosed region UCT local search algorithm and tested it by 64 standard tsume go problems. Present results show that the fully enclosed region UCT can reduce original branching factors from 5–16 to 2.3 without using any domain knowledge on Go. According to the accuracy of testing results and computing speed, it is promising to do further research on running multiple local UCT searches in parallel.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
3.
Zurück zum Zitat David S, Gerald T. Monte-Carlo Simulation Balancing[C]. Proceedings of the 26th International Conference on Machine Learning. Omni press, Montreal, Canada, 945–952 (2009) David S, Gerald T. Monte-Carlo Simulation Balancing[C]. Proceedings of the 26th International Conference on Machine Learning. Omni press, Montreal, Canada, 945–952 (2009)
4.
Zurück zum Zitat Arpad Rimmel, Fabien Teytaud and Olivier Teytaud. Biasing Monte-Carlo Simulations through RAVE Values[C]. Proceeding CG'10 Proceedings of the 7th international conference on Computers and games. Berlin, Germany, Springer, 59–68(2011) Arpad Rimmel, Fabien Teytaud and Olivier Teytaud. Biasing Monte-Carlo Simulations through RAVE Values[C]. Proceeding CG'10 Proceedings of the 7th international conference on Computers and games. Berlin, Germany, Springer, 59–68(2011)
5.
Zurück zum Zitat Wolf T. Forward pruning and other heuristic search techniques in tsume go. Special Issue Informat Sci. 2000;122:59–76.CrossRef Wolf T. Forward pruning and other heuristic search techniques in tsume go. Special Issue Informat Sci. 2000;122:59–76.CrossRef
6.
Zurück zum Zitat Li L. Solving life and death problems in Go. Comput Appl. 2005;11:2720–2 (in Chinese). Li L. Solving life and death problems in Go. Comput Appl. 2005;11:2720–2 (in Chinese).
7.
Zurück zum Zitat Lin W, Pierre B. A Scalable Machine Learning Approach to Go [C] Advances in Neural Information Processing Systems 19. British Columbia: MIT Press; 2007. p. 1521–8. Lin W, Pierre B. A Scalable Machine Learning Approach to Go [C] Advances in Neural Information Processing Systems 19. British Columbia: MIT Press; 2007. p. 1521–8.
8.
Zurück zum Zitat Lin W, Baldi P. Learning to play Go using recursive neural networks. Neural Netw. 2008;21:1392–400.CrossRef Lin W, Baldi P. Learning to play Go using recursive neural networks. Neural Netw. 2008;21:1392–400.CrossRef
9.
Zurück zum Zitat Auer P, Cesa-Bianchi N, Fischer P. Finite-time analysis of the multi-armed bandit problem. Mach Learn. 2002;47:235–56.CrossRefMATH Auer P, Cesa-Bianchi N, Fischer P. Finite-time analysis of the multi-armed bandit problem. Mach Learn. 2002;47:235–56.CrossRefMATH
10.
Zurück zum Zitat Gelley S, Wang Y. Exploration Exploitation in Go: UCT for Monte-Carlo Go[C]. In: Twentieth Annual Conference on Neural Information Processing Systems (NIPS 2006): On-line Trading of Exploration and Exploitation, British Columbia: MIT Press, (2006). p. 1121–1128 Gelley S, Wang Y. Exploration Exploitation in Go: UCT for Monte-Carlo Go[C]. In: Twentieth Annual Conference on Neural Information Processing Systems (NIPS 2006): On-line Trading of Exploration and Exploitation, British Columbia: MIT Press, (2006). p. 1121–1128
11.
Zurück zum Zitat Coulom R. Efficient selectivity and backup operators in Monte-Carlo tree Search[C]. 5th International conference, CG2006, Springer, p. 72–83(2007) Coulom R. Efficient selectivity and backup operators in Monte-Carlo tree Search[C]. 5th International conference, CG2006, Springer, p. 72–83(2007)
Metadaten
Titel
Implementation and Performance Evaluation of the Fully Enclosed Region Upper Confidence Bound Applied to Trees Algorithm
verfasst von
Lin Wu
Ying Li
Chao Deng
Lei Chen
Meiyu Yuan
Hong Jiang
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-11104-9_19

Neuer Inhalt