Skip to main content
Top

2018 | OriginalPaper | Chapter

Use of Neural Networks in Q-Learning Algorithm

Authors : Nataliya Boyko, Volodymyr Korkishko, Bohdan Dohnyak, Olena Vovk

Published in: Computer and Information Sciences

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This article is devoted to the algorithm of training with reinforcement (reinforcement learning). This article will cover various modifications of the Q-Learning algorithm, along with its techniques, which can accelerate learning through the use of neural networks. We also talk about different ways of approximating the tables of this algorithm, consider its implementation in the code and analyze its behavior in different environments. We set the optimal parameters for its implementation, and we will evaluate its performance in two parameters: the number of necessary neural network weight corrections and quality of training.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Leskovec J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets, p. 470. Cambridge University Press, Massachusetts (2014) Leskovec J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets, p. 470. Cambridge University Press, Massachusetts (2014)
2.
go back to reference Mayer-Schoenberger V., Cukier, K.: A Revolution That Will Transform How We Live, Work, and Think, p. 230. Houghton Mifflin Harcourt, Boston, New York (2013) Mayer-Schoenberger V., Cukier, K.: A Revolution That Will Transform How We Live, Work, and Think, p. 230. Houghton Mifflin Harcourt, Boston, New York (2013)
3.
go back to reference Boyko N., Shakhovska N., Sviridova N.: Use of machine learning in the forecast of clinical consequences of cancer diseases. In: 7th Mediterranean Conference on Embedded Computing - MECO 2018, pp. 531–536. IEEE (2018) Boyko N., Shakhovska N., Sviridova N.: Use of machine learning in the forecast of clinical consequences of cancer diseases. In: 7th Mediterranean Conference on Embedded Computing - MECO 2018, pp. 531–536. IEEE (2018)
4.
go back to reference Maass, W., Natschger, T., Markram, H.: Real-time computing without stable states: a new framework for neural computations based on perturbations. In: Neural Computation: Proceedings, Institute for Theoretical Computer Science, Switzerland, vol. 11, pp. 2531–2560 (2002)CrossRef Maass, W., Natschger, T., Markram, H.: Real-time computing without stable states: a new framework for neural computations based on perturbations. In: Neural Computation: Proceedings, Institute for Theoretical Computer Science, Switzerland, vol. 11, pp. 2531–2560 (2002)CrossRef
5.
go back to reference Schrauwen, B., Verstraeten, D., Campenhout, J.V.: An overview of reservoir computing theory, applications and implementations. In: Proceedings of the 15th European Symposium on Artificial Neural Networks, Belgium, Bruges, pp. 471–482 (2007) Schrauwen, B., Verstraeten, D., Campenhout, J.V.: An overview of reservoir computing theory, applications and implementations. In: Proceedings of the 15th European Symposium on Artificial Neural Networks, Belgium, Bruges, pp. 471–482 (2007)
6.
go back to reference Coombes, S.: Waves, bumps, and patterns in neural field theories. Biol. Cybern. 93(2), 91–108 (2005). Proceedings. University of Nottingham, Nottingham Coombes, S.: Waves, bumps, and patterns in neural field theories. Biol. Cybern. 93(2), 91–108 (2005). Proceedings. University of Nottingham, Nottingham
7.
go back to reference Antonopoulos, N., Gillam, L (eds).: Cloud Computing: Principles, Systems and Applications, p. 379. Springer, London (2010)MATH Antonopoulos, N., Gillam, L (eds).: Cloud Computing: Principles, Systems and Applications, p. 379. Springer, London (2010)MATH
8.
go back to reference Gosavi, N., Shinde, S.S., Dhakulkar, B.: Use of cloud computing in library and information science field. Int. J. Digital Library Serv. 2(3), 51–60 (2012) Gosavi, N., Shinde, S.S., Dhakulkar, B.: Use of cloud computing in library and information science field. Int. J. Digital Library Serv. 2(3), 51–60 (2012)
9.
go back to reference Dhamdhere, S.N. (ed.).: Cloud Computing and Virtualization, p. 385 (2013) Dhamdhere, S.N. (ed.).: Cloud Computing and Virtualization, p. 385 (2013)
10.
go back to reference Monirul Islam, M.: Necessity of cloud computing for digital libraries: Bangladesh perspective. In: International Conference on Digital Libraries (ICDL) Vision 2020: Looking Back 10 Years and Forging New Frontiers, pp. 513–524 (2013) Monirul Islam, M.: Necessity of cloud computing for digital libraries: Bangladesh perspective. In: International Conference on Digital Libraries (ICDL) Vision 2020: Looking Back 10 Years and Forging New Frontiers, pp. 513–524 (2013)
11.
go back to reference Mell, P., Grance, T.: The NIST Definition of Cloud Computing: Recommendations of the National Institute of Standards and Technology (2011) Mell, P., Grance, T.: The NIST Definition of Cloud Computing: Recommendations of the National Institute of Standards and Technology (2011)
12.
go back to reference Boyko, N.I.: Perspective technologies of study of large data in distributed information systems. Radioelectronics, Computer Science, Management, vol. 4, pp. 66–77. Zaporizhzhya National Technical University, Zaporozhye (2017) Boyko, N.I.: Perspective technologies of study of large data in distributed information systems. Radioelectronics, Computer Science, Management, vol. 4, pp. 66–77. Zaporizhzhya National Technical University, Zaporozhye (2017)
Metadata
Title
Use of Neural Networks in Q-Learning Algorithm
Authors
Nataliya Boyko
Volodymyr Korkishko
Bohdan Dohnyak
Olena Vovk
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-00840-6_21

Premium Partner