Skip to main content

2018 | OriginalPaper | Buchkapitel

Last-Position Elimination-Based Parallel Learning Automata

verfasst von : Yuyang Huang, Hao Ge, Jinchao Huang, Fanming Wang, Shenghong Li

Erschienen in: Communications, Signal Processing, and Systems

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The updating scheme for Learning Automata (LA) is important. Thathachar and Arvind first proposed parallel operation of leaning automata (LA), which was a promising mechanism that could maintain the accuracy while reducing convergence time. In this paper, we implement this mechanism which helps to break the limit of the convergence speed of single LA. In contrast to existing scheme, the proposed scheme eliminates the worst performed LA in the sequence of interactions, until there is only one LA left. We compare Last-Position Elimination-Based Parallel Learning Automata (LEPLA) scheme with the classic one incorporating two pursuit schemes, discretized generalized pursuit algorithm (DGPA) and discretized pursuit algorithm with reward-inaction (DPRI), respectively. Simulations prove that the proposed scheme gets an evidently higher accuracy and faster convergence than the classic ones.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat M.A. Thathachar, P.S. Sastry, Networks of Learning Automata: Techniques for Online Stochastic Optimization (Springer Science and Business Media, 2011) M.A. Thathachar, P.S. Sastry, Networks of Learning Automata: Techniques for Online Stochastic Optimization (Springer Science and Business Media, 2011)
2.
Zurück zum Zitat B.J. Oommen, M. Agache, Continuous and discretized pursuit learning schemes: various algorithms and their comparison. Syst. Man Cybern. Part B Cybern. 31, 277–287 (2001)CrossRef B.J. Oommen, M. Agache, Continuous and discretized pursuit learning schemes: various algorithms and their comparison. Syst. Man Cybern. Part B Cybern. 31, 277–287 (2001)CrossRef
3.
Zurück zum Zitat O. Tilak, R. Martin, S. Mukhopadhyay, Decentralized indirect methods for learning automata games. 41(10) (2011) O. Tilak, R. Martin, S. Mukhopadhyay, Decentralized indirect methods for learning automata games. 41(10) (2011)
4.
Zurück zum Zitat S. Misra, B.J. Oommen, S. Yanamandra, M.S. Obaidat, Random early detection for congestion avoidance in wired networks: a discretized pursuit learning-automata-like solution. Syst. Man Cybern. Part B Cybern. 66–76 (2010) S. Misra, B.J. Oommen, S. Yanamandra, M.S. Obaidat, Random early detection for congestion avoidance in wired networks: a discretized pursuit learning-automata-like solution. Syst. Man Cybern. Part B Cybern. 66–76 (2010)
5.
Zurück zum Zitat B.J. Oommen, M.K. Hashem, Modeling a student classroom interaction in a tutorial-like system using learning automata. Man Cybern. Part B Cybern. 29–42 (2010) B.J. Oommen, M.K. Hashem, Modeling a student classroom interaction in a tutorial-like system using learning automata. Man Cybern. Part B Cybern. 29–42 (2010)
6.
Zurück zum Zitat M.A.L. Thathachar, P.S. Sastry, Estimator algorithms for learning automata, in Platinum Jubilee Conference, vol. 12 (1986), pp. 29–32 M.A.L. Thathachar, P.S. Sastry, Estimator algorithms for learning automata, in Platinum Jubilee Conference, vol. 12 (1986), pp. 29–32
7.
Zurück zum Zitat H. Ge, Y. Wang, J. Li, W. Jiang, S. Li, A novel parallel framework for pursuit learning. Nucl. Phys. B 1–20 (2016) H. Ge, Y. Wang, J. Li, W. Jiang, S. Li, A novel parallel framework for pursuit learning. Nucl. Phys. B 1–20 (2016)
8.
Zurück zum Zitat J. Zhang, C. Wang, M. Zhou, Last-position elimination-based learning automata. Cybernetics 2484–2492 (2014) J. Zhang, C. Wang, M. Zhou, Last-position elimination-based learning automata. Cybernetics 2484–2492 (2014)
9.
Zurück zum Zitat M. Agache, B.J. Oommen, Generalized pursuit learning schemes: new families of continuous and discretized learning automata. Syst. Man Cybern. Part B Cybern. 738–749 (2002) M. Agache, B.J. Oommen, Generalized pursuit learning schemes: new families of continuous and discretized learning automata. Syst. Man Cybern. Part B Cybern. 738–749 (2002)
10.
Zurück zum Zitat M.A.L. Thathachar, B.J. Oommen, Discretized reward-inaction earning automata. J. Cybern. Inform. Sci. 2, 24–29 (1979) M.A.L. Thathachar, B.J. Oommen, Discretized reward-inaction earning automata. J. Cybern. Inform. Sci. 2, 24–29 (1979)
11.
Zurück zum Zitat B.J. Oommen, J.K. Lanctot, Discretized pursuit learning automata. Syst. Man Cybern. 931–938 (1990) B.J. Oommen, J.K. Lanctot, Discretized pursuit learning automata. Syst. Man Cybern. 931–938 (1990)
12.
Zurück zum Zitat R.S. Sutton, A.G. Barto, Reinforcement Learning: An Introduction R.S. Sutton, A.G. Barto, Reinforcement Learning: An Introduction
Metadaten
Titel
Last-Position Elimination-Based Parallel Learning Automata
verfasst von
Yuyang Huang
Hao Ge
Jinchao Huang
Fanming Wang
Shenghong Li
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-3229-5_77

Neuer Inhalt