2018 | Original Paper | Book chapter

A Re-description Based Developmental Approach to the Generation of Value Functions for Cognitive Robots

Written by: A. Romero, F. Bellas, A. Prieto, R. J. Duro

Published in: Hybrid Artificial Intelligent Systems

Publisher: Springer International Publishing

Abstract

Motivation is a fundamental topic when implementing cognitive architectures aimed at lifelong open-ended learning in autonomous robots. In particular, it is of paramount importance for these types of architectures to be able to establish goals that provide purpose to the robot’s interaction with the world as well as to progressively learn value functions within its state space that allow reaching those goals whatever the starting point. This paper aims at exploring a developmental approach to the generation of high level neural network based value functions in complex continuous state spaces through a re-description process. This process starts by obtaining relatively simple Separable Utility Regions (SURs) which allow the system to consistently achieve goals, although not necessarily in the most efficient manner. The traces obtained by these SURs are then used to provide training data for a neural network based value function. Through a simple experiment with the Robobo robot, we show that this procedure can be more generalizable than attempting to directly obtain the value function through more traditional means.
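
The re-description step described in the abstract, where traces that reached the goal become training data for a neural value function, can be illustrated with a minimal sketch. The abstract does not say how traces are turned into training targets or what network is used, so everything below is an assumption made for illustration: the discounted labelling scheme, the `gamma` parameter, the tiny tanh network, and all function and class names are hypothetical. The SURs themselves are not modelled; the sketch simply takes goal-reaching traces as input.

```python
import math
import random


def trace_to_targets(trace, gamma=0.9):
    """Label each state of a goal-reaching trace with a discounted value.

    Assumption (not from the paper): the final state, where the goal was
    reached, gets 1.0, and a state k steps earlier gets gamma**k.
    """
    last = len(trace) - 1
    return [(state, gamma ** (last - i)) for i, state in enumerate(trace)]


class TinyValueNet:
    """One-hidden-layer tanh network with a linear output, trained by SGD."""

    def __init__(self, n_inputs, n_hidden=8, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0

    def _hidden(self, x):
        return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
                for row, b in zip(self.w1, self.b1)]

    def predict(self, x):
        h = self._hidden(x)
        return sum(w * hj for w, hj in zip(self.w2, h)) + self.b2

    def train_step(self, x, target, lr=0.05):
        h = self._hidden(x)
        y = sum(w * hj for w, hj in zip(self.w2, h)) + self.b2
        err = y - target  # gradient of 0.5 * (y - target)**2 w.r.t. y
        for j, hj in enumerate(h):
            # Backpropagate through tanh before w2[j] is modified.
            grad_h = err * self.w2[j] * (1.0 - hj * hj)
            self.w2[j] -= lr * err * hj
            self.b1[j] -= lr * grad_h
            for i, xi in enumerate(x):
                self.w1[j][i] -= lr * grad_h * xi
        self.b2 -= lr * err


def mean_squared_error(net, samples):
    return sum((net.predict(s) - v) ** 2 for s, v in samples) / len(samples)


def fit_value_function(traces, gamma=0.9, epochs=200, seed=0):
    """Turn traces into (state, value) samples and fit a TinyValueNet."""
    samples = [pair for t in traces for pair in trace_to_targets(t, gamma)]
    net = TinyValueNet(n_inputs=len(samples[0][0]), seed=seed)
    rng = random.Random(seed)
    for _ in range(epochs):
        rng.shuffle(samples)
        for state, value in samples:
            net.train_step(state, value)
    return net, samples
```

Calling `fit_value_function` with a list of traces, each a list of state tuples ending at the goal, returns the fitted network and the labelled samples. Because the network interpolates between labelled states, it can assign a value to states that no trace visited, which is one plausible reading of the generalization benefit the abstract claims.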

Metadata
Title
A Re-description Based Developmental Approach to the Generation of Value Functions for Cognitive Robots
Written by
A. Romero
F. Bellas
A. Prieto
R. J. Duro
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-319-92639-1_56