Skip to main content
Top
Published in:
Cover of the book

2018 | OriginalPaper | Chapter

Task and Spatial Planning by the Cognitive Agent with Human-Like Knowledge Representation

Authors : Ermek Aitygulov, Gleb Kiselev, Aleksandr I. Panov

Published in: Interactive Collaborative Robotics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The paper considers the task of simultaneous learning and planning actions for moving a cognitive agent in two-dimensional space. Planning is carried out by an agent who uses an anthropic way of knowledge representation that allows him to build transparent and understood planes, which is especially important in case of human-machine interaction. Learning actions to manipulate objects is carried out through reinforcement learning and demonstrates the possibilities of replenishing the agent’s procedural knowledge. The presented approach was demonstrated in an experiment in the Gazebo simulation environment.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Laird, J.E.: The Soar Cognitive Architecture. MIT Press, Cambridge (2012) Laird, J.E.: The Soar Cognitive Architecture. MIT Press, Cambridge (2012)
2.
go back to reference Sun, R., Hlie, S.: Psychologically realistic cognitive agents: taking human cognition seriously. J. Exp. Theor. Artif. Intell. 25, 65–92 (2012)CrossRef Sun, R., Hlie, S.: Psychologically realistic cognitive agents: taking human cognition seriously. J. Exp. Theor. Artif. Intell. 25, 65–92 (2012)CrossRef
3.
go back to reference Osipov, G.S., Panov, A.I., Chudova, N.V.: Behavior control as a function of consciousness. I. World model and goal setting. J. Comput. Syst. Sci. Int. 53, 517–529 (2014)MathSciNetCrossRef Osipov, G.S., Panov, A.I., Chudova, N.V.: Behavior control as a function of consciousness. I. World model and goal setting. J. Comput. Syst. Sci. Int. 53, 517–529 (2014)MathSciNetCrossRef
4.
go back to reference Osipov, G.S., Panov, A.I., Chudova, N.V.: Behavior control as a function of consciousness. II. Synthesis of a behavior plan. J. Comput. Syst. Sci. Int. 54, 882–896 (2015)MathSciNetCrossRef Osipov, G.S., Panov, A.I., Chudova, N.V.: Behavior control as a function of consciousness. II. Synthesis of a behavior plan. J. Comput. Syst. Sci. Int. 54, 882–896 (2015)MathSciNetCrossRef
5.
go back to reference Panov, A.I.: Behavior planning of intelligent agent with sign world model. Biol. Inspired Cogn. Archit. 19, 21–31 (2017) Panov, A.I.: Behavior planning of intelligent agent with sign world model. Biol. Inspired Cogn. Archit. 19, 21–31 (2017)
6.
go back to reference Leontyev, A.N.: The Development of Mind. Erythros Press and Media, Kettering (2009) Leontyev, A.N.: The Development of Mind. Erythros Press and Media, Kettering (2009)
7.
go back to reference Vygotsky, L.S.: Thought and Language. MIT Press, Cambridge (1986) Vygotsky, L.S.: Thought and Language. MIT Press, Cambridge (1986)
8.
go back to reference Pospelov, D.A., Osipov, G.S.: Knowledge in semiotic models. In: Proceedings of the Second Workshop on Applied Semiotics, Seventh International Conference on Artificial Intelligence and Information-Control Systems of Robots (AIICSR 1997), Bratislava, pp. 1–12 (1997) Pospelov, D.A., Osipov, G.S.: Knowledge in semiotic models. In: Proceedings of the Second Workshop on Applied Semiotics, Seventh International Conference on Artificial Intelligence and Information-Control Systems of Robots (AIICSR 1997), Bratislava, pp. 1–12 (1997)
9.
go back to reference Emelyanov, S., Makarov, D., Panov, A.I., Yakovlev, K.: Multilayer cognitive architecture for UAV control. Cogn. Syst. Res. 39, 58–72 (2016)CrossRef Emelyanov, S., Makarov, D., Panov, A.I., Yakovlev, K.: Multilayer cognitive architecture for UAV control. Cogn. Syst. Res. 39, 58–72 (2016)CrossRef
10.
go back to reference Brooks, R.A.: Intelligence without representation. Artif. Intell. 47, 139–159 (1991)CrossRef Brooks, R.A.: Intelligence without representation. Artif. Intell. 47, 139–159 (1991)CrossRef
11.
go back to reference Siagian, C., Itti, L.: Biologically-inspired robotics vision Monte-Carlo localization in the outdoor environment. In: IEEE International Conference on Intelligent Robots and Systems, pp. 1723–1730 (2007) Siagian, C., Itti, L.: Biologically-inspired robotics vision Monte-Carlo localization in the outdoor environment. In: IEEE International Conference on Intelligent Robots and Systems, pp. 1723–1730 (2007)
12.
go back to reference Schulman, J., Levine, S., Moritz, P., Jordan, M., Abbeel, P.: Trust region policy optimization (2015) Schulman, J., Levine, S., Moritz, P., Jordan, M., Abbeel, P.: Trust region policy optimization (2015)
13.
go back to reference Kakade, S.: A natural policy gradient (2002) Kakade, S.: A natural policy gradient (2002)
14.
go back to reference Daniel, K., Nash, A., Koenig, S., Felner, A.: Theta*: any-angle path planning on grids. J. Artif. Intell. Res. 39, 533–579 (2010)MathSciNetCrossRef Daniel, K., Nash, A., Koenig, S., Felner, A.: Theta*: any-angle path planning on grids. J. Artif. Intell. Res. 39, 533–579 (2010)MathSciNetCrossRef
15.
go back to reference Palacios, J.C., Olayo, M.G., Cruz, G.J., Chvez, J.A.: Thin film composites of polyallylamine-silver. Superficies y Vacio (2012) Palacios, J.C., Olayo, M.G., Cruz, G.J., Chvez, J.A.: Thin film composites of polyallylamine-silver. Superficies y Vacio (2012)
16.
go back to reference Erdem, U.M., Hasselmo, M.E.: A biologically inspired hierarchical goal directed navigation model. J. Physiol. Paris 108(1), 28–37 (2014)CrossRef Erdem, U.M., Hasselmo, M.E.: A biologically inspired hierarchical goal directed navigation model. J. Physiol. Paris 108(1), 28–37 (2014)CrossRef
17.
go back to reference Morris, R.G.M., Garrud, P., Rawlins, J.N.P., O’Keefe, J.: Place navigation impaired in rats with hippocampal lesions. Nature 297(5868), 681–683 (1982)CrossRef Morris, R.G.M., Garrud, P., Rawlins, J.N.P., O’Keefe, J.: Place navigation impaired in rats with hippocampal lesions. Nature 297(5868), 681–683 (1982)CrossRef
18.
go back to reference Steele, R.J., Morris, R.G.M.: Delay-dependent impairment of a matching-to- place task with chronic and intrahippocampal infusion of the NMDA-antagonist D-AP5. Hippocampus 9(2), 118–136 (1999)CrossRef Steele, R.J., Morris, R.G.M.: Delay-dependent impairment of a matching-to- place task with chronic and intrahippocampal infusion of the NMDA-antagonist D-AP5. Hippocampus 9(2), 118–136 (1999)CrossRef
19.
go back to reference Steffenach, H.-A., Witter, M., Moser, M.-B., Moser, E.I.: Spatial memory in the rat requires the dorsolateral band of the entorhinal cortex. Neuron 45(2), 301–313 (2005)CrossRef Steffenach, H.-A., Witter, M., Moser, M.-B., Moser, E.I.: Spatial memory in the rat requires the dorsolateral band of the entorhinal cortex. Neuron 45(2), 301–313 (2005)CrossRef
20.
go back to reference Milford, M., Wyeth, G.: Persistent navigation and mapping using a biologically inspired slam system. Int. J. Robot. Res. 29(9), 1131–1153 (2010)CrossRef Milford, M., Wyeth, G.: Persistent navigation and mapping using a biologically inspired slam system. Int. J. Robot. Res. 29(9), 1131–1153 (2010)CrossRef
21.
go back to reference Milford, M., Schulz, R.: Principles of goal-directed spatial robot navigation in biomimetic models. Philos. Trans. R. Soc. B Biol. Sci. 369(1655), 20130484–20130484 (2014)CrossRef Milford, M., Schulz, R.: Principles of goal-directed spatial robot navigation in biomimetic models. Philos. Trans. R. Soc. B Biol. Sci. 369(1655), 20130484–20130484 (2014)CrossRef
22.
go back to reference Epstein, S.L., Aroor, A., Sklar, E.I., Parsons, S.: Navigation with learned spatial affordances, pp. 1–6 (2013) Epstein, S.L., Aroor, A., Sklar, E.I., Parsons, S.: Navigation with learned spatial affordances, pp. 1–6 (2013)
23.
go back to reference Epstein, S.L., Aroor, A., Evanusa, M., Sklar, E.I., Parsons, S.: Spatial abstraction for autonomous robot navigation. Cogn. Process. 16, 215–219 (2015)CrossRef Epstein, S.L., Aroor, A., Evanusa, M., Sklar, E.I., Parsons, S.: Spatial abstraction for autonomous robot navigation. Cogn. Process. 16, 215–219 (2015)CrossRef
24.
go back to reference Kiselev, G.A., Panov, A.I.: Sign-based approach to the task of role distribution in the coalition of cognitive agents. SPIIRAS Proc. 57, 161–187 (2018)CrossRef Kiselev, G.A., Panov, A.I.: Sign-based approach to the task of role distribution in the coalition of cognitive agents. SPIIRAS Proc. 57, 161–187 (2018)CrossRef
25.
go back to reference Albers, A., Yan, W., Frietsch, M.: Application of reinforcement learning for a 2-DOF robot arm control, November 2009 Albers, A., Yan, W., Frietsch, M.: Application of reinforcement learning for a 2-DOF robot arm control, November 2009
26.
go back to reference Stephen, J., Edward, J.: 3D simulation for robot arm control with deep Q-learning (2016) Stephen, J., Edward, J.: 3D simulation for robot arm control with deep Q-learning (2016)
27.
go back to reference Watkins, C.J.C.H.: Learning from delayed rewards (1989) Watkins, C.J.C.H.: Learning from delayed rewards (1989)
28.
go back to reference Gu, S., Holly, E., Lillicrap, T., Levine, S.: Deep reinforcement learning for robotic manipulation with asynchronous off-policy update (2016) Gu, S., Holly, E., Lillicrap, T., Levine, S.: Deep reinforcement learning for robotic manipulation with asynchronous off-policy update (2016)
29.
go back to reference Sutton, R.S., McAllester, D., Singh, S., Mansour, Y.: Policy gradient methods for reinforcement learning with function approximation (1999) Sutton, R.S., McAllester, D., Singh, S., Mansour, Y.: Policy gradient methods for reinforcement learning with function approximation (1999)
30.
go back to reference Osipov, G.S.: Sign-based representation and word model of actor. In: Yager, R., Sgurev, V., Hadjiski, M., and Jotsov, V. (eds.) 2016 IEEE 8th International Conference on Intelligent Systems (IS), pp. 22–26. IEEE (2016) Osipov, G.S.: Sign-based representation and word model of actor. In: Yager, R., Sgurev, V., Hadjiski, M., and Jotsov, V. (eds.) 2016 IEEE 8th International Conference on Intelligent Systems (IS), pp. 22–26. IEEE (2016)
Metadata
Title
Task and Spatial Planning by the Cognitive Agent with Human-Like Knowledge Representation
Authors
Ermek Aitygulov
Gleb Kiselev
Aleksandr I. Panov
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-99582-3_1

Premium Partner