nach oben

Erschienen in:

2018 | OriginalPaper | Buchkapitel

A Phenomenologically Justifiable Simulation of Mental Modeling

verfasst von : Mark Wernsdorfer

Erschienen in: Artificial General Intelligence

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Real-world agents need to learn how to react to their environment. To achieve this, it is crucial that they have a model of this environment that is adapted during interaction and although important aspects may be hidden. This paper presents a new type of model for partially observable environments that enables an agent to represent hidden states but can still be generated and queried in realtime. Agents can use such a model to predict the outcomes of their actions and to infer action policies. These policies turn out to be better than the optimal policy in a partially observable Markov decision process as it can be inferred, for example, by Q- or Sarsa-learning. The structure and generation of these models are motivated both by phenomenological considerations from semiotics and the philosophy of mind. The performance of these models is compared to a baseline of Markov models for prediction and interaction in partially observable environments.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Perception from an AGI Perspective

Nächstes Kapitel A Time-Critical Simulation of Language Comprehension

In the last one, the authors explicitly state that “[t]he agent rarely observes the exact same frame from a previous episode” which makes the environment according to a state based conception practically be fully observable.

Comparisons with \(n \ge 2 \) yield similar results.

As a consequence, the performance of the baseline approach is lower than in experiments with memory reset.

Bengio, Y., Courville, A., Vincent, P.: Unsupervised feature learning and deep learning: a review and new perspectives. CoRR abs/1206.5538 (2012)

Corneil, D., Gerstner, W., Brea, J.: Efficient model-based deep reinforcement learning with variational state tabulation. arXiv preprint arXiv:1802.04325 (2018)

Crook, P., Hayes, G.: Learning in a state of confusion: perceptual aliasing in grid world navigation. In: Towards Intelligent Mobile Robots, vol. 4 (2003)

Drescher, G.: Made-Up Minds. MIT press, Cambridge (1991)

Fikes, R., Nilsson, N.: Strips: a new approach to the application of theorem proving to problem solving. Artif. Intell. 2(3–4), 189–208 (1971)CrossRef

Gelfond, M., Lifschitz, V.: Action languages. Electron. Trans. AI 3, 195–210 (1998)

Holmes, M., Isbell, C.: Schema learning: experience-based construction of predictive action models. In: Advances in Neural Information Processing Systems, pp. 585–592 (2005)

Kaelbling, L.P., Littman, M.L., Cassandra, A.R.: Planning and acting in partially observable stochastic domains. Artif. Intell. 101(1), 99–134 (1998)MathSciNetCrossRef

Kansky, K., et al.: Schema networks: zero-shot transfer with a generative causal model of intuitive physics. arXiv preprint arXiv:1706.04317 (2017)

10.

Lifschitz, V., Turner, H.: Representing transition systems by logic programs. In: Gelfond, M., Leone, N., Pfeifer, G. (eds.) LPNMR 1999. LNCS (LNAI), vol. 1730, pp. 92–106. Springer, Heidelberg (1999). https://doi.org/10.1007/3-540-46767-X_7CrossRefMATH

11.

Marty, R.: C.S. Peirce’s phaneroscopy and semiotics. Semiotica 41(1–4), 169–182 (1982)

12.

Maturana, H.: Autopoiesis and Cognition: The Realization of the Living. Springer, Dordrecht (1980). https://doi.org/10.1007/978-94-009-8947-4

13.

McCallum, A.: Overcoming incomplete perception with utile distinction memory. In: Proceedings of the 10th International Conference on Machine Learning, pp. 190–196 (1993)

14.

McCallum, A.: Instance-based state identification for reinforcement learning. In: Advances in Neural Information Processing Systems, pp. 377–384 (1995)

15.

McCallum, A.: Reinforcement learning with selective perception and hidden state. Ph.D. thesis, University of Rochester, Department of Computer Science (1996)

16.

van Otterlo, M.: The Logic of Adaptive Behavior: Knowledge Representation and Algorithms for Adaptive Sequential Decision Making Under Uncertainty in First-Order and Relational Domains. Ios Press, Amsterdam (2009). Frontiers in artificial intelligence and applications

17.

Perotto, F.S., Buisson, J.C., Alvares, L.O.C.: Constructivist anticipatory learning mechanism (calm): dealing with partially deterministic and partially observable environments. In: International Conference on Epigenetic Robotics, pp. 117–127. Lund University Cognitive Science (2007)

18.

Ring, M., Schaul, T., Schmidhuber, J.: The two-dimensional organization of behavior. In: 2011 IEEE International Conference on Development and Learning (ICDL), vol. 2, pp. 1–8. IEEE (2011)

19.

Searle, J.: Intrinsic intentionality. Behav. Brain Sci. 3(03), 450–457 (1980)CrossRef

20.

Searle, J.: Intentionality: An Essay in the Philosophy of Mind. Cambridge Univ. Press, Cambridge Paperback Library, Cambridge (1983)CrossRef

21.

Sun, R., Sessions, C.: Self-segmentation of sequences: automatic formation of hierarchies of sequential behaviors. Syst. Man Cybern. Part B Cybern. 30(3), 403–418 (2000)CrossRef

22.

Sutton, R.: Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In: Proceedings of the 7th International Conference on Machine Learning, pp. 216–224 (1990)

23.

Sutton, R., Barto, A.: Reinforcement Learning: An Introduction, vol. 1. MIT press, Cambridge (1998)

Titel: A Phenomenologically Justifiable Simulation of Mental Modeling
verfasst von: Mark Wernsdorfer
Verlag: Springer International Publishing
Buch: Artificial General Intelligence
Print ISBN: 978-3-319-97675-4

Electronic ISBN: 978-3-319-97676-1

Copyright-Jahr: 2018
DOI: https://doi.org/10.1007/978-3-319-97676-1_26

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"