2011 | OriginalPaper | Buchkapitel
An Action Selection Method Based on Estimation of Other’s Intention in Time-Varying Multi-agent Environments
verfasst von : Kunikazu Kobayashi, Ryu Kanehira, Takashi Kuremoto, Masanao Obayashi
Erschienen in: Neural Information Processing
Verlag: Springer Berlin Heidelberg
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
An action selection method based on the estimation of other’s intention is proposed to treat with time-varying multi-agent environments. Firstly, the estimation level of other’s intention is stratified as active, passive and thoughtful levels. Secondly, three estimation levels are formulated by a policy estimation method. Thirdly, a new action selection method by switching three estimation levels is proposed to cope with time-varying environments. Fourthly, the estimation methods of other’s intention are applied to the Q-learning method. Finally, through computer simulations using pursuit problems, the performance of the estimation methods are investigated. As a result, it is shown that the proposed method can select the appropriate estimation level in time-varying environments.