Skip to main content
Top

2021 | OriginalPaper | Chapter

Critic Guided Segmentation of Rewarding Objects in First-Person Views

Authors : Andrew Melnik, Augustin Harter, Christian Limberg, Krishan Rana, Niko Sünderhauf, Helge Ritter

Published in: KI 2021: Advances in Artificial Intelligence

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This work discusses a learning approach to mask rewarding objects in images using sparse reward signals from an imitation learning dataset. For that we train an Hourglass network using only feedback from a critic model. The Hourglass network learns to produce a mask to decrease the critic’s score of a high score image and increase the critic’s score of a low score image by swapping the masked areas between these two images. We trained the model on an imitation learning dataset from the NeurIPS 2020 MineRL Competition Track, where our model learned to mask rewarding objects in a complex interactive 3D environment with a sparse reward signal. This approach was part of the 1st place winning solution in this competition. Video demonstration and code: https://​rebrand.​ly/​critic-guided-segmentation.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Greydanus, S., Koul, A., Dodge, J., Fern, A.: Visualizing and understanding Atari agents. In: International Conference on Machine Learning, pp. 1792–1801. PMLR (2018) Greydanus, S., Koul, A., Dodge, J., Fern, A.: Visualizing and understanding Atari agents. In: International Conference on Machine Learning, pp. 1792–1801. PMLR (2018)
3.
go back to reference Gunning, D., Aha, D.: Darpa’s explainable artificial intelligence (XAI) program. AI Mag. 40(2), 44–58 (2019) Gunning, D., Aha, D.: Darpa’s explainable artificial intelligence (XAI) program. AI Mag. 40(2), 44–58 (2019)
5.
go back to reference Harter, A., Melnik, A., Kumar, G., Agarwal, D., Garg, A., Ritter, H.: Solving physics puzzles by reasoning about paths. In: 1st NeurIPS workshop on Interpretable Inductive Biases and Physically Structured Learning (2020). https://arxiv.org/abs/2011.07357 Harter, A., Melnik, A., Kumar, G., Agarwal, D., Garg, A., Ritter, H.: Solving physics puzzles by reasoning about paths. In: 1st NeurIPS workshop on Interpretable Inductive Biases and Physically Structured Learning (2020). https://​arxiv.​org/​abs/​2011.​07357
9.
go back to reference Konen, K., Korthals, T., Melnik, A., Schilling, M.: Biologically-inspired deep reinforcement learning of modular control for a six-legged robot. In: 2019 IEEE International Conference on Robotics and Automation Workshop on Learning Legged Locomotion Workshop, (ICRA) 2019, Montreal, CA, 20–25 May 2019 (2019) Konen, K., Korthals, T., Melnik, A., Schilling, M.: Biologically-inspired deep reinforcement learning of modular control for a six-legged robot. In: 2019 IEEE International Conference on Robotics and Automation Workshop on Learning Legged Locomotion Workshop, (ICRA) 2019, Montreal, CA, 20–25 May 2019 (2019)
10.
go back to reference König, P., Melnik, A., Goeke, C., Gert, A.L., König, S.U., Kietzmann, T.C.: Embodied cognition. In: 2018 6th International Conference on Brain-Computer Interface (BCI), pp. 1–4. IEEE (2018) König, P., Melnik, A., Goeke, C., Gert, A.L., König, S.U., Kietzmann, T.C.: Embodied cognition. In: 2018 6th International Conference on Brain-Computer Interface (BCI), pp. 1–4. IEEE (2018)
13.
go back to reference Melnik, A., Bramlage, L., Voss, H., Rossetto, F., Ritter, H.: Combining causal modelling and deep reinforcement learning for autonomous agents in minecraft. In: 4th Workshop on Semantic Policy and Action Representations for Autonomous Robots at IROS 2019 (2019) Melnik, A., Bramlage, L., Voss, H., Rossetto, F., Ritter, H.: Combining causal modelling and deep reinforcement learning for autonomous agents in minecraft. In: 4th Workshop on Semantic Policy and Action Representations for Autonomous Robots at IROS 2019 (2019)
14.
go back to reference Melnik, A., Fleer, S., Schilling, M., Ritter, H.: Modularization of end-to-end learning: case study in arcade games. In: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning (2018). https://arxiv.org/pdf/1901.09895.pdf Melnik, A., Fleer, S., Schilling, M., Ritter, H.: Modularization of end-to-end learning: case study in arcade games. In: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Workshop on Causal Learning (2018). https://​arxiv.​org/​pdf/​1901.​09895.​pdf
17.
go back to reference Olah, C., Mordvintsev, A., Schubert, L.: Feature visualization. Distill 2(11), e7 (2017)CrossRef Olah, C., Mordvintsev, A., Schubert, L.: Feature visualization. Distill 2(11), e7 (2017)CrossRef
18.
go back to reference Olah, C., et al.: The building blocks of interpretability. Distill 3(3), e10 (2018)CrossRef Olah, C., et al.: The building blocks of interpretability. Distill 3(3), e10 (2018)CrossRef
20.
go back to reference Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. arXiv preprint arXiv:1312.6034 (2013) Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. arXiv preprint arXiv:​1312.​6034 (2013)
21.
go back to reference Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps (2014) Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps (2014)
22.
go back to reference Srinivas, A., Laskin, M., Abbeel, P.: Curl: contrastive unsupervised representations for reinforcement learning. arXiv preprint arXiv:2004.04136 (2020) Srinivas, A., Laskin, M., Abbeel, P.: Curl: contrastive unsupervised representations for reinforcement learning. arXiv preprint arXiv:​2004.​04136 (2020)
Metadata
Title
Critic Guided Segmentation of Rewarding Objects in First-Person Views
Authors
Andrew Melnik
Augustin Harter
Christian Limberg
Krishan Rana
Niko Sünderhauf
Helge Ritter
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-87626-5_25

Premium Partner