Skip to main content
Top

2019 | OriginalPaper | Chapter

Planar Pose Estimation Using Object Detection and Reinforcement Learning

Authors : Frederik Nørby Rasmussen, Sebastian Terp Andersen, Bjarne Grossmann, Evangelos Boukas, Lazaros Nalpantidis

Published in: Computer Vision Systems

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Pose estimation concerns systems or models dealing with the determination of a static object’s pose using, in this case, vision. This paper approaching the problem with an active vision-based solution, that integrates both perception and action in the same model. The problem is solved using a combination of neural networks for object detection and a reinforcement learning architecture for moving a camera and estimating the pose. A robotic implementation of the proposed active vision system is used for testing with promising results. Experiments show that our approach does not only solve the simple task of planar visual pose estimation, but also exhibits robustness to changes in the environment.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
3.
go back to reference Brooks, R.A.: Intelligence without representation. Artif. Intell. 47, 139–159 (1991)CrossRef Brooks, R.A.: Intelligence without representation. Artif. Intell. 47, 139–159 (1991)CrossRef
4.
go back to reference Huang, J., et al.: Speed/accuracy trade-offs for modern convolutional object detectors. In: Computer Vision and Pattern Recognition (2017) Huang, J., et al.: Speed/accuracy trade-offs for modern convolutional object detectors. In: Computer Vision and Pattern Recognition (2017)
5.
go back to reference Jia, Z., Chang, Y.J., Chen, T.: Active view selection for object and pose recognition. In: 2009 IEEE 12th International Conference on Computer Vision Workshops, September 2009, pp. 641–648 (2009) Jia, Z., Chang, Y.J., Chen, T.: Active view selection for object and pose recognition. In: 2009 IEEE 12th International Conference on Computer Vision Workshops, September 2009, pp. 641–648 (2009)
6.
go back to reference Kostavelis, I., Nalpantidis, L., Gasteratos, A.: Object recognition using saliency maps and HTM learning. In: IEEE International Conference on Imaging Systems and Techniques, pp. 528–532. IEEE, Manchester (2012) Kostavelis, I., Nalpantidis, L., Gasteratos, A.: Object recognition using saliency maps and HTM learning. In: IEEE International Conference on Imaging Systems and Techniques, pp. 528–532. IEEE, Manchester (2012)
7.
go back to reference Krull, A., Brachmann, E., Michel, F., Yang, M.Y., Gumhold, S.: Learning analysis-by-synthesis for 6D pose estimation in RGB-D images. In: 2015 IEEE International Conference on Computer Vision, December 2015, pp. 954–962 (2015) Krull, A., Brachmann, E., Michel, F., Yang, M.Y., Gumhold, S.: Learning analysis-by-synthesis for 6D pose estimation in RGB-D images. In: 2015 IEEE International Conference on Computer Vision, December 2015, pp. 954–962 (2015)
8.
go back to reference Krull, A., Brachmann, E., Nowozin, S., Michel, F., Shotton, J., Rother, C.: PoseAgent: budget-constrained 6D object pose estimation via reinforcement learning. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, July 2017, pp. 2566–2574 (2017) Krull, A., Brachmann, E., Nowozin, S., Michel, F., Shotton, J., Rother, C.: PoseAgent: budget-constrained 6D object pose estimation via reinforcement learning. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, July 2017, pp. 2566–2574 (2017)
10.
go back to reference Piater, J., Jodogne, S., Detry, R., Kraft, D., Krueger, N., Kroemer, O.: Learning visual representations for perception-action systems. Int. J. Robot. Res. 30, 294–307 (2015)CrossRef Piater, J., Jodogne, S., Detry, R., Kraft, D., Krueger, N., Kroemer, O.: Learning visual representations for perception-action systems. Int. J. Robot. Res. 30, 294–307 (2015)CrossRef
12.
go back to reference Polydoros, A.S., Boukas, E., Nalpantidis, L.: Online multi-target learning of inverse dynamics models for computed-torque control of compliant manipulators. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver (2017) Polydoros, A.S., Boukas, E., Nalpantidis, L.: Online multi-target learning of inverse dynamics models for computed-torque control of compliant manipulators. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver (2017)
13.
go back to reference Toshev, A., Szegedy, C.: DeepPose: human pose estimation via deep neural networks. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, June 2014, pp. 1653–1660 (2014) Toshev, A., Szegedy, C.: DeepPose: human pose estimation via deep neural networks. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, June 2014, pp. 1653–1660 (2014)
14.
go back to reference Tremblay, J., To, T., Sundaralingam, B., Xiang, Y., Fox, D., Birchfield, S.: Deep object pose estimation for semantic robotic grasping of household objects. Cornell University Library (2018) Tremblay, J., To, T., Sundaralingam, B., Xiang, Y., Fox, D., Birchfield, S.: Deep object pose estimation for semantic robotic grasping of household objects. Cornell University Library (2018)
Metadata
Title
Planar Pose Estimation Using Object Detection and Reinforcement Learning
Authors
Frederik Nørby Rasmussen
Sebastian Terp Andersen
Bjarne Grossmann
Evangelos Boukas
Lazaros Nalpantidis
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-34995-0_32

Premium Partner