Skip to main content
Top

2021 | OriginalPaper | Chapter

Deep Reinforcement Learning Applied to a Robotic Pick-and-Place Application

Authors : Natanael Magno Gomes, Felipe N. Martins, José Lima, Heinrich Wörtche

Published in: Optimization, Learning Algorithms and Applications

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Industrial robot manipulators are widely used for repetitive applications that require high precision, like pick-and-place. In many cases, the movements of industrial robot manipulators are hard-coded or manually defined, and need to be adjusted if the objects being manipulated change position. To increase flexibility, an industrial robot should be able to adjust its configuration in order to grasp objects in variable/unknown positions. This can be achieved by off-the-shelf vision-based solutions, but most require prior knowledge about each object to be manipulated. To address this issue, this work presents a ROS-based deep reinforcement learning solution to robotic grasping for a Collaborative Robot (Cobot) using a depth camera. The solution uses deep Q-learning to process the color and depth images and generate a \(\epsilon \)-greedy policy used to define the robot action. The Q-values are estimated using Convolutional Neural Network (CNN) based on pre-trained models for feature extraction. Experiments were carried out in a simulated environment to compare the performance of four different pre-trained CNN models (RexNext, MobileNet, MNASNet and DenseNet). Results show that the best performance in our application was reached by MobileNet, with an average of 84 % accuracy after training in simulated environment.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference ISO/TS 15066 Robots and robotic devices - Collaborative robots. International Organization for Standardization, Geneva, CH, Standard, February 2016 ISO/TS 15066 Robots and robotic devices - Collaborative robots. International Organization for Standardization, Geneva, CH, Standard, February 2016
7.
go back to reference Bhutta, M.U.M., Aslam, S., Yun, P., Jiao, J., Liu, M.: Smart-inspect: micro scale localization and classification of smartphone glass defects for industrial automation. arXiv: 2010.00741, October 2020 Bhutta, M.U.M., Aslam, S., Yun, P., Jiao, J., Liu, M.: Smart-inspect: micro scale localization and classification of smartphone glass defects for industrial automation. arXiv:​ 2010.​00741, October 2020
8.
go back to reference Shafii, N., Kasaei, S.H., Lopes, L.S.: Learning to grasp familiar objects using object view recognition and template matching. In: IEEE International Conference on Intelligent Robots and Systems, vol. 2016-November, pp. 2895–2900. Institute of Electrical and Electronics Engineers Inc., November 2016. https://doi.org/10.1109/IROS.2016.7759448. ISBN: 9781509037629 Shafii, N., Kasaei, S.H., Lopes, L.S.: Learning to grasp familiar objects using object view recognition and template matching. In: IEEE International Conference on Intelligent Robots and Systems, vol. 2016-November, pp. 2895–2900. Institute of Electrical and Electronics Engineers Inc., November 2016. https://​doi.​org/​10.​1109/​IROS.​2016.​7759448. ISBN: 9781509037629
19.
go back to reference Sutton, R.S. Barto, A.G.: Reinforcement Learning: An Introduction, 2nd edn, p. 552. The MIT Press, Cambridge (2018). ISBN: 978-0-262-03924-6 Sutton, R.S. Barto, A.G.: Reinforcement Learning: An Introduction, 2nd edn, p. 552. The MIT Press, Cambridge (2018). ISBN: 978-0-262-03924-6
21.
go back to reference Zhang, F., Leitner, J., Milford, M., Upcroft, B., Corke, P.: Towards vision-based deep reinforcement learning for robotic motion control. arXiv: 1511.03791, November 2015 Zhang, F., Leitner, J., Milford, M., Upcroft, B., Corke, P.: Towards vision-based deep reinforcement learning for robotic motion control. arXiv:​ 1511.​03791, November 2015
24.
go back to reference Hase, H., Azampour, M.F., Tirindelli, M., et al.: Ultrasound-guided robotic navigation with deep reinforcement learning. arXiv: 2003.13321, March 2020 Hase, H., Azampour, M.F., Tirindelli, M., et al.: Ultrasound-guided robotic navigation with deep reinforcement learning. arXiv:​ 2003.​13321, March 2020
28.
go back to reference Ayala, A., Cruz, F., Campos, D., Rubio, R., Fernandes, B., Dazeley, R.: A comparison of humanoid robot simulators: a quantitative approach, pp. 1–10. arXiv: 2008.04627 (2020) Ayala, A., Cruz, F., Campos, D., Rubio, R., Fernandes, B., Dazeley, R.: A comparison of humanoid robot simulators: a quantitative approach, pp. 1–10. arXiv:​ 2008.​04627 (2020)
32.
go back to reference Manual Robotiq 2F-85 & 2F-140 for e-series universal robots, Robotic, 145 pp., November 2018 Manual Robotiq 2F-85 & 2F-140 for e-series universal robots, Robotic, 145 pp., November 2018
34.
go back to reference Kingma, D.P., Ba, J.L.: Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, December 2015. arXiv: 1412.6980 Kingma, D.P., Ba, J.L.: Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, December 2015. arXiv:​ 1412.​6980
37.
go back to reference Brys, T., Harutyunyan, A., Suay, H.B., Chernova, S., Taylor, M.E., Nowé, A.: Reinforcement learning from demonstration through shaping. In: IJCAI International Joint Conference on Artificial Intelligence, vol. 2015-January, pp. 3352–3358 (2015). ISBN: 9781577357384 Brys, T., Harutyunyan, A., Suay, H.B., Chernova, S., Taylor, M.E., Nowé, A.: Reinforcement learning from demonstration through shaping. In: IJCAI International Joint Conference on Artificial Intelligence, vol. 2015-January, pp. 3352–3358 (2015). ISBN: 9781577357384
Metadata
Title
Deep Reinforcement Learning Applied to a Robotic Pick-and-Place Application
Authors
Natanael Magno Gomes
Felipe N. Martins
José Lima
Heinrich Wörtche
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-91885-9_18

Premium Partner