Top

Published in:

2021 | OriginalPaper | Chapter

Deep Reinforcement Learning Applied to a Robotic Pick-and-Place Application

Authors : Natanael Magno Gomes, Felipe N. Martins, José Lima, Heinrich Wörtche

Published in: Optimization, Learning Algorithms and Applications

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Industrial robot manipulators are widely used for repetitive applications that require high precision, like pick-and-place. In many cases, the movements of industrial robot manipulators are hard-coded or manually defined, and need to be adjusted if the objects being manipulated change position. To increase flexibility, an industrial robot should be able to adjust its configuration in order to grasp objects in variable/unknown positions. This can be achieved by off-the-shelf vision-based solutions, but most require prior knowledge about each object to be manipulated. To address this issue, this work presents a ROS-based deep reinforcement learning solution to robotic grasping for a Collaborative Robot (Cobot) using a depth camera. The solution uses deep Q-learning to process the color and depth images and generate a \(\epsilon \)-greedy policy used to define the robot action. The Q-values are estimated using Convolutional Neural Network (CNN) based on pre-trained models for feature extraction. Experiments were carried out in a simulated environment to compare the performance of four different pre-trained CNN models (RexNext, MobileNet, MNASNet and DenseNet). Results show that the best performance in our application was reached by MobileNet, with an average of 84 % accuracy after training in simulated environment.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

previous chapter Sensor Fusion for Mobile Robot Localization Using Extended Kalman Filter, UWB ToF and ArUco Markers

next chapter An IoT Approach for Animals Tracking

Siciliano, B., Khatib, O. (eds.): Springer Handbook of Robotics, pp. 1–2227. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-32552-1

ISO/TS 15066 Robots and robotic devices - Collaborative robots. International Organization for Standardization, Geneva, CH, Standard, February 2016

Saxena, A., Driemeyer, J., Kearns, J., Ng, A.Y.: Robotic grasping of novel objects. In: Advances in Neural Information Processing Systems, pp. 1209–1216 (2007). https://doi.org/10.7551/mitpress/7503.003.0156. ISBN: 9780262195683

Torras, C.: Computer Vision: Theory and Industrial Applications, p. 455. Springer, Heidelberg (1992). https://doi.org/10.1007/978-3-642-48675-3. ISBN: 3642486754

Gomes, J.F.S., Leta, F.R.: Applications of computer vision techniques in the agriculture and food industry: a review. Eur. Food Res. Technol. 235, 989–1000 (2012). https://doi.org/10.1007/s00217-012-1844-2CrossRef

Arakeri, M.P., Lakshmana: Computer vision based fruit grading system for quality evaluation of tomato in agriculture industry. Procedia Comput. Sci. 79, 426–433 (2016). https://doi.org/10.1016/j.procs.2016.03.055

Bhutta, M.U.M., Aslam, S., Yun, P., Jiao, J., Liu, M.: Smart-inspect: micro scale localization and classification of smartphone glass defects for industrial automation. arXiv: 2010.00741, October 2020

Shafii, N., Kasaei, S.H., Lopes, L.S.: Learning to grasp familiar objects using object view recognition and template matching. In: IEEE International Conference on Intelligent Robots and Systems, vol. 2016-November, pp. 2895–2900. Institute of Electrical and Electronics Engineers Inc., November 2016. https://doi.org/10.1109/IROS.2016.7759448. ISBN: 9781509037629

Kumra, S., Kanan, C.: Robotic grasp detection using deep convolutional neural networks. In: IEEE International Conference on Intelligent Robots and Systems, vol. 2017-September, pp. 769–776. Institute of Electrical and Electronics Engineers Inc., November 2017. https://doi.org/10.1109/IROS.2017.8202237. arXiv: 1611.08036. ISBN: 9781538626825

10.

Morrison, D., Corke, P., Leitner, J.: Learning robust, real-time, reactive robotic grasping. Int. J. Robot. Res. 39(2–3), 183–201 (2020). https://doi.org/10.1177/0278364919859066. ISSN: 0278-3649

11.

Mittal, S., Vaishay, S.: A survey of techniques for optimizing deep learning on GPUs. J. Syst. Archit. 99, 101635 (2019). https://doi.org/10.1016/j.sysarc.2019.101635. http://www.sciencedirect.com/science/article/pii/S1383762119302656. ISSN: 1383-7621

12.

Saha, S.: A comprehensive guide to convolutional neural networks - the ELI5 way - by Sumit Saha - towards data science (2018). https://towardsdatascience.com/a-comprehensiveguide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53. Accessed 20 June 2020

13.

Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014). https://doi.org/10.1109/CVPR.2014.81. arXiv: 1311.2524. ISBN: 9781479951178

14.

Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, vol. 2015 Inter, pp. 1440–1448 (2015). https://doi.org/10.1109/ICCV.2015.169. arXiv: 1504.08083. ISSN: 15505499

15.

Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017). https://doi.org/10.1109/TPAMI.2016.2577031. arXiv: 1506.01497. ISSN: 01628828

16.

He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. IEEE Trans. Pattern Anal. Mach. Intell. 42(2), 386–397 (2020). https://doi.org/10.1109/TPAMI.2018.2844175. arXiv: 1703.06870. ISSN: 19393539

17.

Girshick, R., Radosavovic, I., Gkioxari, G., Dollár, P., He, K.: Detectron (2018). https://github.com/facebookresearch/detectron

18.

Debkowski, D.: SuperBadCode/Depth-Mask-RCNN: using Kinect2 depth sensors to train neural network for object detection & interaction. https://github.com/SuperBadCode/Depth-Mask-RCNN. Accessed 20 June 2020

19.

Sutton, R.S. Barto, A.G.: Reinforcement Learning: An Introduction, 2nd edn, p. 552. The MIT Press, Cambridge (2018). ISBN: 978-0-262-03924-6

20.

Zanuttigh, P., Marin, G., Dal Mutto, C., Dominio, F., Minto, L., Cortelazzo, G.M.: Time-of-Flight and Structured Light Depth Cameras, pp. 1–355. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30973-6. ISBN: 9783319309736

21.

Zhang, F., Leitner, J., Milford, M., Upcroft, B., Corke, P.: Towards vision-based deep reinforcement learning for robotic motion control. arXiv: 1511.03791, November 2015

22.

Joshi, S., Kumra, S., Sahin, F.: Robotic grasping using deep reinforcement learning. In: 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), pp. 1461–1466. IEEE, August 2020. https://doi.org/10.1109/CASE48305.2020.9216986. ISBN: 978-1-7281-6904-0

23.

Rahman, M.D.M., Rashid, S.M.H., Hossain, M.M.: Implementation of Q learning and deep Q network for controlling a self balancing robot model. Robot. Biomim. 5(1), 1–6 (2018). https://doi.org/10.1186/s40638-018-0091-9. arXiv: 1807.08272. ISSN: 2197-3768

24.

Hase, H., Azampour, M.F., Tirindelli, M., et al.: Ultrasound-guided robotic navigation with deep reinforcement learning. arXiv: 2003.13321, March 2020

25.

Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. Technical Report. arXiv: 1608.06993v5. https://github.com/liuzhuang13/DenseNet

26.

Torchvision.models (2019). https://pytorch.org/docs/stable/torchvision/models.html. Accessed 17 Jan 2021

27.

Webots. Commercial Mobile Robot Simulation Software. Cyberbotics Ltd., Ed. https://www.cyberbotics.com

28.

Ayala, A., Cruz, F., Campos, D., Rubio, R., Fernandes, B., Dazeley, R.: A comparison of humanoid robot simulators: a quantitative approach, pp. 1–10. arXiv: 2008.04627 (2020)

29.

Rajeswaran, A., Kumar, V., Gupta, A., et al.: Learning complex dexterous manipulation with deep reinforcement learning and demonstrations. Technical Report. arXiv: 1709.10087v2. http://sites.google.com/view/deeprl-dexterous-manipulation

30.

Hawkins, K.P.: Analytic inverse kinematics for the universal robots UR-5/UR-10 arms. Technical Report, December 2013. https://smartech.gatech.edu/handle/1853/50782

31.

Universal Robots - Parameters for calculations of kinematics and dynamics. https://www.universal-robots.com/articles/ur/parameters-for-calculations-of-kinematics-anddynamics/. Accessed 31 Dec 2020

32.

Manual Robotiq 2F-85 & 2F-140 for e-series universal robots, Robotic, 145 pp., November 2018

33.

SmoothL1Loss – PyTorch 1.7.0 documentation. https://pytorch.org/docs/stable/generated/torch.nn.SmoothL1Loss.html. Accessed 15 Jan 2021

34.

Kingma, D.P., Ba, J.L.: Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, December 2015. arXiv: 1412.6980

35.

Loshchilov, I., Hutter, F.: Decoupled weight decay regularization, November 2017. arXiv: 1711.05101. http://arxiv.org/abs/1711.05101

36.

De Bruin, T., Kober, J., Tuyls, K., Babuška, R.: Experience selection in deep reinforcement learning for control. J. Mach. Learn. Res. 19, 1–56 (2018). https://doi.org/10.5555/3291125.3291134. http://jmlr.org/papers/v19/17-131.html. ISSN: 15337928

37.

Brys, T., Harutyunyan, A., Suay, H.B., Chernova, S., Taylor, M.E., Nowé, A.: Reinforcement learning from demonstration through shaping. In: IJCAI International Joint Conference on Artificial Intelligence, vol. 2015-January, pp. 3352–3358 (2015). ISBN: 9781577357384

Title: Deep Reinforcement Learning Applied to a Robotic Pick-and-Place Application
Authors: Natanael Magno Gomes
Felipe N. Martins
José Lima
Heinrich Wörtche
Publisher: Springer International Publishing
Book: Optimization, Learning Algorithms and Applications
Print ISBN: 978-3-030-91884-2

Electronic ISBN: 978-3-030-91885-9

Copyright Year: 2021
DOI: https://doi.org/10.1007/978-3-030-91885-9_18

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner