Top

Neural Computing and Applications

Published in:

07-01-2023 | S.I.: AI based Techniques and Applications for Intelligent IoT Systems

Cleaning of object surfaces based on deep learning: a method for generating manipulator trajectories using RGB-D semantic segmentation

Authors: Lizhe Qi, Zhongxue Gan, Zhongwei Hua, Daming Du, Wenxuan Jiang, Yunquan Sun

Published in: Neural Computing and Applications | Issue 12/2023

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

A mobile robot with a robotic arm needs to be able to autonomously perceive the operating environment and plan the trajectory of the object’s surface in order to perform surface cleaning tasks in a complex, unstructured environment. This study suggests an autonomous trajectory planning technique for cleaning an object’s surface based on RGB-D semantic segmentation, which enables the robotic arm to move the cleaning mechanism on the object’s surface smoothly and steadily and finish the cleaning process. More particularly, it contains the following: (1) A Double Attention Fusion Net (DAFNet) RGB-D semantic segmentation network is proposed, which successfully integrates color texture features and spatial structure features and enhances the semantic segmentation performance of indoor objects. This network is based on the dual attention mechanism (channel attention and spatial attention). (2) The trajectory planning algorithm for the robot arm is created, and the semantically segmented data is clustered using DBCSCAN. In order to achieve autonomous planning of the cleaning trajectory, the target subject is first extracted, and then the working trajectory of the robot arm is generated via the processes of edge detection, slicing, sampling, fitting, etc. We also compare the accuracy of DAFNet semantic segmentation and other algorithms on SUNRGBD and self-built datasets, experiment with trajectory generation for various objects, and evaluate the online surface cleaning procedure. According to the experimental findings, the DAFNet semantic segmentation model is more accurate than the current models. According to the online test, the trajectory generated has a good degree of smoothness and continuity, and the robotic arm is capable of completing the surface cleaning operation effectively.

previous article HRNet- and PSPNet-based multiband semantic segmentation of remote sensing images

next article Multimodal deep collaborative filtering recommendation based on dual attention

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Chen H, Fuhlbrigge T, Li X (2009) A review of cad-based robot path planning for spray painting. Ind Robot Int J

Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440

Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 234–241

Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2881–2890

Chen L-C, Zhu Y, Papandreou G, Schroff F, Adam H (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European conference on computer vision (ECCV), pp 801–818

Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125

Li X, You A, Zhu Z, Zhao H, Yang M, Yang K, Tan S, Tong Y (2020) Semantic flow for fast and accurate scene parsing. In: European conference on computer vision. Springer, pp 775–793

Li X, Zhao H, Han L, Tong Y, Tan S, Yang K (2020) Gated fully fusion for semantic segmentation. Proc AAAI Conf Artif Intell 34:11418–11425

Hu P, Perazzi F, Heilbron FC, Wang O, Lin Z, Saenko K, Sclaroff S (2020) Real-time semantic segmentation with fast attention. IEEE Robot Autom Lett 6(1):263–270CrossRef

10.

Li Y, Song L, Chen Y, Li Z, Zhang X, Wang X, Sun J (2020) Learning dynamic routing for semantic segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 8553–8562

11.

Hu X, Yang K, Fei L, Wang K (2019) ACNET: attention based network to exploit complementary features for RGBD semantic segmentation. In: IEEE international conference on image processing (ICIP). IEEE, pp 1440–1444

12.

Park S-J, Hong K-S, Lee S (2017) Rdfnet: Rgb-d multi-level residual feature fusion for indoor semantic segmentation. In: Proceedings of the IEEE international conference on computer vision, pp 4980–4989

13.

Chen L-Z, Lin Z, Wang Z, Yang Y-L, Cheng M-M (2021) Spatial information guided convolution for real-time rgbd semantic segmentation. IEEE Trans Image Process 30:2313–2324CrossRef

14.

Cao J, Leng H, Lischinski D, Cohen-Or D, Tu C, Li Y (2021) Shapeconv: shape-aware convolutional layer for indoor RGB-D semantic segmentation. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 7088–7097

15.

Cui H, Dong J, Hou G, Xiao Z, Chen Y, Zhao Z (2013) Analysis on arc-welding robot visual control tracking system. In: International conference on quality, reliability, risk, maintenance, and safety engineering (QR2MSE)

16.

Martínez D, Alenya G, Torras C (2015) Planning robot manipulation to clean planar surfaces. Eng Appl Artif Intell 39:23–32CrossRef

17.

Chen W, Zhao D (2013) Path planning for spray painting robot of workpiece surfaces. Math Probl Eng. https://doi.org/10.1155/2013/659457CrossRef

18.

Gasparetto A, Vidoni R, Pillan D, Saccavini E (2012) Automatic path and trajectory planning for robotic spray painting. In: 7th German conference on robotics ROBOTIK 2012. VDE, pp 1–6

19.

Chen H, Xi N (2008) Automated tool trajectory planning of industrial robots for painting composite surfaces. Int J Adv Manuf Technol 35(7):680–696CrossRef

20.

Atkar PN, Greenfield A, Conner DC, Choset H, Rizzi AA (2005) Uniform coverage of automotive surface patches. Int J Robot Res 24(11):883–898CrossRef

21.

Wang G, Cheng J, Li R, Chen K (2015) A new point cloud slicing based path planning algorithm for robotic spray painting. In: IEEE international conference on robotics and biomimetics (ROBIO). IEEE, pp 1717–1722

22.

Wong C-C, Yeh L-Y, Liu C-C, Tsai C-Y, Aoyama H (2021) Manipulation planning for object re-orientation based on semantic segmentation keypoint detection. Sensors 21(7):2280CrossRef

23.

Yin J, Apuroop KGS, Tamilselvam YK, Mohan RE, Ramalingam B, Le AV (2020) Table cleaning task by human support robot using deep learning technique. Sensors 20(6):1698CrossRef

24.

Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19

25.

Zhao H, Jiang L, Jia J, Torr PH, Koltun V (2021) Point transformer. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 16259–16268

26.

Ester M, Kriegel H-P, Sander J, Xu X et al (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, vol 96, pp 226–231

27.

Guo G, Wang H, Bell D, Bi Y, Greer K (2003) KNN model-based approach in classification. In: OTM confederated international conferences on the move to meaningful internet systems. Springer, pp 986–996

28.

Wold S, Esbensen K, Geladi P (1987) Principal component analysis. Chemom Intell Lab Syst 2(1–3):37–52CrossRef

29.

Wu Y, Wong Y, Loh HT, Zhang Y (2004) Modelling cloud data using an adaptive slicing approach. Comput Aided Des 36(3):231–240CrossRef

30.

Woo H, Kang E, Wang S, Lee KH (2002) A new segmentation method for point cloud data. Int J Mach Tools Manuf 42(2):167–178CrossRef

31.

Fischler MA, Bolles RC (1981) Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun ACM 24(6):381–395MathSciNetCrossRef

32.

Intel RealSense\(^{\text{TM}}\) (2018) Depth module D400 series custom calibration; Intel Corporation:Santa Clara, CA, USA

Title: Cleaning of object surfaces based on deep learning: a method for generating manipulator trajectories using RGB-D semantic segmentation
Authors: Lizhe Qi
Zhongxue Gan
Zhongwei Hua
Daming Du
Wenxuan Jiang
Yunquan Sun
Publication date: 07-01-2023
Publisher: Springer London
Published in: Neural Computing and Applications / Issue 12/2023
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI: https://doi.org/10.1007/s00521-022-07930-x

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Other articles of this Issue 12/2023

A method of forecasting trade export volume based on back-propagation neural network

NeuroGAN: image reconstruction from EEG signals via an attention-based GAN

Multifeature video modularized arm movement algorithm evaluation and simulation

Spiking Neural P System with weight model of majority voting technique for reliable interactive image segmentation

Fusion of forehead EEG with machine vision for real-time fatigue detection in an automatic processing pipeline

Mapping of water bodies from sentinel-2 images using deep learning-based feature fusion approach

Premium Partner