Published in: Neural Computing and Applications 12/2023

07-01-2023 | S.I.: AI based Techniques and Applications for Intelligent IoT Systems

Cleaning of object surfaces based on deep learning: a method for generating manipulator trajectories using RGB-D semantic segmentation

Authors: Lizhe Qi, Zhongxue Gan, Zhongwei Hua, Daming Du, Wenxuan Jiang, Yunquan Sun


Abstract

To clean surfaces in complex, unstructured environments, a mobile robot equipped with a robotic arm must autonomously perceive its operating environment and plan a trajectory over the object’s surface. This study proposes an autonomous trajectory planning technique for surface cleaning based on RGB-D semantic segmentation, which enables the robotic arm to move the cleaning mechanism smoothly and steadily across the object’s surface and complete the cleaning process. Specifically: (1) A Double Attention Fusion Net (DAFNet) RGB-D semantic segmentation network is proposed. Built on a dual attention mechanism (channel attention and spatial attention), it integrates color texture features with spatial structure features and improves the semantic segmentation of indoor objects. (2) A trajectory planning algorithm for the robot arm is developed. The semantically segmented data is clustered using DBSCAN to extract the target object, and the working trajectory of the robot arm is then generated through edge detection, slicing, sampling, fitting, and related steps, achieving autonomous planning of the cleaning trajectory. We also compare the segmentation accuracy of DAFNet with that of other algorithms on SUNRGBD and self-built datasets, test trajectory generation on various objects, and evaluate the online surface cleaning procedure. The experimental results indicate that the DAFNet semantic segmentation model is more accurate than the existing models it is compared with. In the online test, the generated trajectory is smooth and continuous, and the robotic arm completes the surface cleaning operation effectively.
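The second contribution (cluster, extract the target, slice, sample, fit) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the segmented points are already available as (x, y, z) tuples, omits edge detection, slices along the y axis, and swaps in a straight-line least-squares fit per slice where the paper may use a different fitting model. All function names and parameters (`dbscan`, `largest_cluster`, `zigzag_path`, `eps`, `min_pts`, `n_slices`, `samples_per_slice`) are hypothetical and chosen for illustration only.

```python
from collections import Counter, deque
from math import dist


def dbscan(points, eps, min_pts):
    """Return one label per point: a cluster id (0, 1, ...) or -1 for noise."""
    # Brute-force neighbour lists, O(n^2); each point counts as its own neighbour.
    neighbours = [[j for j, q in enumerate(points) if dist(p, q) <= eps]
                  for p in points]
    labels = [None] * len(points)            # None means "not visited yet"
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        if len(neighbours[i]) < min_pts:
            labels[i] = -1                   # noise for now, may become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = deque(neighbours[i])
        while queue:
            j = queue.popleft()
            if labels[j] == -1:
                labels[j] = cluster          # former noise reached from a core point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            if len(neighbours[j]) >= min_pts:
                queue.extend(neighbours[j])  # j is a core point, keep expanding
    return labels


def largest_cluster(points, labels):
    """Keep the most populated non-noise cluster as the assumed target object."""
    counts = Counter(label for label in labels if label != -1)
    if not counts:
        return []
    target_id = counts.most_common(1)[0][0]
    return [p for p, label in zip(points, labels) if label == target_id]


def fit_line(xs, zs):
    """Least-squares fit of z = a * x + b (illustrative stand-in for the fitting step)."""
    n = len(xs)
    mean_x, mean_z = sum(xs) / n, sum(zs) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxz = sum((x - mean_x) * (z - mean_z) for x, z in zip(xs, zs))
    a = sxz / sxx if sxx else 0.0
    return a, mean_z - a * mean_x


def zigzag_path(points, n_slices, samples_per_slice):
    """Slice along y, fit z(x) per slice, sample it, and alternate sweep direction.

    Assumes a non-empty point list and samples_per_slice >= 2.
    """
    ys = [p[1] for p in points]
    y_min, y_max = min(ys), max(ys)
    width = (y_max - y_min) / n_slices or 1.0
    bands = [[] for _ in range(n_slices)]
    for p in points:
        k = min(int((p[1] - y_min) / width), n_slices - 1)
        bands[k].append(p)
    path, rows = [], 0
    for k, band in enumerate(bands):
        if len(band) < 2:
            continue                          # too few points to fit this slice
        xs = [p[0] for p in band]
        a, b = fit_line(xs, [p[2] for p in band])
        x0, step = min(xs), (max(xs) - min(xs)) / (samples_per_slice - 1)
        y_mid = y_min + (k + 0.5) * width
        row = [(x0 + i * step, y_mid, a * (x0 + i * step) + b)
               for i in range(samples_per_slice)]
        path.extend(row if rows % 2 == 0 else row[::-1])
        rows += 1
    return path


# Synthetic tilted plane plus two far-away outliers standing in for segmentation noise.
cloud = [(float(x), float(y), 0.5 * x + 1.0) for y in range(4) for x in range(5)]
cloud += [(50.0, 50.0, 50.0), (-40.0, 0.0, 9.0)]
target = largest_cluster(cloud, dbscan(cloud, eps=1.2, min_pts=3))
path = zigzag_path(target, n_slices=4, samples_per_slice=5)
print(len(target), len(path))
```

Reversing every other row yields a back-and-forth sweep, so the end of one pass lies next to the start of the following one. A real system would also need tool orientation (for example from surface normals) and a smoother fitting model, neither of which is shown here.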


Metadata
Title: Cleaning of object surfaces based on deep learning: a method for generating manipulator trajectories using RGB-D semantic segmentation
Authors: Lizhe Qi, Zhongxue Gan, Zhongwei Hua, Daming Du, Wenxuan Jiang, Yunquan Sun
Publication date: 07-01-2023
Publisher: Springer London
Published in: Neural Computing and Applications, Issue 12/2023
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI: https://doi.org/10.1007/s00521-022-07930-x
