nach oben

Neural Computing and Applications

Erschienen in:

07.01.2023 | S.I.: AI based Techniques and Applications for Intelligent IoT Systems

Cleaning of object surfaces based on deep learning: a method for generating manipulator trajectories using RGB-D semantic segmentation

verfasst von: Lizhe Qi, Zhongxue Gan, Zhongwei Hua, Daming Du, Wenxuan Jiang, Yunquan Sun

Erschienen in: Neural Computing and Applications | Ausgabe 12/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

A mobile robot with a robotic arm needs to be able to autonomously perceive the operating environment and plan the trajectory of the object’s surface in order to perform surface cleaning tasks in a complex, unstructured environment. This study suggests an autonomous trajectory planning technique for cleaning an object’s surface based on RGB-D semantic segmentation, which enables the robotic arm to move the cleaning mechanism on the object’s surface smoothly and steadily and finish the cleaning process. More particularly, it contains the following: (1) A Double Attention Fusion Net (DAFNet) RGB-D semantic segmentation network is proposed, which successfully integrates color texture features and spatial structure features and enhances the semantic segmentation performance of indoor objects. This network is based on the dual attention mechanism (channel attention and spatial attention). (2) The trajectory planning algorithm for the robot arm is created, and the semantically segmented data is clustered using DBCSCAN. In order to achieve autonomous planning of the cleaning trajectory, the target subject is first extracted, and then the working trajectory of the robot arm is generated via the processes of edge detection, slicing, sampling, fitting, etc. We also compare the accuracy of DAFNet semantic segmentation and other algorithms on SUNRGBD and self-built datasets, experiment with trajectory generation for various objects, and evaluate the online surface cleaning procedure. According to the experimental findings, the DAFNet semantic segmentation model is more accurate than the current models. According to the online test, the trajectory generated has a good degree of smoothness and continuity, and the robotic arm is capable of completing the surface cleaning operation effectively.

Vorheriger Artikel HRNet- and PSPNet-based multiband semantic segmentation of remote sensing images

Nächster Artikel Multimodal deep collaborative filtering recommendation based on dual attention

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Chen H, Fuhlbrigge T, Li X (2009) A review of cad-based robot path planning for spray painting. Ind Robot Int J

Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440

Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 234–241

Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2881–2890

Chen L-C, Zhu Y, Papandreou G, Schroff F, Adam H (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European conference on computer vision (ECCV), pp 801–818

Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125

Li X, You A, Zhu Z, Zhao H, Yang M, Yang K, Tan S, Tong Y (2020) Semantic flow for fast and accurate scene parsing. In: European conference on computer vision. Springer, pp 775–793

Li X, Zhao H, Han L, Tong Y, Tan S, Yang K (2020) Gated fully fusion for semantic segmentation. Proc AAAI Conf Artif Intell 34:11418–11425

Hu P, Perazzi F, Heilbron FC, Wang O, Lin Z, Saenko K, Sclaroff S (2020) Real-time semantic segmentation with fast attention. IEEE Robot Autom Lett 6(1):263–270CrossRef

10.

Li Y, Song L, Chen Y, Li Z, Zhang X, Wang X, Sun J (2020) Learning dynamic routing for semantic segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 8553–8562

11.

Hu X, Yang K, Fei L, Wang K (2019) ACNET: attention based network to exploit complementary features for RGBD semantic segmentation. In: IEEE international conference on image processing (ICIP). IEEE, pp 1440–1444

12.

Park S-J, Hong K-S, Lee S (2017) Rdfnet: Rgb-d multi-level residual feature fusion for indoor semantic segmentation. In: Proceedings of the IEEE international conference on computer vision, pp 4980–4989

13.

Chen L-Z, Lin Z, Wang Z, Yang Y-L, Cheng M-M (2021) Spatial information guided convolution for real-time rgbd semantic segmentation. IEEE Trans Image Process 30:2313–2324CrossRef

14.

Cao J, Leng H, Lischinski D, Cohen-Or D, Tu C, Li Y (2021) Shapeconv: shape-aware convolutional layer for indoor RGB-D semantic segmentation. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 7088–7097

15.

Cui H, Dong J, Hou G, Xiao Z, Chen Y, Zhao Z (2013) Analysis on arc-welding robot visual control tracking system. In: International conference on quality, reliability, risk, maintenance, and safety engineering (QR2MSE)

16.

Martínez D, Alenya G, Torras C (2015) Planning robot manipulation to clean planar surfaces. Eng Appl Artif Intell 39:23–32CrossRef

17.

Chen W, Zhao D (2013) Path planning for spray painting robot of workpiece surfaces. Math Probl Eng. https://doi.org/10.1155/2013/659457CrossRef

18.

Gasparetto A, Vidoni R, Pillan D, Saccavini E (2012) Automatic path and trajectory planning for robotic spray painting. In: 7th German conference on robotics ROBOTIK 2012. VDE, pp 1–6

19.

Chen H, Xi N (2008) Automated tool trajectory planning of industrial robots for painting composite surfaces. Int J Adv Manuf Technol 35(7):680–696CrossRef

20.

Atkar PN, Greenfield A, Conner DC, Choset H, Rizzi AA (2005) Uniform coverage of automotive surface patches. Int J Robot Res 24(11):883–898CrossRef

21.

Wang G, Cheng J, Li R, Chen K (2015) A new point cloud slicing based path planning algorithm for robotic spray painting. In: IEEE international conference on robotics and biomimetics (ROBIO). IEEE, pp 1717–1722

22.

Wong C-C, Yeh L-Y, Liu C-C, Tsai C-Y, Aoyama H (2021) Manipulation planning for object re-orientation based on semantic segmentation keypoint detection. Sensors 21(7):2280CrossRef

23.

Yin J, Apuroop KGS, Tamilselvam YK, Mohan RE, Ramalingam B, Le AV (2020) Table cleaning task by human support robot using deep learning technique. Sensors 20(6):1698CrossRef

24.

Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19

25.

Zhao H, Jiang L, Jia J, Torr PH, Koltun V (2021) Point transformer. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 16259–16268

26.

Ester M, Kriegel H-P, Sander J, Xu X et al (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, vol 96, pp 226–231

27.

Guo G, Wang H, Bell D, Bi Y, Greer K (2003) KNN model-based approach in classification. In: OTM confederated international conferences on the move to meaningful internet systems. Springer, pp 986–996

28.

Wold S, Esbensen K, Geladi P (1987) Principal component analysis. Chemom Intell Lab Syst 2(1–3):37–52CrossRef

29.

Wu Y, Wong Y, Loh HT, Zhang Y (2004) Modelling cloud data using an adaptive slicing approach. Comput Aided Des 36(3):231–240CrossRef

30.

Woo H, Kang E, Wang S, Lee KH (2002) A new segmentation method for point cloud data. Int J Mach Tools Manuf 42(2):167–178CrossRef

31.

Fischler MA, Bolles RC (1981) Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun ACM 24(6):381–395MathSciNetCrossRef

32.

Intel RealSense\(^{\text{TM}}\) (2018) Depth module D400 series custom calibration; Intel Corporation:Santa Clara, CA, USA

Titel: Cleaning of object surfaces based on deep learning: a method for generating manipulator trajectories using RGB-D semantic segmentation
verfasst von: Lizhe Qi
Zhongxue Gan
Zhongwei Hua
Daming Du
Wenxuan Jiang
Yunquan Sun
Publikationsdatum: 07.01.2023
Verlag: Springer London
Erschienen in: Neural Computing and Applications / Ausgabe 12/2023
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI: https://doi.org/10.1007/s00521-022-07930-x

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Weitere Artikel der Ausgabe 12/2023

How visual chirality affects the performance of image hashing

Multimodal deep collaborative filtering recommendation based on dual attention

Robust amplitude-limited interval type-3 neuro-fuzzy controller for robot manipulators with prescribed performance by output feedback

Recognition of abnormal human behavior in dual-channel convolutional 3D construction site based on deep learning

NeuroGAN: image reconstruction from EEG signals via an attention-based GAN

Comparative analysis of rail transit braking digital command control strategies based on neural network

Premium Partner