Published in: International Journal of Computer Assisted Radiology and Surgery 7/2023

30.05.2023 | Original Article

Using hand pose estimation to automate open surgery training feedback

Authors: Eddie Bkheet, Anne-Lise D’Angelo, Adam Goldbraikh, Shlomi Laufer


Abstract

Purpose

This research aims to facilitate the use of state-of-the-art computer vision algorithms for the automated training of surgeons and the analysis of surgical footage. By estimating 2D hand poses, we model the movement of the practitioner’s hands, and their interaction with surgical instruments, to study their potential benefit for surgical training.

Methods

We leverage pre-trained models on a publicly available hands dataset to create our own in-house dataset of 100 open surgery simulation videos with 2D hand poses. We also assess the ability of pose estimations to segment surgical videos into gestures and tool-usage segments and compare them to kinematic sensors and I3D features. Furthermore, we introduce 6 novel surgical dexterity proxies stemming from domain experts’ training advice, all of which our framework can automatically detect given raw video footage.
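Before pose sequences can drive gesture segmentation, the per-frame keypoints are typically converted into fixed-length feature vectors. The following is a minimal sketch, not the authors' pipeline, assuming a hypothetical 21-keypoint hand layout with the wrist at index 0; it centers each pose on the wrist to make the features translation-invariant:

```python
# Sketch: turn per-frame 2D hand keypoints into translation-invariant
# feature vectors, a common preprocessing step before temporal models.
# Assumes 21 keypoints per hand with the wrist at index 0 (hypothetical layout).

def pose_to_feature(keypoints):
    """keypoints: list of 21 (x, y) tuples for one hand in one frame."""
    wrist_x, wrist_y = keypoints[0]
    feature = []
    for x, y in keypoints:
        feature.extend([x - wrist_x, y - wrist_y])  # center on the wrist
    return feature

def video_to_features(frames):
    """frames: list of per-frame keypoint lists -> list of 42-dim features."""
    return [pose_to_feature(kps) for kps in frames]
```

A real pipeline would add scale normalization and handle frames where a hand is occluded or undetected.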

Results

State-of-the-art gesture segmentation accuracy of 88.35% on the open surgery simulation dataset is achieved by fusing 2D poses with I3D features from multiple camera angles. The introduced surgical skill proxies showed significant differences between novices and experts and produced actionable feedback for improvement.
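The accuracy above comes from fusing 2D pose features with I3D features. As one illustration of how such multimodal fusion can work (the abstract does not specify the fusion mechanism), here is a sketch of early fusion by per-frame concatenation, with illustrative feature dimensions; the fused sequence would then feed a temporal segmentation model such as MS-TCN++:

```python
# Sketch: early fusion of per-frame 2D pose features and I3D features by
# concatenation. The fused sequence would then feed a temporal model
# (e.g., MS-TCN++). Feature dimensions here are illustrative.

def fuse_features(pose_seq, i3d_seq):
    """Concatenate temporally aligned per-frame feature vectors."""
    if len(pose_seq) != len(i3d_seq):
        raise ValueError("modalities must be temporally aligned")
    return [p + v for p, v in zip(pose_seq, i3d_seq)]
```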

Conclusion

This research demonstrates the benefit of pose estimations for open surgery by analyzing their effectiveness in gesture segmentation and skill assessment. Gesture segmentation using pose estimations achieved results comparable to physical sensors while being remote and markerless. Surgical dexterity proxies that rely on pose estimation showed they can support progress toward automated training feedback. We hope our findings encourage additional collaboration on novel skill proxies to make surgical training more efficient.
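The abstract does not detail the six proxies, but pose-based skill measures often build on economy-of-motion statistics. As a purely hypothetical example of such a proxy, here is a sketch of total wrist path length computed from per-frame (x, y) wrist positions; shorter paths tend to indicate more economical movement:

```python
import math

# Sketch: a hypothetical pose-based dexterity proxy -- total wrist path
# length over a video, computed from per-frame (x, y) wrist positions.
# Economy-of-motion measures like this are common in skill assessment;
# the paper's actual six proxies are not detailed in the abstract.

def path_length(wrist_positions):
    """Sum of Euclidean distances between consecutive wrist positions."""
    total = 0.0
    for (x0, y0), (x1, y1) in zip(wrist_positions, wrist_positions[1:]):
        total += math.hypot(x1 - x0, y1 - y0)
    return total
```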


Metadata
Title
Using hand pose estimation to automate open surgery training feedback
Authors
Eddie Bkheet
Anne-Lise D’Angelo
Adam Goldbraikh
Shlomi Laufer
Publication date
30.05.2023
Publisher
Springer International Publishing
Published in
International Journal of Computer Assisted Radiology and Surgery / Issue 7/2023
Print ISSN: 1861-6410
Electronic ISSN: 1861-6429
DOI
https://doi.org/10.1007/s11548-023-02947-6
