International Journal of Machine Learning and Cybernetics

5 June 2023 | Original Article

Multi-stages de-smoking model based on CycleGAN for surgical de-smoking

Authors: Xinpei Su, Qiuxia Wu

Published in: International Journal of Machine Learning and Cybernetics | Issue 11/2023

Abstract

Smoke generated during laparoscopic surgery blocks the doctor's view and severely degrades image quality, which makes surgical de-smoking a crucial task during laparoscopic surgery. Previous deep learning methods use convolutional neural networks to extract features from smoke images and restore clear images. However, because these methods are trained on simulated images, their performance degrades when they are applied to real smoke images. In this paper, we introduce cycle generative adversarial networks to bridge the gap between simulated and real surgical images, and we propose a multi-stage surgical de-smoking model based on cycle generative adversarial networks (MS-CycleGAN). The first stage is a convolutional neural network-based de-smoking module. The second stage adds a simulated-to-real module that pulls simulated smoke-free images into the real surgical domain, generating real-like smoke-free images that even the discriminator cannot distinguish from real smoke-free images. Furthermore, to make real images and de-smoked images more consistent in image feature space rather than pixel space, a perceptual loss function is used to compute the loss in feature space. MS-CycleGAN outperforms state-of-the-art de-smoking methods on both Peak Signal to Noise Ratio and Structural Similarity Index Measure. Most importantly, MS-CycleGAN achieves qualitatively superior results on real surgical smoke images.
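The abstract contrasts a loss computed in pixel space with one computed in feature space, and reports Peak Signal to Noise Ratio as an evaluation metric. The following is a minimal sketch of both ideas in plain Python, not the authors' implementation. The function `toy_features` is a hypothetical stand-in for a feature extractor: perceptual losses are commonly computed on activations of a pretrained network, and the abstract does not say which one is used. The sample images and all function names are assumptions made for illustration.

```python
import math


def mse(a, b):
    """Mean squared error between two equally sized 2-D images (lists of rows)."""
    flat_a = [p for row in a for p in row]
    flat_b = [p for row in b for p in row]
    if len(flat_a) != len(flat_b):
        raise ValueError("images must have the same number of pixels")
    return sum((x - y) ** 2 for x, y in zip(flat_a, flat_b)) / len(flat_a)


def psnr(reference, restored, max_value=255.0):
    """Peak Signal to Noise Ratio in dB: 10 * log10(max_value^2 / MSE)."""
    err = mse(reference, restored)
    if err == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / err)


def toy_features(image):
    """Hypothetical feature extractor (assumption): horizontal pixel differences.

    A real perceptual loss would use activations of a pretrained network here.
    """
    return [[row[i + 1] - row[i] for i in range(len(row) - 1)] for row in image]


def perceptual_loss(reference, restored, feature_fn=toy_features):
    """Squared error measured between feature maps instead of raw pixels."""
    return mse(feature_fn(reference), feature_fn(restored))


# Illustrative data (assumption): the second image is the first plus a
# uniform brightness offset of 5.
clean = [[10, 20, 30], [40, 50, 60]]
shifted = [[p + 5 for p in row] for row in clean]

pixel_error = mse(clean, shifted)                # 25.0
feature_error = perceptual_loss(clean, shifted)  # 0.0
quality_db = psnr(clean, shifted)                # about 34.15 dB
```

In this toy case the pixel-space error is nonzero while the feature-space error vanishes, because the chosen features ignore a uniform offset. This illustrates why the two losses can rank the same pair of images differently; which differences a real perceptual loss tolerates depends entirely on the feature extractor.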

Title: Multi-stages de-smoking model based on CycleGAN for surgical de-smoking
Authors: Xinpei Su, Qiuxia Wu
Publication date: 5 June 2023
Publisher: Springer Berlin Heidelberg
Published in: International Journal of Machine Learning and Cybernetics / Issue 11/2023
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI: https://doi.org/10.1007/s13042-023-01875-w
