Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 11/2023

05.06.2023 | Original Article

Multi-stages de-smoking model based on CycleGAN for surgical de-smoking

verfasst von: Xinpei Su, Qiuxia Wu

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 11/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Smoke generated during laparoscopic surgery blocks the doctor’s sight and degrades the quality of the images severely; thus, surgical de-smoking is a crucial task during laparoscopic surgery. Previous deep learning methods extract the features of smoke images to restore clear images using convolutional neural networks. However, these methods training on simulated images result in performance degradation when generalized to real smoke images. In this paper, we introduce cycle generative adversarial networks to bridge the gap between simulated and real surgical images. Therefore, we propose a multi-stages surgical de-smoking model based on cycle generative adversarial networks(MS-CycleGAN). By leveraging the convolutional neural networks-based de-smoking module in the first stage, we additionally utilize the simulated-to-real module in the second stage to pull simulated smoke-free images to the real surgical domain, generating real-like smoke-free images that even the discriminator cannot distinguish from real smoke-free images. Furthermore, to make real images and de-smoking images more consistent in image feature space instead of pixel space, the perceptual loss function is employed to calculate the loss in feature space. MS-CycleGAN outperforms state-of-the-art de-smoking methods on the evaluation metrics of both Peak Signal to Noise Ratio and Structural Similarity Index Measure. Most importantly, our MS-CycleGAN achieves qualitatively superior results on de-smoking for real surgical smoke images.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Chen L, Tang W, John NW, Wan TR, Zhang JJ (2020) De-smokegcn: generative cooperative networks for joint surgical smoke detection and removal. IEEE Trans Med Imaging 39(5):1615–1625CrossRef Chen L, Tang W, John NW, Wan TR, Zhang JJ (2020) De-smokegcn: generative cooperative networks for joint surgical smoke detection and removal. IEEE Trans Med Imaging 39(5):1615–1625CrossRef
2.
Zurück zum Zitat Tchaka K, Pawar V M, Stoyanov D (2017). Chromaticity based smoke removal in endoscopic images. In: Med. Imaging 2017: Image Processing, pp 463–470. Tchaka K, Pawar V M, Stoyanov D (2017). Chromaticity based smoke removal in endoscopic images. In: Med. Imaging 2017: Image Processing, pp 463–470.
3.
Zurück zum Zitat Bolkar S, Wang C, Cheikh FA, Yildirim S (2018) Deep smoke removal from minimally invasive surgery videos. In: Proc. IEEE int. conf. image process, pp 3403–3407 Bolkar S, Wang C, Cheikh FA, Yildirim S (2018) Deep smoke removal from minimally invasive surgery videos. In: Proc. IEEE int. conf. image process, pp 3403–3407
4.
Zurück zum Zitat Chen D, He M, Fan Q, Liao J, Zhang L, Hou D, Yuan L, Hua G (2019) Gated context aggregation network for image dehazing and deraining. In: Proc. IEEE winter conf. appl. comput. vis., pp 1375–1383 Chen D, He M, Fan Q, Liao J, Zhang L, Hou D, Yuan L, Hua G (2019) Gated context aggregation network for image dehazing and deraining. In: Proc. IEEE winter conf. appl. comput. vis., pp 1375–1383
5.
Zurück zum Zitat Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. Proc. Adv. Neural Inf. Process. Syst. 2:2672–2680 Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. Proc. Adv. Neural Inf. Process. Syst. 2:2672–2680
6.
Zurück zum Zitat Bousmalis K, Silberman N, Dohan D, Erhan D, Krishnan D (2017) Unsupervised pixel-level domain adaptation with generative adversarial networks. In: Proc. IEEE conf. comput. vis. pattern recog., pp 3722–3731 Bousmalis K, Silberman N, Dohan D, Erhan D, Krishnan D (2017) Unsupervised pixel-level domain adaptation with generative adversarial networks. In: Proc. IEEE conf. comput. vis. pattern recog., pp 3722–3731
7.
Zurück zum Zitat Chang H, Lu J, Yu F, Finkelstein A (2018) Pairedcyclegan: asymmetric style transfer for applying and removing makeup. In: Proc. IEEE conf. comput. vis. pattern recog., pp 40–48 Chang H, Lu J, Yu F, Finkelstein A (2018) Pairedcyclegan: asymmetric style transfer for applying and removing makeup. In: Proc. IEEE conf. comput. vis. pattern recog., pp 40–48
8.
Zurück zum Zitat Zhu J-Y, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proc. IEEE conf. comput. vis. pattern recog., pp 2223–2232 Zhu J-Y, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proc. IEEE conf. comput. vis. pattern recog., pp 2223–2232
9.
Zurück zum Zitat Johnson J, Alahi A, Fei-Fei L (2016) Perceptual losses for real-time style transfer and super-resolution. In: Proc. Eur. conf. comput. vis., pp 694–711 Johnson J, Alahi A, Fei-Fei L (2016) Perceptual losses for real-time style transfer and super-resolution. In: Proc. Eur. conf. comput. vis., pp 694–711
10.
Zurück zum Zitat Hide R (1977) Optics of the atmosphere: scattering by molecules and particles. Phys Bull 28(11):521CrossRef Hide R (1977) Optics of the atmosphere: scattering by molecules and particles. Phys Bull 28(11):521CrossRef
11.
Zurück zum Zitat He K, Sun J, Tang X (2010) Single image haze removal using dark channel prior. IEEE Trans Pattern Anal Mach Intell 33(12):2341–2353 He K, Sun J, Tang X (2010) Single image haze removal using dark channel prior. IEEE Trans Pattern Anal Mach Intell 33(12):2341–2353
12.
Zurück zum Zitat Zhu Q, Mai J, Shao L (2015) A fast single image haze removal algorithm using color attenuation prior. IEEE Trans Image Process 24(11):3522–3533MathSciNetCrossRefMATH Zhu Q, Mai J, Shao L (2015) A fast single image haze removal algorithm using color attenuation prior. IEEE Trans Image Process 24(11):3522–3533MathSciNetCrossRefMATH
13.
Zurück zum Zitat Berman D, treibitz T, Avidan S (2016) Non-local image dehazing. In: Proc. IEEE conf. comput. vis. pattern recog., pp 1674–1682 Berman D, treibitz T, Avidan S (2016) Non-local image dehazing. In: Proc. IEEE conf. comput. vis. pattern recog., pp 1674–1682
14.
Zurück zum Zitat Wang C, Cheikh FA, Kaaniche M, Beghdadi A, Elle OJ (2018) Variational based smoke removal in laparoscopic images. Biomed Eng Online 17(1):1–18CrossRef Wang C, Cheikh FA, Kaaniche M, Beghdadi A, Elle OJ (2018) Variational based smoke removal in laparoscopic images. Biomed Eng Online 17(1):1–18CrossRef
15.
Zurück zum Zitat Kotwal A, Bhalodia R, Awate SP (2016) Joint desmoking and denoising of laparoscopy images. In: Proc. IEEE comput. soc. conf. comput. vis. pattern recogn., pp 1050–1054 Kotwal A, Bhalodia R, Awate SP (2016) Joint desmoking and denoising of laparoscopy images. In: Proc. IEEE comput. soc. conf. comput. vis. pattern recogn., pp 1050–1054
16.
Zurück zum Zitat Baid A, Kotwal A, Bhalodia R, Merchant S, Awate SP (2017) Joint desmoking, specularity removal, and denoising of laparoscopy images via graphical models and Bayesian inference. In: Proc. IEEE comput. soc. conf. comput. vis. pattern recogn., pp 732–736 Baid A, Kotwal A, Bhalodia R, Merchant S, Awate SP (2017) Joint desmoking, specularity removal, and denoising of laparoscopy images via graphical models and Bayesian inference. In: Proc. IEEE comput. soc. conf. comput. vis. pattern recogn., pp 732–736
17.
Zurück zum Zitat Luo X, McLeod AJ, Pautler SE, Schlachta CM, Peters TM (2017) Vision-based surgical field defogging. IEEE Trans Med Imaging 36(10):2021–2030CrossRef Luo X, McLeod AJ, Pautler SE, Schlachta CM, Peters TM (2017) Vision-based surgical field defogging. IEEE Trans Med Imaging 36(10):2021–2030CrossRef
18.
Zurück zum Zitat Cai B, Xu X, Jia K, Qing C, Tao D (2016) Dehazenet: an end-to-end system for single image haze removal. IEEE Trans Image Process 25(11):5187–5198MathSciNetCrossRefMATH Cai B, Xu X, Jia K, Qing C, Tao D (2016) Dehazenet: an end-to-end system for single image haze removal. IEEE Trans Image Process 25(11):5187–5198MathSciNetCrossRefMATH
19.
Zurück zum Zitat Li B, Peng X, Wang Z, Xu J, Feng D (2017) Aod-net: all-in-one dehazing network. In: Proc. IEEE conf. comput. vis. pattern recog., pp 4770–4778 Li B, Peng X, Wang Z, Xu J, Feng D (2017) Aod-net: all-in-one dehazing network. In: Proc. IEEE conf. comput. vis. pattern recog., pp 4770–4778
20.
Zurück zum Zitat Kanakatte A, Seemakurthy K, Gubbi J, Saha J, Ghose A, Purushothaman B (2021) Surgical smoke dehazing and color reconstruction. In: Proc. IEEE comput. soc. conf. comput. vis. pattern recogn. IEEE, pp 280–284 Kanakatte A, Seemakurthy K, Gubbi J, Saha J, Ghose A, Purushothaman B (2021) Surgical smoke dehazing and color reconstruction. In: Proc. IEEE comput. soc. conf. comput. vis. pattern recogn. IEEE, pp 280–284
21.
Zurück zum Zitat Ren W, Ma L, Zhang J, Pan J, Cao X, Liu W, Yang M-H (2018) Gated fusion network for single image dehazing. In: Proc. IEEE conf. comput. vis. pattern recogn., pp 3253–3261 Ren W, Ma L, Zhang J, Pan J, Cao X, Liu W, Yang M-H (2018) Gated fusion network for single image dehazing. In: Proc. IEEE conf. comput. vis. pattern recogn., pp 3253–3261
22.
Zurück zum Zitat Wang C, Mohammed AK, Cheikh FA, Beghdadi A, Elle OJ (2019) Multiscale deep desmoking for laparoscopic surgery. In: Med. imaging 2019: image process, vol 10949, pp 109491Y–1 Wang C, Mohammed AK, Cheikh FA, Beghdadi A, Elle OJ (2019) Multiscale deep desmoking for laparoscopic surgery. In: Med. imaging 2019: image process, vol 10949, pp 109491Y–1
23.
Zurück zum Zitat Sengar V, Seemakurthy K, Gubbi J (2021) Multi-task learning based approach for surgical video desmoking. In: Proceedings of the twelfth Indian conference on computer vision, graphics and image processing, pp 1–9 Sengar V, Seemakurthy K, Gubbi J (2021) Multi-task learning based approach for surgical video desmoking. In: Proceedings of the twelfth Indian conference on computer vision, graphics and image processing, pp 1–9
24.
Zurück zum Zitat Azam MA, Khan KB, Rehman E, Khan SU (2022) Smoke removal and image enhancement of laparoscopic images by an artificial multi-exposure image fusion method. Soft Comput 26:8003–8015CrossRef Azam MA, Khan KB, Rehman E, Khan SU (2022) Smoke removal and image enhancement of laparoscopic images by an artificial multi-exposure image fusion method. Soft Comput 26:8003–8015CrossRef
25.
Zurück zum Zitat Bai H, Pan J, Xiang X, Tang J (2022) Self-guided image dehazing using progressive feature fusion. IEEE Trans Image Process 31:1217–1229CrossRef Bai H, Pan J, Xiang X, Tang J (2022) Self-guided image dehazing using progressive feature fusion. IEEE Trans Image Process 31:1217–1229CrossRef
26.
Zurück zum Zitat Salazar-Colores S, Jiménez HM, Ortiz-Echeverri CJ, Flores G (2020) Desmoking laparoscopy surgery images using an image-to-image translation guided by an embedded dark channel. IEEE Access 8:208898–208909CrossRef Salazar-Colores S, Jiménez HM, Ortiz-Echeverri CJ, Flores G (2020) Desmoking laparoscopy surgery images using an image-to-image translation guided by an embedded dark channel. IEEE Access 8:208898–208909CrossRef
27.
Zurück zum Zitat Vishal V, Sharma N, Singh M (2019) Guided unsupervised desmoking of laparoscopic images using cycle-desmoke. OR 2.0 context-aware operating theaters and machine learning in clinical neuroimaging. Springer, New York, pp 21–28CrossRef Vishal V, Sharma N, Singh M (2019) Guided unsupervised desmoking of laparoscopic images using cycle-desmoke. OR 2.0 context-aware operating theaters and machine learning in clinical neuroimaging. Springer, New York, pp 21–28CrossRef
28.
Zurück zum Zitat Venkatesh V, Sharma N, Srivastava V, Singh M (2020) Unsupervised smoke to desmoked laparoscopic surgery images using contrast driven cyclic-desmokegan. Comput Biol Med 123:103873CrossRef Venkatesh V, Sharma N, Srivastava V, Singh M (2020) Unsupervised smoke to desmoked laparoscopic surgery images using contrast driven cyclic-desmokegan. Comput Biol Med 123:103873CrossRef
29.
Zurück zum Zitat Huang Y, Chen X, Xu L, Li K (2021) Single image desmoking via attentive generative adversarial network for smoke detection process. Fire Technol 57(6):3021–3040CrossRef Huang Y, Chen X, Xu L, Li K (2021) Single image desmoking via attentive generative adversarial network for smoke detection process. Fire Technol 57(6):3021–3040CrossRef
30.
Zurück zum Zitat Wu H, Qu Y, Lin S, Zhou J, Qiao R, Zhang Z, Xie Y, Ma L (2021) Contrastive learning for compact single image dehazing. In: Proc. IEEE conf. comput. vis. pattern recogn., pp 10551–10560 Wu H, Qu Y, Lin S, Zhou J, Qiao R, Zhang Z, Xie Y, Ma L (2021) Contrastive learning for compact single image dehazing. In: Proc. IEEE conf. comput. vis. pattern recogn., pp 10551–10560
31.
Zurück zum Zitat Chen X, Fan Z, Li P, Dai L, Kong C, Zheng Z, Huang Y, Li Y (2022) Unpaired deep image dehazing using contrastive disentanglement learning. In: European conference on computer vision. Springer, pp 632–648 Chen X, Fan Z, Li P, Dai L, Kong C, Zheng Z, Huang Y, Li Y (2022) Unpaired deep image dehazing using contrastive disentanglement learning. In: European conference on computer vision. Springer, pp 632–648
32.
Zurück zum Zitat Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proc. IEEE conf. comput. vis. pattern recogn., pp 2117–2125 Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proc. IEEE conf. comput. vis. pattern recogn., pp 2117–2125
33.
Zurück zum Zitat Kirillov A, Girshick R, He K, Dollar P (2019) Panoptic feature pyramid networks. In: Proc. IEEE conf. comput. vis. pattern recogn., pp 6399–6408 Kirillov A, Girshick R, He K, Dollar P (2019) Panoptic feature pyramid networks. In: Proc. IEEE conf. comput. vis. pattern recogn., pp 6399–6408
34.
Zurück zum Zitat Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556
35.
Zurück zum Zitat Twinanda AP, Shehata S, Mutter D, Marescaux J, De Mathelin M, Padoy N (2016) Endonet: a deep architecture for recognition tasks on laparoscopic videos. IEEE Trans Med Imaging 36(1):86–97CrossRef Twinanda AP, Shehata S, Mutter D, Marescaux J, De Mathelin M, Padoy N (2016) Endonet: a deep architecture for recognition tasks on laparoscopic videos. IEEE Trans Med Imaging 36(1):86–97CrossRef
36.
Zurück zum Zitat Leibetseder A, Primus MJ, Petscharnig S, Schoeffmann K (2017) Real-time image-based smoke detection in endoscopic videos. In: Proc. themat. workshops ACM multimed., pp 296–304 Leibetseder A, Primus MJ, Petscharnig S, Schoeffmann K (2017) Real-time image-based smoke detection in endoscopic videos. In: Proc. themat. workshops ACM multimed., pp 296–304
37.
Zurück zum Zitat Hore A, Ziou D (2010) Image quality metrics: Psnr vs. ssim. In: Int. conf. pattern recognit., pp 2366–2369 Hore A, Ziou D (2010) Image quality metrics: Psnr vs. ssim. In: Int. conf. pattern recognit., pp 2366–2369
38.
Zurück zum Zitat Shao Y, Li L, Ren W, Gao C, Sang N (2020) Domain adaptation for image dehazing. In: Proc. IEEE conf. comput. vis. pattern recog., pp 2808–2817 Shao Y, Li L, Ren W, Gao C, Sang N (2020) Domain adaptation for image dehazing. In: Proc. IEEE conf. comput. vis. pattern recog., pp 2808–2817
39.
Zurück zum Zitat Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: Proc. int. conf. med. image comp. comput.-assisted intervention, pp 234–241 Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: Proc. int. conf. med. image comp. comput.-assisted intervention, pp 234–241
40.
Zurück zum Zitat Chaurasia A, Culurciello E (2017) Linknet: exploiting encoder representations for efficient semantic segmentation. In: Proc. IEEE vis. commun. image process., pp 1–4 Chaurasia A, Culurciello E (2017) Linknet: exploiting encoder representations for efficient semantic segmentation. In: Proc. IEEE vis. commun. image process., pp 1–4
41.
Zurück zum Zitat Cheng K, You J, Wu S, Chen Z, Zhou Z, Guan J, Peng B, Wang X (2021) Artificial intelligence-based automated laparoscopic cholecystectomy surgical phase recognition and analysis. Surg Endosc 36(5):3160–3168CrossRef Cheng K, You J, Wu S, Chen Z, Zhou Z, Guan J, Peng B, Wang X (2021) Artificial intelligence-based automated laparoscopic cholecystectomy surgical phase recognition and analysis. Surg Endosc 36(5):3160–3168CrossRef
42.
Zurück zum Zitat Jin Y, Long Y, Chen C, Zhao Z, Dou Q, Heng P-A (2021) Temporal memory relation network for workflow recognition from surgical video. IEEE Trans Med Imaging 40(7):1911–1923CrossRef Jin Y, Long Y, Chen C, Zhao Z, Dou Q, Heng P-A (2021) Temporal memory relation network for workflow recognition from surgical video. IEEE Trans Med Imaging 40(7):1911–1923CrossRef
43.
Zurück zum Zitat Kondo S (2021) Lapformer: surgical tool detection in laparoscopic surgical video using transformer architecture. Computer Methods Biomech Biomed Eng Imaging Vis 9(3):302–307CrossRef Kondo S (2021) Lapformer: surgical tool detection in laparoscopic surgical video using transformer architecture. Computer Methods Biomech Biomed Eng Imaging Vis 9(3):302–307CrossRef
44.
Zurück zum Zitat Yi F, Jiang T (2021) Not end-to-end: Explore multi-stage architecture for online surgical phase recognition. arXiv preprint. arXiv:2107.04810 Yi F, Jiang T (2021) Not end-to-end: Explore multi-stage architecture for online surgical phase recognition. arXiv preprint. arXiv:​2107.​04810
45.
Zurück zum Zitat Loukas C (2018) Surgical phase recognition of short video shots based on temporal modeling of deep features. arXiv preprint. arXiv:1807.07853 Loukas C (2018) Surgical phase recognition of short video shots based on temporal modeling of deep features. arXiv preprint. arXiv:​1807.​07853
46.
Zurück zum Zitat Yang Y, Zhao Z, Shi P, Hu S (2021) An efficient one-stage detector for real-time surgical tools detection in robot-assisted surgery. Annual conference on medical image understanding and analysis. Springer, Berlin, pp 18–29 Yang Y, Zhao Z, Shi P, Hu S (2021) An efficient one-stage detector for real-time surgical tools detection in robot-assisted surgery. Annual conference on medical image understanding and analysis. Springer, Berlin, pp 18–29
47.
Zurück zum Zitat Gao X, Jin Y, Long Y, Dou Q, Heng P-A (2021) Trans-svnet: accurate phase recognition from surgical videos via hybrid embedding aggregation transformer. International conference on medical image computing and computer-assisted intervention. Springer, New York, pp 593–603 Gao X, Jin Y, Long Y, Dou Q, Heng P-A (2021) Trans-svnet: accurate phase recognition from surgical videos via hybrid embedding aggregation transformer. International conference on medical image computing and computer-assisted intervention. Springer, New York, pp 593–603
Metadaten
Titel
Multi-stages de-smoking model based on CycleGAN for surgical de-smoking
verfasst von
Xinpei Su
Qiuxia Wu
Publikationsdatum
05.06.2023
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 11/2023
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-023-01875-w

Weitere Artikel der Ausgabe 11/2023

International Journal of Machine Learning and Cybernetics 11/2023 Zur Ausgabe

Neuer Inhalt