Skip to main content
Erschienen in: International Journal of Computer Assisted Radiology and Surgery 11/2016

01.11.2016 | Original Article

Shot boundary detection in endoscopic surgery videos using a variational Bayesian framework

verfasst von: Constantinos Loukas, Nikolaos Nikiteas, Dimitrios Schizas, Evangelos Georgiou

Erschienen in: International Journal of Computer Assisted Radiology and Surgery | Ausgabe 11/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Purpose

Over the last decade, the demand for content management of video recordings of surgical procedures has greatly increased. Although a few research methods have been published toward this direction, the related literature is still in its infancy. In this paper, we address the problem of shot detection in endoscopic surgery videos, a fundamental step in content-based video analysis.

Methods

The video is first decomposed into short clips that are processed sequentially. After feature extraction, we employ spatiotemporal Gaussian mixture models (GMM) for each clip and apply a variational Bayesian (VB) algorithm to approximate the posterior distribution of the model parameters. The proper number of components is handled automatically by the VBGMM algorithm. The estimated components are matched along the video sequence via their Kullback–Leibler divergence. Shot borders are defined when component tracking fails, signifying a different visual appearance of the surgical scene.

Results

Experimental evaluation was performed on laparoscopic videos containing a variable number of shots. Performance was measured via precision, recall, coverage and overflow metrics. The proposed method was compared with GMM and a shot detection method based on spatiotemporal motion differences (MotionDiff). The results demonstrate that VBGMM has higher performance than all other methods for most assessment metrics: precision and recall >80 %, coverage: 84 %. Overflow for VBGMM was worse than MotionDiff (37 vs. 27 %).

Conclusions

The proposed method generated promising results for shot border detection. Spatiotemporal modeling via VBGMMs provides a means to explore additional applications such as component tracking.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abdelsattar JM, Pandian TK, Finnesgard EJ, El Khatib MM, Rowse PG, Buckarma EH, Gas BL, Heller SF, Farley DR (2015) Do you see what I see? How we use video as an adjunct to general surgery resident education. J Surg Educ 72:e145–e150. doi:10.1016/j.jsurg.2015.07.012 CrossRefPubMed Abdelsattar JM, Pandian TK, Finnesgard EJ, El Khatib MM, Rowse PG, Buckarma EH, Gas BL, Heller SF, Farley DR (2015) Do you see what I see? How we use video as an adjunct to general surgery resident education. J Surg Educ 72:e145–e150. doi:10.​1016/​j.​jsurg.​2015.​07.​012 CrossRefPubMed
2.
Zurück zum Zitat Zevin B, Bonrath EM, Aggarwal R, Dedy NJ, Ahmed N, Grantcharov TP (2013) Development, feasibility, validity, and reliability of a scale for objective assessment of operative performance in laparoscopic gastric bypass surgery. J Am Coll Surg 216:955–965.e8; quiz 1029–31, 1033. doi:10.1016/j.jamcollsurg.2013.01.003 Zevin B, Bonrath EM, Aggarwal R, Dedy NJ, Ahmed N, Grantcharov TP (2013) Development, feasibility, validity, and reliability of a scale for objective assessment of operative performance in laparoscopic gastric bypass surgery. J Am Coll Surg 216:955–965.e8; quiz 1029–31, 1033. doi:10.​1016/​j.​jamcollsurg.​2013.​01.​003
3.
Zurück zum Zitat Primus MJ, Schoeffmann K, Böszörmenyi L (2013) Segmentation of recorded endoscopic videos by detecting significant motion changes. In: 11th International work. Content-Based Multimed. Index. Veszprem, Hungary, pp 223–228 Primus MJ, Schoeffmann K, Böszörmenyi L (2013) Segmentation of recorded endoscopic videos by detecting significant motion changes. In: 11th International work. Content-Based Multimed. Index. Veszprem, Hungary, pp 223–228
4.
5.
Zurück zum Zitat Gao X, Li J, Shi Y (2006) A video shot boundary detection algorithm based on feature tracking. Lect Notes Comput Sci 4062:651–658CrossRef Gao X, Li J, Shi Y (2006) A video shot boundary detection algorithm based on feature tracking. Lect Notes Comput Sci 4062:651–658CrossRef
8.
Zurück zum Zitat Jacobs A, Miene A, Ioannidis GT, Herzog O (2004) Automatic shot boundary detection combining color, edge, and motion features of adjacent frames. In: TRECVID 2004 Work. Notebook papers, pp 197–206 Jacobs A, Miene A, Ioannidis GT, Herzog O (2004) Automatic shot boundary detection combining color, edge, and motion features of adjacent frames. In: TRECVID 2004 Work. Notebook papers, pp 197–206
11.
Zurück zum Zitat Giannarou S, Yang G (2010) Content-based surgical workflow representation using probabilistic motion modeling. Lect Notes Comput Sci 6326:314–323CrossRef Giannarou S, Yang G (2010) Content-based surgical workflow representation using probabilistic motion modeling. Lect Notes Comput Sci 6326:314–323CrossRef
12.
Zurück zum Zitat Twinanda AP, De Mathelin M, Padoy N (2014) Fisher kernel based task boundary retrieval in laparoscopic database with single video query. In: Golland P, Hata N, Barillot C, Hornegger J, Howe R (eds) Medical image computing and computer-assisted intervention–MICCAI 2014, Springer International Publishing, Switzerland, pp 409–416 Twinanda AP, De Mathelin M, Padoy N (2014) Fisher kernel based task boundary retrieval in laparoscopic database with single video query. In: Golland P, Hata N, Barillot C, Hornegger J, Howe R (eds) Medical image computing and computer-assisted intervention–MICCAI 2014, Springer International Publishing, Switzerland, pp 409–416
14.
Zurück zum Zitat Munzer B, Schoeffmann K, Boszormenyi L (2013) Relevance segmentation of laparoscopic videos. In: IEEE international symposium on multimedia, IEEE, Anaheim, CA, USA, pp 84–91 Munzer B, Schoeffmann K, Boszormenyi L (2013) Relevance segmentation of laparoscopic videos. In: IEEE international symposium on multimedia, IEEE, Anaheim, CA, USA, pp 84–91
16.
Zurück zum Zitat Loukas C, Georgiou E (2015) Performance comparison of various feature detector-descriptors and temporal models for video-based assessment of laparoscopic skills. Int J Med Robot Comput Assist Surg. doi:10.1002/rcs.1702 Loukas C, Georgiou E (2015) Performance comparison of various feature detector-descriptors and temporal models for video-based assessment of laparoscopic skills. Int J Med Robot Comput Assist Surg. doi:10.​1002/​rcs.​1702
17.
Zurück zum Zitat Beecks C, Schoeffmann K, Lux M, Uysal MS, Seidl T (2015) Endoscopic video retrieval: a signature-based approach for linking endoscopic images with video segments. In: Del Bimbo A, Chen S-C, Wang H, Yu H, Zimmermann R (eds) IEEE Proceedings of international symposium on multimedia Miami, FL, USA, pp 1–6 Beecks C, Schoeffmann K, Lux M, Uysal MS, Seidl T (2015) Endoscopic video retrieval: a signature-based approach for linking endoscopic images with video segments. In: Del Bimbo A, Chen S-C, Wang H, Yu H, Zimmermann R (eds) IEEE Proceedings of international symposium on multimedia Miami, FL, USA, pp 1–6
18.
Zurück zum Zitat Blum T, Feussner H, Navab N (2010) Modeling and segmentation of surgical workflow from laparoscopic video. Lect Notes Comput Sci 6363:400–407CrossRef Blum T, Feussner H, Navab N (2010) Modeling and segmentation of surgical workflow from laparoscopic video. Lect Notes Comput Sci 6363:400–407CrossRef
19.
Zurück zum Zitat Loukas C, Georgiou E (2015) Smoke detection in endoscopic surgery videos: a first step towards retrieval of semantic events. Int J Med Robot Comput Assist Surg 11:80–94. doi:10.1002/rcs.1578 CrossRef Loukas C, Georgiou E (2015) Smoke detection in endoscopic surgery videos: a first step towards retrieval of semantic events. Int J Med Robot Comput Assist Surg 11:80–94. doi:10.​1002/​rcs.​1578 CrossRef
25.
Zurück zum Zitat Hamerly G, Elkan C (2003) Learning the K in K-Means. In: Thrun S, Saul LK, Schölkopf B (eds) Adv. Neural Inf. Process. Syst. MIT press, Whistler BC, Canada, pp 281–288 Hamerly G, Elkan C (2003) Learning the K in K-Means. In: Thrun S, Saul LK, Schölkopf B (eds) Adv. Neural Inf. Process. Syst. MIT press, Whistler BC, Canada, pp 281–288
26.
Zurück zum Zitat Attias H (2000) A variational Bayesian framework for graphical models. In: Advanced neural information processing system. Neural Information Processing Systems Foundation, pp 209–215 Attias H (2000) A variational Bayesian framework for graphical models. In: Advanced neural information processing system. Neural Information Processing Systems Foundation, pp 209–215
27.
Zurück zum Zitat Corduneanu A, Bishop CM (2001) Variational Bayesian model selection for mixture distributions. In: Proceedings of 8th international conference on AI statistics. Key West, FL, USA, pp 27–34 Corduneanu A, Bishop CM (2001) Variational Bayesian model selection for mixture distributions. In: Proceedings of 8th international conference on AI statistics. Key West, FL, USA, pp 27–34
28.
Zurück zum Zitat Bishop CM (2006) Pattern recognition and machine learning. Springer, New York, pp 474–480 Bishop CM (2006) Pattern recognition and machine learning. Springer, New York, pp 474–480
Metadaten
Titel
Shot boundary detection in endoscopic surgery videos using a variational Bayesian framework
verfasst von
Constantinos Loukas
Nikolaos Nikiteas
Dimitrios Schizas
Evangelos Georgiou
Publikationsdatum
01.11.2016
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Computer Assisted Radiology and Surgery / Ausgabe 11/2016
Print ISSN: 1861-6410
Elektronische ISSN: 1861-6429
DOI
https://doi.org/10.1007/s11548-016-1431-2

Weitere Artikel der Ausgabe 11/2016

International Journal of Computer Assisted Radiology and Surgery 11/2016 Zur Ausgabe

Premium Partner