Skip to main content

2020 | OriginalPaper | Buchkapitel

Deep Selection: A Fully Supervised Camera Selection Network for Surgery Recordings

verfasst von : Ryo Hachiuma, Tomohiro Shimizu, Hideo Saito, Hiroki Kajita, Yoshifumi Takatsume

Erschienen in: Medical Image Computing and Computer Assisted Intervention – MICCAI 2020

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Recording surgery in operating rooms is an essential task for education and evaluation of medical treatment. However, recording the desired targets, such as the surgery field, surgical tools, or doctor’s hands, is difficult because the targets are heavily occluded during surgery. We use a recording system in which multiple cameras are embedded in the surgical lamp, and we assume that at least one camera is recording the target without occlusion at any given time. As the embedded cameras obtain multiple video sequences, we address the task of selecting the camera with the best view of the surgery. Unlike the conventional method, which selects the camera based on the area size of the surgery field, we propose a deep neural network that predicts the camera selection probability from multiple video sequences by learning the supervision of the expert annotation. We created a dataset in which six different types of plastic surgery are recorded, and we provided the annotation of camera switching. Our experiments show that our approach successfully switched between cameras and outperformed three baseline methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Chen, J., Carr, P.: Autonomous camera systems: a survey. In: Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence (2014) Chen, J., Carr, P.: Autonomous camera systems: a survey. In: Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence (2014)
2.
Zurück zum Zitat Chen, J., Meng, L., Little, J.J.: Camera selection for broadcasting soccer games. In: Winter Conference on Applications of Computer Vision (WACV), pp. 427–435. IEEE (2018) Chen, J., Meng, L., Little, J.J.: Camera selection for broadcasting soccer games. In: Winter Conference on Applications of Computer Vision (WACV), pp. 427–435. IEEE (2018)
3.
Zurück zum Zitat Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2009) Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2009)
4.
Zurück zum Zitat Doubek, P., Geys, I., Svoboda, T., Van Gool, L.: Cinematographic rules applied to a camera network. In: The Fifth Workshop on Omnidirectional Vision, Camera Networks and Non-Classical Cameras, pp. 17–29. Czech Technical University, Prague, Czech Republic (2004) Doubek, P., Geys, I., Svoboda, T., Van Gool, L.: Cinematographic rules applied to a camera network. In: The Fifth Workshop on Omnidirectional Vision, Camera Networks and Non-Classical Cameras, pp. 17–29. Czech Technical University, Prague, Czech Republic (2004)
5.
Zurück zum Zitat Graves, A., Fernández, S., Schmidhuber, J.: Bidirectional LSTM networks for improved phoneme classification and recognition. In: Duch, W., Kacprzyk, J., Oja, E., Zadrożny, S. (eds.) ICANN 2005. LNCS, vol. 3697, pp. 799–804. Springer, Heidelberg (2005). https://doi.org/10.1007/11550907_126CrossRef Graves, A., Fernández, S., Schmidhuber, J.: Bidirectional LSTM networks for improved phoneme classification and recognition. In: Duch, W., Kacprzyk, J., Oja, E., Zadrożny, S. (eds.) ICANN 2005. LNCS, vol. 3697, pp. 799–804. Springer, Heidelberg (2005). https://​doi.​org/​10.​1007/​11550907_​126CrossRef
6.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778. IEEE (Jun 2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778. IEEE (Jun 2016)
7.
Zurück zum Zitat Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR) (2015) Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR) (2015)
8.
Zurück zum Zitat Li, C., Kitani, K.M.: Pixel-level hand detection in ego-centric videos. In: Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3570–3577. IEEE (July 2013) Li, C., Kitani, K.M.: Pixel-level hand detection in ego-centric videos. In: Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3570–3577. IEEE (July 2013)
9.
Zurück zum Zitat Lin, T., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: International Conference on Computer Vision (ICCV), pp. 2999–3007. IEEE, October 2017 Lin, T., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: International Conference on Computer Vision (ICCV), pp. 2999–3007. IEEE, October 2017
10.
Zurück zum Zitat Liu, Q., Rui, Y., Gupta, A., Cadiz, J.J.: Automating camera management for lecture room environments. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 442–449. ACM (2001) Liu, Q., Rui, Y., Gupta, A., Cadiz, J.J.: Automating camera management for lecture room environments. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 442–449. ACM (2001)
11.
Zurück zum Zitat Matsumoto, S., et al.: Digital video recording in trauma surgery using commercially available equipment. Scand. J. Trauma Resuscitation Emerg. Med. 21, 27–27 (2013) Matsumoto, S., et al.: Digital video recording in trauma surgery using commercially available equipment. Scand. J. Trauma Resuscitation Emerg. Med. 21, 27–27 (2013)
12.
Zurück zum Zitat Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: deep learning on point sets for 3D classification and segmentation. In: Conference on Computer Vision and Pattern Recognition (CVPR), pp. 652–660. IEEE, July 2017 Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: deep learning on point sets for 3D classification and segmentation. In: Conference on Computer Vision and Pattern Recognition (CVPR), pp. 652–660. IEEE, July 2017
13.
Zurück zum Zitat Sadri, A., Hunt, D., Rhobaye, S., Juma, A.: Video recording of surgery to improve training in plastic surgery. J. Plast. Reconstr. Aesthetic Surg. 66(4), 122–123 (2013)CrossRef Sadri, A., Hunt, D., Rhobaye, S., Juma, A.: Video recording of surgery to improve training in plastic surgery. J. Plast. Reconstr. Aesthetic Surg. 66(4), 122–123 (2013)CrossRef
14.
Zurück zum Zitat Shimizu, T., Oishi, K., Hachiuma, R., Kajita, H., Yoshihumi, T., Hideo, S.: Surgery recording without occlusions by multi-view surgical videos. In: International Conference on Computer Vision Theory and Applications, February 2020 Shimizu, T., Oishi, K., Hachiuma, R., Kajita, H., Yoshihumi, T., Hideo, S.: Surgery recording without occlusions by multi-view surgical videos. In: International Conference on Computer Vision Theory and Applications, February 2020
Metadaten
Titel
Deep Selection: A Fully Supervised Camera Selection Network for Surgery Recordings
verfasst von
Ryo Hachiuma
Tomohiro Shimizu
Hideo Saito
Hiroki Kajita
Yoshifumi Takatsume
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-59716-0_40

Premium Partner