Skip to main content
Erschienen in: International Journal of Multimedia Information Retrieval 4/2022

03.10.2022 | Regular Paper

A novel method for video shot boundary detection using CNN-LSTM approach

verfasst von: Abdelhalim Benoughidene, Faiza Titouna

Erschienen in: International Journal of Multimedia Information Retrieval | Ausgabe 4/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Due to the rapid growth of digital videos and the massive increase in video content, there is an urgent need to develop efficient automatic video content analysis mechanisms for different tasks, namely summarization, retrieval, and classification. In all these applications, one needs to identify shot boundary detection. This paper proposes a novel dual-stage approach for cut transition detection that can withstand certain illumination and motion effects. Firstly, we present a deep neural network model using the pre-trained model combined with long short-term memory LSTM network and the euclidean distance metric. Two parallel pre-trained models sharing the same weights extract the spatial features. Then, these features are fed to the LSTM and the euclidean distance metric to classify the frames into specific categories (similar or not similar). To train the model, we generated a new database containing 5000 frame pairs with two labels (similar, dissimilar) for training and 1000 frame pairs for testing from online videos. Secondly, we adopt the segment selection process to predict the shot boundaries. This preprocessing method can help improve the accuracy and speed of the VSBD algorithm. Then, cut transition detection based on the similarity model is conducted to identify the shot boundaries in the candidate segments. Experimental results on standard databases TRECVid 2001, 2007, and RAI show that the proposed approach achieves better detection rates over the state-of-the-art SBD methods in terms of the F1 score criterion.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Halim BA, Faiza T, Seridi H et al (eds) Shot boundary detection: fundamental concepts and survey. In: Seridi H et al (eds) The 1st international conference on innovative trends in computer science, CITSC 2019, Guelma, Algeria, November 20–21, 2019, CEUR workshop proceedings (CEUR-WS.org, 2019), vol 2589, pp 34–40. http://ceur-ws.org/Vol-2589/Paper6.pdf Halim BA, Faiza T, Seridi H et al (eds) Shot boundary detection: fundamental concepts and survey. In: Seridi H et al (eds) The 1st international conference on innovative trends in computer science, CITSC 2019, Guelma, Algeria, November 20–21, 2019, CEUR workshop proceedings (CEUR-WS.org, 2019), vol 2589, pp 34–40. http://​ceur-ws.​org/​Vol-2589/​Paper6.​pdf
2.
Zurück zum Zitat Pal G et al (2015) Video shot boundary detection: a review. Springer, pp 119–127 Pal G et al (2015) Video shot boundary detection: a review. Springer, pp 119–127
6.
Zurück zum Zitat Cernekova Z, Nikou C, Pitas I (2002) Shot detection in video sequences using entropy based metrics. IEEE Cernekova Z, Nikou C, Pitas I (2002) Shot detection in video sequences using entropy based metrics. IEEE
7.
Zurück zum Zitat Baber J, Afzulpurkar N, Dailey MN, Bakhtyar M (2011) Shot boundary detection from videos using entropy and local descriptor. IEEE Baber J, Afzulpurkar N, Dailey MN, Bakhtyar M (2011) Shot boundary detection from videos using entropy and local descriptor. IEEE
9.
Zurück zum Zitat Zheng J, Zou F, Shi M (2004) An efficient algorithm for video shot boundary detection. IEEE Zheng J, Zou F, Shi M (2004) An efficient algorithm for video shot boundary detection. IEEE
10.
Zurück zum Zitat Bruno E, Pellerin D (2002) Video shot detection based on linear prediction of motion, vol 1, pp 289–292. IEEE Bruno E, Pellerin D (2002) Video shot detection based on linear prediction of motion, vol 1, pp 289–292. IEEE
13.
Zurück zum Zitat Kikukawa S, Kawafuchi T (1992) Development of an automatic summary editing system for the audio-visual resources. Trans IEICE J75–A(2):204–212 Kikukawa S, Kawafuchi T (1992) Development of an automatic summary editing system for the audio-visual resources. Trans IEICE J75–A(2):204–212
14.
Zurück zum Zitat Sun J, Wan Y (2014) A novel metric for efficient video shot boundary detection. IEEE Sun J, Wan Y (2014) A novel metric for efficient video shot boundary detection. IEEE
16.
Zurück zum Zitat Swanberg D, Shu C-F, Jain RC, Niblack CW (ed.) (1993) Knowledge guided parsing in video databases. In: Niblack CW (ed) Storage and retrieval for image and video databases. SPIE Swanberg D, Shu C-F, Jain RC, Niblack CW (ed.) (1993) Knowledge guided parsing in video databases. In: Niblack CW (ed) Storage and retrieval for image and video databases. SPIE
17.
Zurück zum Zitat Li Z, Liu X, Zhang S (2016) Shot boundary detection based on multilevel difference of colour histograms. IEEE Li Z, Liu X, Zhang S (2016) Shot boundary detection based on multilevel difference of colour histograms. IEEE
18.
Zurück zum Zitat Shao H, Qu Y, Cui W (2015) Shot boundary detection algorithm based on HSV histogram and HOG feature. Atlantis Press, pp 951–957 Shao H, Qu Y, Cui W (2015) Shot boundary detection algorithm based on HSV histogram and HOG feature. Atlantis Press, pp 951–957
22.
Zurück zum Zitat Zabih RJM, Mai K (1995) A feature-based algorithm for detecting and classifying scene breaks. ACM, San Francisco, pp 189–200 Zabih RJM, Mai K (1995) A feature-based algorithm for detecting and classifying scene breaks. ACM, San Francisco, pp 189–200
26.
Zurück zum Zitat Shahraray B, Rodriguez AA, Safranek RJ, Delp EJ (eds) (1995) Scene change detection and content-based sampling of video sequences. In: Rodriguez AA, Safranek RJ, Delp EJ (eds) Digital video compression: algorithms and technologies 1995, SPIE Shahraray B, Rodriguez AA, Safranek RJ, Delp EJ (eds) (1995) Scene change detection and content-based sampling of video sequences. In: Rodriguez AA, Safranek RJ, Delp EJ (eds) Digital video compression: algorithms and technologies 1995, SPIE
29.
Zurück zum Zitat Bi J, Liu X, Lang B (2011) A novel shot boundary detection based on information theory using SVM. IEEE Bi J, Liu X, Lang B (2011) A novel shot boundary detection based on information theory using SVM. IEEE
31.
Zurück zum Zitat Nishani E, Cico B (2017) Computer vision approaches based on deep learning and neural networks: deep neural networks for video analysis of human pose estimation. IEEE Nishani E, Cico B (2017) Computer vision approaches based on deep learning and neural networks: deep neural networks for video analysis of human pose estimation. IEEE
32.
Zurück zum Zitat Xu J, Song L, Xie R (2016) Shot boundary detection using convolutional neural networks. IEEE Xu J, Song L, Xie R (2016) Shot boundary detection using convolutional neural networks. IEEE
33.
Zurück zum Zitat Hassanien A, Elgharib MA, Selim A, Hefeeda M, Matusik W (2017) Large-scale, fast and accurate shot boundary detection through spatio-temporal convolutional neural networks. CoRR. arXiv:1705.03281 Hassanien A, Elgharib MA, Selim A, Hefeeda M, Matusik W (2017) Large-scale, fast and accurate shot boundary detection through spatio-temporal convolutional neural networks. CoRR. arXiv:​1705.​03281
34.
Zurück zum Zitat Liang R, Zhu Q, Wei H, Liao S (2017) A video shot boundary detection approach based on CNN feature. IEEE. Liang R, Zhu Q, Wei H, Liao S (2017) A video shot boundary detection approach based on CNN feature. IEEE.
36.
Zurück zum Zitat Melekhov I, Kannala J, Rahtu E (2016) Siamese network features for image matching. IEEE Melekhov I, Kannala J, Rahtu E (2016) Siamese network features for image matching. IEEE
40.
Zurück zum Zitat Tippaya S, Sitjongsataporn S, Tan T, Chamnongthai K, Khan M (2015) Video shot boundary detection based on candidate segment selection and transition pattern analysis. IEEE Tippaya S, Sitjongsataporn S, Tan T, Chamnongthai K, Khan M (2015) Video shot boundary detection based on candidate segment selection and transition pattern analysis. IEEE
42.
Zurück zum Zitat Baraldi L, Grana C, Cucchiara R (2015) Shot and scene detection via hierarchical clustering for re-using broadcast video. Springer, pp 801–811 Baraldi L, Grana C, Cucchiara R (2015) Shot and scene detection via hierarchical clustering for re-using broadcast video. Springer, pp 801–811
43.
Zurück zum Zitat Souček T, Moravec J, Lokoč J (2019) Transnet: a deep network for fast detection of common shot transitions. CoRR. arXiv:1906.03363 Souček T, Moravec J, Lokoč J (2019) Transnet: a deep network for fast detection of common shot transitions. CoRR. arXiv:​1906.​03363
44.
Zurück zum Zitat Gygli M (2018) Ridiculously fast shot boundary detection with fully convolutional neural networks. IEEE Gygli M (2018) Ridiculously fast shot boundary detection with fully convolutional neural networks. IEEE
Metadaten
Titel
A novel method for video shot boundary detection using CNN-LSTM approach
verfasst von
Abdelhalim Benoughidene
Faiza Titouna
Publikationsdatum
03.10.2022
Verlag
Springer London
Erschienen in
International Journal of Multimedia Information Retrieval / Ausgabe 4/2022
Print ISSN: 2192-6611
Elektronische ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-022-00251-8

Weitere Artikel der Ausgabe 4/2022

International Journal of Multimedia Information Retrieval 4/2022 Zur Ausgabe

Premium Partner