Published in: Multimedia Systems 3/2020

18.12.2019 | Regular Paper

Keyframe extraction using Pearson correlation coefficient and color moments

Authors: Reddy Mounika Bommisetty, Om Prakash, Ashish Khare


Abstract

Keyframe extraction plays a significant role in a wide variety of real-time video processing applications, such as video summarization, video management, and retrieval. A keyframe captures the whole content of its shot and contains no redundant information. Keyframe extraction algorithms face challenges because videos of different categories have different visual characteristics, so a single feature is not enough to capture the visual characteristics of a variety of videos. To tackle this problem, we propose a novel shot detection-based keyframe extraction algorithm that combines two features: the Pearson correlation coefficient (PCC) and color moments (CM). The invariance of PCC to linear transformations lets the proposed algorithm work well under varying lighting conditions. The scale and rotation invariance of color moments, in turn, helps represent complex objects that may appear in different poses and orientations. These reasons support combining the two features, which brings significant benefits for keyframe extraction in the proposed method. The proposed method detects shot boundaries using the combined feature set (PCC and CM). From each shot, the frame with the highest mean and standard deviation is selected as the keyframe. A further contribution is a new dataset, which we developed by collecting videos of different categories (movies, news, serials, animations, and personal interviews) and made available online. The proposed method is evaluated on three datasets: two publicly available datasets and the one developed by us. Its performance on these datasets is measured with four evaluation parameters: figure of merit, detection percentage, accuracy, and missing factor. The principal advantage of the proposed work is that it can detect both abrupt and gradual shot transitions. In real-time videos, it is common to have abrupt and small transitions. The experimental results show that the proposed method outperforms other state-of-the-art methods.
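
The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the decision rule, the thresholds, or the exact keyframe score, so the function names, the thresholds `pcc_thresh` and `cm_thresh`, the rule that both features must signal a change, and the score "mean plus standard deviation" are all assumptions made for illustration. Frames are assumed to be NumPy arrays of shape height x width x channels.

```python
import numpy as np


def pearson_cc(frame_a, frame_b):
    """Pearson correlation coefficient between two equally sized frames."""
    a = frame_a.astype(np.float64).ravel()
    b = frame_b.astype(np.float64).ravel()
    a -= a.mean()
    b -= b.mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    # Assumption: a constant frame (zero variance) is treated as uncorrelated.
    return float((a * b).sum() / denom) if denom > 0 else 0.0


def color_moments(frame):
    """Mean, standard deviation and skewness per channel of an H x W x C frame."""
    pixels = frame.reshape(-1, frame.shape[-1]).astype(np.float64)
    mean = pixels.mean(axis=0)
    centered = pixels - mean
    std = np.sqrt((centered ** 2).mean(axis=0))
    skew = np.cbrt((centered ** 3).mean(axis=0))
    return np.concatenate([mean, std, skew])


def detect_shot_boundaries(frames, pcc_thresh=0.5, cm_thresh=30.0):
    """Return every index i at which frame i starts a new shot.

    Hypothetical rule: a boundary is declared when the PCC between
    consecutive frames is low AND their color moments are far apart.
    """
    boundaries = []
    for i in range(1, len(frames)):
        pcc = pearson_cc(frames[i - 1], frames[i])
        cm_dist = np.linalg.norm(
            color_moments(frames[i - 1]) - color_moments(frames[i])
        )
        if pcc < pcc_thresh and cm_dist > cm_thresh:
            boundaries.append(i)
    return boundaries


def select_keyframes(frames, boundaries):
    """Pick one frame index per shot.

    Assumption: "highest mean and standard deviation" is scored as the
    sum of the frame's mean and standard deviation.
    """
    starts = [0] + list(boundaries)
    ends = list(boundaries) + [len(frames)]
    keyframes = []
    for start, end in zip(starts, ends):
        scores = [frames[k].mean() + frames[k].std() for k in range(start, end)]
        keyframes.append(start + int(np.argmax(scores)))
    return keyframes
```

Because PCC is unchanged when one frame is a positive linear transformation of the other (for example, a uniform brightness or contrast change), frames within the same shot keep a PCC close to 1 even when the lighting varies, which is the property the abstract relies on.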

Metadata
Title
Keyframe extraction using Pearson correlation coefficient and color moments
Authors
Reddy Mounika Bommisetty
Om Prakash
Ashish Khare
Publication date
18.12.2019
Publisher
Springer Berlin Heidelberg
Published in
Multimedia Systems / Issue 3/2020
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-019-00642-8
