Top

Multimedia Systems

Published in:

03-05-2021 | Special Issue Paper

Key frame extraction based on global motion statistics for team-sport videos

Authors: Yuan Yuan, Zhe Lu, Zhou Yang, Meng Jian, Lifang Wu, Zeyu Li, Xu Liu

Published in: Multimedia Systems | Issue 2/2022

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Key frame extraction is an important manner of video summarization. It can be used to interpret video content quickly. Existing approaches first partition the entire video into video clips by shot boundary detection, and then, extract key frames by frame clustering. However, in most team-sport videos, a video clip usually includes many events, and it is difficult to extract the key frames related to all of these events accurately, because different events of a game shot can have features of similar appearance. As is well known, most events in team-sport videos are attack and defense conversions, which are related to global translation. Therefore, by using fine-grained partition based on the global motion, a shot could be further partitioned into more video clips, from which more key frames could be extracted and they are related to the events. In this study, global horizontal motion is introduced to further partition video clips into fine-grained video clips. Furthermore, global motion statistics are utilized to extract candidate key frames. Finally, the representative key frames are extracted based on the spatial–temporal consistence and hierarchical clustering, and the redundant frames are removed. A dataset called SportKF is built, which includes 25 videos of 197,878 frames in 112 min and 764 key frames from four types of sports (basketball, football, American football and field hockey). The experimental results demonstrate that the proposed scheme achieves state-of-the-art performance by introducing global motion statistics.

previous article Pinyin-to-Chinese conversion on sentence-level for domain-specific applications using self-attention model

next article 3D human pose estimation with multi-scale graph convolution and hierarchical body pooling

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Youtube-8M: https://research.google.com/youtube8m/.

Abd-Almageed, W.: Online, simultaneous shot boundary detection and key frame extraction for sports videos using rank tracing. In: 2008 15th IEEE International Conference on Image Processing, pp 3200–3203 (2008)

Baker, S., Scharstein, D., Lewis, J.P., Roth, S., Black, M.J., Szeliski, R.: A database and evaluation methodology for optical flow. Int. J. Comput. Vis. 92(1), 1–31 (2011)CrossRef

Basavarajaiah, M., Sharma, P.: Survey of compressed domain video summarization techniques. ACM Comput. Surv. 52(6), 116–144 (2019)

Cernekova, Z., Pitas, I., Nikou, C.: Information theory-based shot cut/fade detection and video summarization. IEEE Trans. Circ. Syst. Video Technol. 16(1), 82–91 (2006)CrossRef

Chacon-Quesada, R., Siles-Canales, F.: Evaluation of different histogram distances for temporal segmentation in digital videos of football matches from tv broadcast. In: 2017 International Conference and Workshop on Bioinspired Intelligence (IWOBI), pp 1–7 (2017)

Chen, Y., Hu, W., Zeng, X., Li, W.: Indexing and matching of video shots based on motion and color analysis. In: 2006 9th International Conference on Control, pp 1–6 (2006)

de Avila, S.E.F., Lopes, A.P.B., da Luz, A., de Albuquerque, Araújo A.: Vsumm: A mechanism designed to produce static video summaries and a novel evaluation method. Pattern Recogn. Lett. 32(1), 56–68 (2011)CrossRef

Devanne, M., Wannous, H., Berretti, S., Pala, P., Daoudi, M., Del Bimbo, A.: 3d human action recognition by shape analysis of motion trajectories on Riemannian manifold. IEEE Trans. Cybern. 45(7), 1340–1352 (2015)CrossRef

Doulamis, N.D., Doulamis, A.D., Avrithis, Y., Kollias, S.D.: A stochastic framework for optimal key frame extraction from mpeg video databases. In: 1999 IEEE Third Workshop on Multimedia Signal Processing, pp 141–146 (1999)

10.

Evangelopoulos, G., Zlatintsi, A., Skoumas, G., Rapantzikos, K., Potamianos, A., Maragos, P., Avrithis, Y.: Video event detection and summarization using audio, visual and text saliency. In: 2009 IEEE International Conference on Acoustics, pp 3553–3556 (2009)

11.

Gharbi, H., Bahroun, S., Massaoudi, M., Zagrouba, E.: Key frames extraction using graph modularity clustering for efficient video summarization. In: 2017 IEEE International Conference on Acoustics, pp 1502–1506 (2017)

12.

Gunawardena, P., Sudarshana, H., Amila, O., Nawaratne, R., Alahakoon, D., Perera, A.S., Chitraranjan, C.: Interest-oriented video summarization with keyframe extraction. In: 2019 19th International Conference on Advances in ICT for Emerging Regions, vol 250, pp 1–8 (2019)

13.

Hannane, R., Elboushaki, A., Afdel, K., Naghabhushan, P., Javed, M.: An efficient method for video shot boundary detection and keyframe extraction using sift-point distribution histogram. Int. J. Multimed. Inf. Retrieval 5(2), 89–104 (2016)CrossRef

14.

Huang, C., Wang, H.: A novel key-frames selection framework for comprehensive video summarization. IEEE Trans. Circuits Syst. Video Technol. 30(2), 577–589 (2020)CrossRef

15.

Huayong, L., Tao, L.: Key frame extraction based on improved frame blocks features and second extraction. In: 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery, pp 1950–1955 (2015)

16.

Ilg, E., Mayer, N., Saikia, T., Keuper, M., Dosovitskiy, A., Brox, T.: Flownet 2.0: Evolution of optical flow estimation with deep networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 1647–1655 (2017)

17.

Ioannidis, A., Chasanis, V., Likas, A.: Weighted multi-view key-frame extraction. Pattern Recogn. Lett. 72, 52–61 (2016)CrossRef

18.

Kumar, K.: Evs-dk: event video skimming using deep keyframe. J. Vis. Commun. Image Represent. 58, 345–352 (2019)CrossRef

19.

Kumar, K., Shrimankar, D.D.: Deep event learning boost-up approach: Delta. Multimed. Tools Appl. 77, 26635–26655 (2018)CrossRef

20.

Kumar, K., Shrimankar, D.D.: F-des: Fast and deep event summarization. IEEE Trans. Multimed. 20(2), 323–334 (2018)CrossRef

21.

Kumar, K., Shrimankar, D.D., Singh, N.: Equal partition based clustering approach for event summarization in videos. In: 2016 12th International Conference on Signal-Image Technology Internet-Based Systems (SITIS), pp 119–126 (2016)

22.

Kumar, K., Shrimankar, D.D., Singh, N.: Event bagging: A novel event summarization approach in multiview surveillance videos. In: 2017 International Conference on Innovations in Electronics, Signal Processing and Communication (IESC), pp 106–111 (2017)

23.

Kumar, K., Shrimankar, D.D., Singh, N.: Eratosthenes sieve based key-frame extraction technique for event summarization in videos. Multimed. Tools Appl. 77, 7383–7404 (2018)CrossRef

24.

Kumar, K., Shrimankar, D.D., Singh, N.: V-less: a video from linear event summaries. Adv. Intell. Syst. Comput. 703, 385–395 (2018)

25.

Kumar, K., Shrimankar, D.D., Singh, N.: Key-lectures: keyframes extraction in video lectures. Mach. Intell. Signal Anal. 748, 453–459 (2019)CrossRef

26.

Lin, Y., Lian, F.: Data reduction based on keyframe with motion energy extraction rules. In: 2014 IEEE International Conference on Information and Automation (ICIA), pp 507–512 (2014)

27.

Mazloom, M., Habibian, A., Liu, D., Snoek, C.G., Chang, S.F.: Encoding concept prototypes for video event detection and summarization. In: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, pp 123–130 (2015)

28.

Mendi, E., Clemente, H.B., Bayrak, C.: Sports video summarization based on motion analysis. Comput. Electr. Eng. 39(3), 790–796 (2013)CrossRef

29.

Mounika, B.R., Prakash, O., Khare, A.: Key frame extraction using uniform local binary pattern. In: 2018 Second International Conference on Advances in Computing, Control and Communication Technology, pp 87–91 (2018)

30.

Nasreen, A., Roy, K., Roy, K., Shobha, G.: Key frame extraction and foreground modelling using k-means clustering. In: 2015 7th International Conference on Computational Intelligence, Communication Systems and Networks, pp 141–145 (2015)

31.

Peng, X., Lexing, X., Shih-Fu, C., Divakaran, A., Vetro, A., Huifang, S. (2001) Algorithms and system for segmentation and structure analysis in soccer video. In: IEEE International Conference on Multimedia and Expo, pp 721–724

32.

Qu, Z., Gao, T.F.: An improved algorithm of keyframe extraction for video summarization. Adv. Mater. Res. 225–226(1–2), 807–811 (2011)CrossRef

33.

Rashedi, E., Mirzaei, A., Rahmati, M.: An information theoretic approach to hierarchical clustering combination. Neurocomputing 148, 487–497 (2015)CrossRef

34.

Ren, Z., Yan, J., Ni, B., Liu, B., Yang, X., Zha, H.: Unsupervised deep learning for optical flow estimation. Proceedings of the AAAI Conference on Artificial Intelligence 31(1), (2017)

35.

Sun, D., Yang, X., Liu, M., Kautz, J.: Pwc-net: Cnns for optical flow using pyramid, warping, and cost volume. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 8934–8943 (2018)

36.

Vennila, TJ., Balamurugan, V.: A stochastic framework for keyframe extraction. In: 2020 International Conference on Emerging Trends in Information Technology and Engineering, pp 1–5 (2020)

37.

Vila, R.M, Bardera, A.Q., Feixas, M.S.: (2013) Tsallis entropy-based information measures for shot boundary detection and keyframe selection. Signal Image and Video Processing pp 507–520

38.

Vázquez-Martín, R., Bandera, A.: Spatio–temporal feature-based keyframe detection from video shots using spectral clustering. Pattern Recogn. Lett. 34(7), 770–779 (2013)CrossRef

39.

Wu, L., Zhang, S., Jian, M., Lu, Z., Wang, D.: Two stage shot boundary detection via feature fusion and spatial–temporal convolutional neural networks. IEEE Access 7, 77268–77276 (2019)CrossRef

40.

Wu, L., Yang, Z., Wang, Q., Jian, M., Zhao, B., Yan, J., Chen, C.W.: Fusing motion patterns and key visual information for semantic event recognition in basketball videos. Neurocomputing 413, 217–229 (2020)CrossRef

41.

Xia, G., Chen, B., Sun, H., Liu, Q.: Nonconvex low-rank kernel sparse subspace learning for keyframe extraction and motion segmentation. IEEE Transactions on Neural Networks and Learning Systems pp 1–15 (2020)

42.

Xie, W.B., Lee, Y.L., Wang, C., Chen, D.B., Zhou, T.: Hierarchical clustering supported by reciprocal nearest neighbors. Inf. Sci. 527, 279–292 (2020)MathSciNetCrossRef

43.

Yueting, Z., Yong, R., Huang, T.S., Mehrotra, S.: Adaptive key frame extraction using unsupervised clustering. In: Proceedings 1998 International Conference on Image Processing, vol 1, pp 866–870 (1998)

44.

Zhang, X., He, Z.: Video abnormality judgment based on grayscale and optical flow detection. Chin. J. Electron. Dev. 42(3), 718–721 (2019)

45.

Zhong, D., Kumar, R., Chang, SF.: Real-time personalized sports video filtering and summarization. In: Proceedings of the Ninth ACM International Conference on Multimedia, pp 623–625 (2001)

Title: Key frame extraction based on global motion statistics for team-sport videos
Authors: Yuan Yuan
Zhe Lu
Zhou Yang
Meng Jian
Lifang Wu
Zeyu Li
Xu Liu
Publication date: 03-05-2021
Publisher: Springer Berlin Heidelberg
Published in: Multimedia Systems / Issue 2/2022
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI: https://doi.org/10.1007/s00530-021-00777-7

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Other articles of this Issue 2/2022

Lightweight feature separation, fusion and optimization networks for accurate image super-resolution

Unsupervised adversarial image retrieval

DBFC-Net: a uniform framework for fine-grained cross-media retrieval

A novel SPLIT-SIM approach for efficient image retrieval

Structure injected weight normalization for training deep networks

Watermarking techniques for three-dimensional (3D) mesh models: a survey