Skip to main content
Top

2014 | OriginalPaper | Chapter

Novel Mutual Information Analysis of Attentive Motion Entropy Algorithm for Sports Video Summarization

Authors : Bo-Wei Chen, Karunanithi Bharanitharan, Jia-Ching Wang, Zhounghua Fu, Jhing-Fa Wang

Published in: Advanced Technologies, Embedded and Multimedia for Human-centric Computing

Publisher: Springer Netherlands

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This study presents a novel summarization method, which utilizes attentive motion analysis, mutual information, and segmental spectro-temporal subtraction, for generating sports video abstracts. The proposed attentive motion entropy and mutual information are both based on an attentive model. To capture and detect significant segments among a video, this work uses color contrast, intensity contrast, and orientation contrast of frames to calculate saliency maps. Regional histograms of oriented gradients based on human shapes are also adopted at the preliminary stage. In the next step, a new algorithm based on mutual information is proposed to improve the smoothness problem when the system selects the boundaries of motion segments. Meanwhile, differential salient motions and oriented gradients are merged to mutual information analysis, subsequently generating an attentive curve. Furthermore, to remove non-motion boundaries, a smoothing technique based on segmental spectro-temporal subtraction is also used for selecting favorable event boundaries. The experiment results show that our proposed algorithm can detect highlights effectively and generate smooth playable clips. Compared with existing systems, the precision and recall rates of our system outperform their results by 8.6 and 11.1 %, respectively. Besides, smoothness is enhanced by 0.7 on average, which also verified feasibility of our system.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bagga A, Hu J, Zhong J, Ramesh G (2002) Multi-source combined-media video tracking for summarization. In Proceedings of the 16th IEEE international conference pattern recognition, Quebec, Canada, Aug 11–15. IEEE computer society, Washington, pp 818–821 Bagga A, Hu J, Zhong J, Ramesh G (2002) Multi-source combined-media video tracking for summarization. In Proceedings of the 16th IEEE international conference pattern recognition, Quebec, Canada, Aug 11–15. IEEE computer society, Washington, pp 818–821
2.
go back to reference Liu T, Zhang H-J, Qi F (2003) A novel video key-frame-extraction algorithm based on perceived motion energy model. IEEE trans. circuits and systems for video technology 13(10):1006–1013 Liu T, Zhang H-J, Qi F (2003) A novel video key-frame-extraction algorithm based on perceived motion energy model. IEEE trans. circuits and systems for video technology 13(10):1006–1013
3.
go back to reference Duan L-Y, Xu M, Tian Q, Xu C-S, Jin JS (2005) A unified framework for semantic shot classification in sports video. IEEE Trans Multimedia 7(6):1066–1083CrossRef Duan L-Y, Xu M, Tian Q, Xu C-S, Jin JS (2005) A unified framework for semantic shot classification in sports video. IEEE Trans Multimedia 7(6):1066–1083CrossRef
4.
go back to reference Li Z, Schuster GM, Katsaggelos AK (2005) MINMAX optimal video summarization. IEEE trans. circuits and systems for video technology, 15(10):1245–1256 Li Z, Schuster GM, Katsaggelos AK (2005) MINMAX optimal video summarization. IEEE trans. circuits and systems for video technology, 15(10):1245–1256
5.
go back to reference Liu T-Y, Ma W-Y, Zhang H-J (2005) Effective feature extraction for play detection in American football video. In: Proceedings of the 11th international multimedia modeling conference (Melbourne, Australia, Jan. 12–14). IEEE computer society, Washington, pp 164–171 Liu T-Y, Ma W-Y, Zhang H-J (2005) Effective feature extraction for play detection in American football video. In: Proceedings of the 11th international multimedia modeling conference (Melbourne, Australia, Jan. 12–14). IEEE computer society, Washington, pp 164–171
6.
go back to reference Ma Y-F, Hua X-S, Lu L, Zhang H-J (2005) A generic framework of user attention model and its application in video summarization. IEEE Trans Multimedia 7(5):907–919CrossRef Ma Y-F, Hua X-S, Lu L, Zhang H-J (2005) A generic framework of user attention model and its application in video summarization. IEEE Trans Multimedia 7(5):907–919CrossRef
7.
go back to reference Yeo B-L, Liu B (2005) Rapid scene analysis on compressed video. IEEE trans circuits and systems for video technology, 5(6):533–544 Yeo B-L, Liu B (2005) Rapid scene analysis on compressed video. IEEE trans circuits and systems for video technology, 5(6):533–544
8.
go back to reference Cernekova Z, Pitas I, Nikou C (2006) Information theory-based shot cut/fade detection and video summarization. IEEE transactions circuits and systems for video technology, 16(1):82–91 Cernekova Z, Pitas I, Nikou C (2006) Information theory-based shot cut/fade detection and video summarization. IEEE transactions circuits and systems for video technology, 16(1):82–91
9.
go back to reference Li Y, Lee S-H, Yeh C-H, Kuo C-CJ (2006) Techniques for movie content analysis and skimming: tutorial and overview on video abstraction techniques. IEEE Signal Process Mag 23(2):79–89CrossRefMATH Li Y, Lee S-H, Yeh C-H, Kuo C-CJ (2006) Techniques for movie content analysis and skimming: tutorial and overview on video abstraction techniques. IEEE Signal Process Mag 23(2):79–89CrossRefMATH
10.
go back to reference Taskiran CM, Pizlo Z, Amir A, Ponceleon D, Delp EJ (2006) Automated video program summarization using speech transcripts. IEEE Trans Multimedia 8(4):775–791CrossRef Taskiran CM, Pizlo Z, Amir A, Ponceleon D, Delp EJ (2006) Automated video program summarization using speech transcripts. IEEE Trans Multimedia 8(4):775–791CrossRef
11.
go back to reference Chen C-Y, Wang J-C, Wang J-F, Hu Y-H (2007) Event-based segmentation of sports video using motion entropy. In: Proceedings of the 9th IEEE international symposium multimedia (Taichung, Taiwan, 10–12). IEEE computer society, Washington, pp 107–111 Chen C-Y, Wang J-C, Wang J-F, Hu Y-H (2007) Event-based segmentation of sports video using motion entropy. In: Proceedings of the 9th IEEE international symposium multimedia (Taichung, Taiwan, 10–12). IEEE computer society, Washington, pp 107–111
12.
go back to reference You J, Liu G, Sun L, Li H (2007) A multiple visual models based perceptive analysis framework for multilevel video summarization. IEEE trans. circuits and systems for video technology, 17(3):273–285 You J, Liu G, Sun L, Li H (2007) A multiple visual models based perceptive analysis framework for multilevel video summarization. IEEE trans. circuits and systems for video technology, 17(3):273–285
13.
go back to reference Chen B-W, Wang J-C, Wang J-F (2009) A novel video summarization based on mining the story-structure and semantic relations among concept entities. IEEE Trans Multimedia 11(2):295–312CrossRef Chen B-W, Wang J-C, Wang J-F (2009) A novel video summarization based on mining the story-structure and semantic relations among concept entities. IEEE Trans Multimedia 11(2):295–312CrossRef
14.
go back to reference Black MJ (1996) The robust estimation of multiple motions: parametric and piecewise-smooth flow fields. Comput Vis Image Underst 63(1):75–104MathSciNetCrossRef Black MJ (1996) The robust estimation of multiple motions: parametric and piecewise-smooth flow fields. Comput Vis Image Underst 63(1):75–104MathSciNetCrossRef
15.
go back to reference Walther D, Rutishauser U, Koch C, Perona P (2005) Selective visual attention enables learning and recognition of multiple objects in cluttered scenes. Comput Vis Image Underst 100(1–2):41–63CrossRef Walther D, Rutishauser U, Koch C, Perona P (2005) Selective visual attention enables learning and recognition of multiple objects in cluttered scenes. Comput Vis Image Underst 100(1–2):41–63CrossRef
16.
go back to reference Walther D, Koch C (2006) Modeling attention to salient proto-objects. Neural Networks 19(9):1395–1407CrossRefMATH Walther D, Koch C (2006) Modeling attention to salient proto-objects. Neural Networks 19(9):1395–1407CrossRefMATH
17.
go back to reference Ma Y-F, Lu L, Zhang H-J, Li M (2002) A user attention model for video summarization. In: Proceedings of the 10th ACM international conference multimedia (Juan-les-Pins, France, Dec. 01–06). ACM Press, New York, pp 533–542 Ma Y-F, Lu L, Zhang H-J, Li M (2002) A user attention model for video summarization. In: Proceedings of the 10th ACM international conference multimedia (Juan-les-Pins, France, Dec. 01–06). ACM Press, New York, pp 533–542
18.
go back to reference Lu S, King I, Lyu MR (2005) A novel video summarization framework for document preparation and archival applications. In: Proceedings of the 2005 IEEE aerospace conference (Big Sky, Montana, United States, Mar. 05–12). IEEE computer society, Washington, 1–10 Lu S, King I, Lyu MR (2005) A novel video summarization framework for document preparation and archival applications. In: Proceedings of the 2005 IEEE aerospace conference (Big Sky, Montana, United States, Mar. 05–12). IEEE computer society, Washington, 1–10
19.
go back to reference Ngo C-W, Ma Y-F, Zhang H-J (2005) Video summarization and scene detection by graph modeling. IEEE transactions circuits and systems for video technology, 15(2):296–305 Ngo C-W, Ma Y-F, Zhang H-J (2005) Video summarization and scene detection by graph modeling. IEEE transactions circuits and systems for video technology, 15(2):296–305
20.
go back to reference Chen Y-T, Chen C-S (2008) Fast human detection using a novel boosted cascading structure with meta stages. IEEE Trans Image Proc 17(8):1452–1464CrossRef Chen Y-T, Chen C-S (2008) Fast human detection using a novel boosted cascading structure with meta stages. IEEE Trans Image Proc 17(8):1452–1464CrossRef
21.
go back to reference Kamath SD, Loizou PC (2002) A multi-band spectral subtraction method for enhancing speech corrupted by colored noise. In: Proceedings of the IEEE international conference acoustics, speech, and signal processing (Orlando, Florida, United States, May 13–17). IEEE computer society, Washington, pp 4164–4167 Kamath SD, Loizou PC (2002) A multi-band spectral subtraction method for enhancing speech corrupted by colored noise. In: Proceedings of the IEEE international conference acoustics, speech, and signal processing (Orlando, Florida, United States, May 13–17). IEEE computer society, Washington, pp 4164–4167
22.
go back to reference Zhang T, Kuo C-CJ (2001) Audio content analysis for online audiovisual data segmentation and classification. IEEE Trans Speech Audio Proc 9(4):441–457CrossRef Zhang T, Kuo C-CJ (2001) Audio content analysis for online audiovisual data segmentation and classification. IEEE Trans Speech Audio Proc 9(4):441–457CrossRef
23.
go back to reference Misra H, Vepa J, Bourlard H (2006) Multi-stream ASR: an oracle perspective. In: Proceedings of the ISCA international conference spoken language processing (Pittsburgh, Pennsylvania, United States, Sep. 17–21) Misra H, Vepa J, Bourlard H (2006) Multi-stream ASR: an oracle perspective. In: Proceedings of the ISCA international conference spoken language processing (Pittsburgh, Pennsylvania, United States, Sep. 17–21)
24.
go back to reference Gray AH, Markel JD (1974) A spectral-flatness measure for studying the autocorrelation method of linear prediction of speech analysis. IEEE Trans Acoustics, Speech and Signal Processing 22(3):207–217CrossRef Gray AH, Markel JD (1974) A spectral-flatness measure for studying the autocorrelation method of linear prediction of speech analysis. IEEE Trans Acoustics, Speech and Signal Processing 22(3):207–217CrossRef
Metadata
Title
Novel Mutual Information Analysis of Attentive Motion Entropy Algorithm for Sports Video Summarization
Authors
Bo-Wei Chen
Karunanithi Bharanitharan
Jia-Ching Wang
Zhounghua Fu
Jhing-Fa Wang
Copyright Year
2014
Publisher
Springer Netherlands
DOI
https://doi.org/10.1007/978-94-007-7262-5_117