Skip to main content

2014 | OriginalPaper | Buchkapitel

9. Scalable Video Genre Classification and Event Detection

verfasst von : Paisarn Muneesawang, Ning Zhang, Ling Guan

Erschienen in: Multimedia Database Retrieval

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This chapter focuses on a systematic and generic approach which is experimented on scalable video genre classification and event detection. The system aims at the event detection scenario of an input video with an orderly sequential process. Initially, domain-knowledge independent local descriptors are extracted homogeneously from the input video sequence. Then the video representation is created by adopting a Bag-of-word (BoW) model. The video’s genre is firstly identified by applying the k-nearest neighbor (k-NN) classifiers on the initially obtained video representation. Various dissimilarity measures are assessed and evaluated analytically. Then, at the high-level event detection, a hidden conditional random field (HCRF) structured prediction model is utilized for interesting event detection. The input of this event detection relies on middle-level view agents in characterizing each frame of video sequence into one of four view groups, namely closed-up-view, mid-view, long-view and outer-field-view. Unsupervised probabilistic latent semantic analysis (PLSA) based approach is employed at the histogram-based video representation to achieve these middle-level view groups. The framework demonstrates the efficiency and generality in processing voluminous video collection and achieves various tasks in video analysis. The affectiveness of the framework is justified by extensive experimentation. Results are compared with benchmarks and state of the art algorithms. Limited human expertise and effort is involved in both domain-knowledge independent video representation and annotation free unsupervised view labeling. As a result, such a systematic and scalable approach can be widely applied in processing massive videos generically.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
231.
Zurück zum Zitat J. Sivic, A. Zisserman.: Video Google: Efficient visual search of videos. Toward Category-Level Object Recognition, 127–144, (2006) J. Sivic, A. Zisserman.: Video Google: Efficient visual search of videos. Toward Category-Level Object Recognition, 127–144, (2006)
232.
Zurück zum Zitat J. Sivic, A. Zisserman.: Video data mining using configurations of viewpoint invariant regions. Proc. IEEE CVPR, 479–488 (2004) J. Sivic, A. Zisserman.: Video data mining using configurations of viewpoint invariant regions. Proc. IEEE CVPR, 479–488 (2004)
233.
Zurück zum Zitat T. Quack, V. Ferrari, L. Van Gool.: Video mining with frequent itemset configurations. Image and Video Retrieval, 360–369 (2006) T. Quack, V. Ferrari, L. Van Gool.: Video mining with frequent itemset configurations. Image and Video Retrieval, 360–369 (2006)
234.
Zurück zum Zitat J. Sivic, A. Zisserman.: Efficient visual search for objects in videos. Proceedings of the IEEE, vol. 96, no. 4, 548–566 (2008) J. Sivic, A. Zisserman.: Efficient visual search for objects in videos. Proceedings of the IEEE, vol. 96, no. 4, 548–566 (2008)
235.
Zurück zum Zitat J. Sivic, F. Schaffalitzky, A. Zisserman.: Object level grouping for video shots. Proc. Computer Vision-ECCV 2004, 85–98, (2004) J. Sivic, F. Schaffalitzky, A. Zisserman.: Object level grouping for video shots. Proc. Computer Vision-ECCV 2004, 85–98, (2004)
236.
Zurück zum Zitat Y. Jiang, C. Ngo, and J. Yang.: Towards optimal bag-of-features for object categorization and semantic video retrieval. Proc. ACM CIVR, 501–510 (2007) Y. Jiang, C. Ngo, and J. Yang.: Towards optimal bag-of-features for object categorization and semantic video retrieval. Proc. ACM CIVR, 501–510 (2007)
237.
Zurück zum Zitat J. Sivic, A. Zisserman.: Efficient visual search of videos cast as text retrieval. IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 31, no. 4, 591–606 (2009) J. Sivic, A. Zisserman.: Efficient visual search of videos cast as text retrieval. IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 31, no. 4, 591–606 (2009)
238.
Zurück zum Zitat A. Basharat, Y. Zhai, and M. Shah.: Content based video matching using spatiotemporal volumes. Computer Vision and Image Understanding, vol. 110, no. 3, 360–377 (2008) A. Basharat, Y. Zhai, and M. Shah.: Content based video matching using spatiotemporal volumes. Computer Vision and Image Understanding, vol. 110, no. 3, 360–377 (2008)
239.
Zurück zum Zitat J. Law-To, O. Buisson, V. Gouet-Brunet, N. Boujemaa.: Robust voting algorithm based on labels of behavior for video copy detection. Proc. ACM Multimedia, 835–844 (2006) J. Law-To, O. Buisson, V. Gouet-Brunet, N. Boujemaa.: Robust voting algorithm based on labels of behavior for video copy detection. Proc. ACM Multimedia, 835–844 (2006)
240.
Zurück zum Zitat J. Sivic, M. Everingham, A. Zisserman.: Person spotting: video shot retrieval for face sets. Image and Video Retrieval, 592–592 (2005) J. Sivic, M. Everingham, A. Zisserman.: Person spotting: video shot retrieval for face sets. Image and Video Retrieval, 592–592 (2005)
241.
Zurück zum Zitat X. Zhou, X. Zhuang, S. Yan, S. Chang, M. Hasegawa-Johnson, T. Huang.: Sift-bag kernel for video event analysis. Proc. ACM Multimedia, 229–238 (2008) X. Zhou, X. Zhuang, S. Yan, S. Chang, M. Hasegawa-Johnson, T. Huang.: Sift-bag kernel for video event analysis. Proc. ACM Multimedia, 229–238 (2008)
242.
Zurück zum Zitat D. Xu, S. Chang.: Video event recognition using kernel methods with multilevel temporal alignment. IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 30, no. 11, 1985–1997 (2008) D. Xu, S. Chang.: Video event recognition using kernel methods with multilevel temporal alignment. IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 30, no. 11, 1985–1997 (2008)
243.
Zurück zum Zitat P. Xu, L. Xie, S. Chang, A. Divakaran, A. Vetro, H. Sun.: Algorithms and system for segmentation and structure analysis in soccer video. Proc. IEEE ICME, 928–931 (2001) P. Xu, L. Xie, S. Chang, A. Divakaran, A. Vetro, H. Sun.: Algorithms and system for segmentation and structure analysis in soccer video. Proc. IEEE ICME, 928–931 (2001)
244.
Zurück zum Zitat A. Ekin, A. Tekalp.: Framework for tracking and analysis of soccer video. Proc. SPIE VCIP, vol. 4671, 763–774 (2002) A. Ekin, A. Tekalp.: Framework for tracking and analysis of soccer video. Proc. SPIE VCIP, vol. 4671, 763–774 (2002)
245.
Zurück zum Zitat L. Xu, Y. Li.: Video classification using spatial-temporal features and PCA. Proc. IEEE ICME. vol. 3, 485–488 (2003) L. Xu, Y. Li.: Video classification using spatial-temporal features and PCA. Proc. IEEE ICME. vol. 3, 485–488 (2003)
246.
Zurück zum Zitat S. Nepal, U. Srinivasan, G. Reynolds.: Automatic detection of “Goal” segments in basketball videos. Proc. ACM MM, 261–269 (2001) S. Nepal, U. Srinivasan, G. Reynolds.: Automatic detection of “Goal” segments in basketball videos. Proc. ACM MM, 261–269 (2001)
247.
Zurück zum Zitat G. Zhu, C. Xu, Q. Huang, Y. Rui, S. Jiang, W. Gao, H. Yao.: Event tactic analysis based on broadcast sports video. IEEE Transactions on Multimedia. vol. 11, no. 1, 49–67 (2009) G. Zhu, C. Xu, Q. Huang, Y. Rui, S. Jiang, W. Gao, H. Yao.: Event tactic analysis based on broadcast sports video. IEEE Transactions on Multimedia. vol. 11, no. 1, 49–67 (2009)
248.
Zurück zum Zitat S. Fischer, R. Lienhart, W. Effelsberg.: Automatic recognition of film genres. Proc. ACM MM. vol. 95, 295–304 (1995) S. Fischer, R. Lienhart, W. Effelsberg.: Automatic recognition of film genres. Proc. ACM MM. vol. 95, 295–304 (1995)
249.
Zurück zum Zitat D. Brezeale, D. Cook.: Automatic video classification: A survey of the literature. IEEE Trans. on Systems, Man, Cybernetics, Part C: Applications and Reviews. vol. 38, no. 3, 416–430 (2008) D. Brezeale, D. Cook.: Automatic video classification: A survey of the literature. IEEE Trans. on Systems, Man, Cybernetics, Part C: Applications and Reviews. vol. 38, no. 3, 416–430 (2008)
250.
Zurück zum Zitat B. Truong, C. Dorai, S. Venkatesh.: Automatic genre identification for content-based video categorization. Proc. IEEE ICPR, vol. 15, 230–233 (2000) B. Truong, C. Dorai, S. Venkatesh.: Automatic genre identification for content-based video categorization. Proc. IEEE ICPR, vol. 15, 230–233 (2000)
251.
Zurück zum Zitat S. Takagi, S. Hattori, K. Yokoyama, A. Kodate, H. Tominaga.: Sports video categorizing method using camera motion parameters. Proc. IEEE ICME, vol. 2, 461–464 (2003) S. Takagi, S. Hattori, K. Yokoyama, A. Kodate, H. Tominaga.: Sports video categorizing method using camera motion parameters. Proc. IEEE ICME, vol. 2, 461–464 (2003)
252.
Zurück zum Zitat E. Jaser, J. Kittler, W. Christmas.: Hierarchical decision making scheme for sports video categorisation with temporal post-processing. Proc. IEEE CVPR, vol. 2, 908–913 (2004) E. Jaser, J. Kittler, W. Christmas.: Hierarchical decision making scheme for sports video categorisation with temporal post-processing. Proc. IEEE CVPR, vol. 2, 908–913 (2004)
253.
Zurück zum Zitat J. Wang, C. Xu, E. Chng.: Automatic sports video genre classification using pseudo-2d-hmm. Proc. ICPR, 778–781 (2006) J. Wang, C. Xu, E. Chng.: Automatic sports video genre classification using pseudo-2d-hmm. Proc. ICPR, 778–781 (2006)
254.
Zurück zum Zitat X. Yuan, W. Lai, T. Mei, X. Hua, X. Wu, S. Li.: Automatic video genre categorization using hierarchical svm. Proc. IEEE ICIP, 2905–2908 (2006) X. Yuan, W. Lai, T. Mei, X. Hua, X. Wu, S. Li.: Automatic video genre categorization using hierarchical svm. Proc. IEEE ICIP, 2905–2908 (2006)
255.
Zurück zum Zitat R. Glasberg, S. Schmiedeke, M. Mocigemba, T. Sikora.: New Real-Time Approaches for Video-Genre-Classification Using High-Level Descriptors and a Set of Classifiers. Proc. IEEE ICSC, 120–127 (2008) R. Glasberg, S. Schmiedeke, M. Mocigemba, T. Sikora.: New Real-Time Approaches for Video-Genre-Classification Using High-Level Descriptors and a Set of Classifiers. Proc. IEEE ICSC, 120–127 (2008)
256.
Zurück zum Zitat M. Montagnuolo, A. Messina.: Parallel neural networks for multimodal video genre classification. Journal of Multimedia Tools and Applications, vol. 41, no. 1, 125–159 (2009) M. Montagnuolo, A. Messina.: Parallel neural networks for multimodal video genre classification. Journal of Multimedia Tools and Applications, vol. 41, no. 1, 125–159 (2009)
257.
Zurück zum Zitat A. Ekin, A. M. Teklap, R. Mehrotra.: Automatic soccer video analysis and summarization. IEEE Trans. on Image Processing, vol. 12, no. 7, 796–807 (2003) A. Ekin, A. M. Teklap, R. Mehrotra.: Automatic soccer video analysis and summarization. IEEE Trans. on Image Processing, vol. 12, no. 7, 796–807 (2003)
258.
Zurück zum Zitat Y. Jiang, J. Yang, C. Ngo, A. Hauptmann.: Representations of keypoint-based semantic concept detection: A comprehensive study. IEEE Trans. on Multimedia. vol. 12, no. 1, 42–53 (2010) Y. Jiang, J. Yang, C. Ngo, A. Hauptmann.: Representations of keypoint-based semantic concept detection: A comprehensive study. IEEE Trans. on Multimedia. vol. 12, no. 1, 42–53 (2010)
259.
Zurück zum Zitat D. Lowe.: Distinctive image features from scale-invariant keypoints. Int. J. of computer vision, vol. 60, no. 2, 91–110 (2004) D. Lowe.: Distinctive image features from scale-invariant keypoints. Int. J. of computer vision, vol. 60, no. 2, 91–110 (2004)
260.
Zurück zum Zitat J. Philbin, O. Chum, M. Isard, J. Sivic, A. Zisserman.: Object retrieval with large vocabularies and fast spatial matching. Proc. IEEE CVPR, vol. 3613, 1575–1589 (2007) J. Philbin, O. Chum, M. Isard, J. Sivic, A. Zisserman.: Object retrieval with large vocabularies and fast spatial matching. Proc. IEEE CVPR, vol. 3613, 1575–1589 (2007)
261.
Zurück zum Zitat J. Yang, Y. Jiang, A. Hauptmann, C. Ngo.: Evaluating bag-of-visual-words representations in scene classification. Proc. ACM MIR, 197–206 (2007) J. Yang, Y. Jiang, A. Hauptmann, C. Ngo.: Evaluating bag-of-visual-words representations in scene classification. Proc. ACM MIR, 197–206 (2007)
262.
Zurück zum Zitat S. Lazebnik, C. Schmid, J. Ponce.: Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories. Proc. IEEE CVPR, vol. 2, 2169–2178 (2006) S. Lazebnik, C. Schmid, J. Ponce.: Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories. Proc. IEEE CVPR, vol. 2, 2169–2178 (2006)
263.
Zurück zum Zitat J. Zhang, M. Marszalek, S. Lazebnik, C. Schmid.: Local features and kernels for classification of texture and object categories: A comprehensive study. Int. J. of Computer Vision. vol. 73, no. 2, 213–238 (2007) J. Zhang, M. Marszalek, S. Lazebnik, C. Schmid.: Local features and kernels for classification of texture and object categories: A comprehensive study. Int. J. of Computer Vision. vol. 73, no. 2, 213–238 (2007)
264.
Zurück zum Zitat J. Sivic, A. Zisserman.: Video Google: A text retrieval approach to object matching in videos. Proc. ICCV. vol. 2, 1470–1477 (2003) J. Sivic, A. Zisserman.: Video Google: A text retrieval approach to object matching in videos. Proc. ICCV. vol. 2, 1470–1477 (2003)
265.
Zurück zum Zitat L. Li, N. Zhang, L. Duan, Q. Huang, J. Du, L. Guan.: Automatic sports genre categorization and view-type classification over large-scale dataset. Proc. ACM MM, 653–656 (2009) L. Li, N. Zhang, L. Duan, Q. Huang, J. Du, L. Guan.: Automatic sports genre categorization and view-type classification over large-scale dataset. Proc. ACM MM, 653–656 (2009)
266.
Zurück zum Zitat G. Lavee, E. Rivlin, M. Rudzsky.: Understanding video events: A survey of methods for automatic interpretation of semantic occurrences in video. IEEE Trans. on Systems, Man, Cybernetics, Part C: Applications and Reviews, vol. 39, no. 5, 489–504 (2009) G. Lavee, E. Rivlin, M. Rudzsky.: Understanding video events: A survey of methods for automatic interpretation of semantic occurrences in video. IEEE Trans. on Systems, Man, Cybernetics, Part C: Applications and Reviews, vol. 39, no. 5, 489–504 (2009)
267.
Zurück zum Zitat D. Sadlier, N. O’Connor.: Event detection in field sports video using audio-visual features and a support vector machine. IEEE Trans. on Circuits and Systems for Video Technology. vol. 15, no. 10, 1225–1233 (2005) D. Sadlier, N. O’Connor.: Event detection in field sports video using audio-visual features and a support vector machine. IEEE Trans. on Circuits and Systems for Video Technology. vol. 15, no. 10, 1225–1233 (2005)
268.
Zurück zum Zitat M. Xu, L. Duan, C. Xu, Q. Tian.: A fusion scheme of visual and auditory modalities for event detection in sports video. Proc. IEEE ICASSP, vol. 3, 189–192 (2003) M. Xu, L. Duan, C. Xu, Q. Tian.: A fusion scheme of visual and auditory modalities for event detection in sports video. Proc. IEEE ICASSP, vol. 3, 189–192 (2003)
269.
Zurück zum Zitat Q. Ye, Q. Huang, W. Gao, S. Jiang.: Exciting event detection in broadcast soccer video with mid-level description and incremental learning. Proc. ACM MM, 455–458 (2005) Q. Ye, Q. Huang, W. Gao, S. Jiang.: Exciting event detection in broadcast soccer video with mid-level description and incremental learning. Proc. ACM MM, 455–458 (2005)
270.
Zurück zum Zitat L. Li, Y. Chen, W. Hu, W. Li, X. Zhang.: Recognition of Semantic Basketball Events Based on Optical Flow Patterns. Proc. ISVC, 480–488 (2009) L. Li, Y. Chen, W. Hu, W. Li, X. Zhang.: Recognition of Semantic Basketball Events Based on Optical Flow Patterns. Proc. ISVC, 480–488 (2009)
271.
Zurück zum Zitat N. Babaguchi, Y. Kawai, T. Kitahashi.: Event based indexing of broadcasted sports video by intermodal collaboration. IEEE Trans. on Multimedia. vol. 4, no. 1, 68–75 (2002) N. Babaguchi, Y. Kawai, T. Kitahashi.: Event based indexing of broadcasted sports video by intermodal collaboration. IEEE Trans. on Multimedia. vol. 4, no. 1, 68–75 (2002)
272.
Zurück zum Zitat D. Zhang, S. Chang.: Event detection in baseball video using superimposed caption recognition. Proc. ACM MM, 315–318 (2002) D. Zhang, S. Chang.: Event detection in baseball video using superimposed caption recognition. Proc. ACM MM, 315–318 (2002)
273.
Zurück zum Zitat L. Duan, M. Xu, T. Chua, Q. Tian, C. Xu.: A mid-level representation framework for semantic sports video analysis. Proc. ACM MM, 33–44 (2003) L. Duan, M. Xu, T. Chua, Q. Tian, C. Xu.: A mid-level representation framework for semantic sports video analysis. Proc. ACM MM, 33–44 (2003)
274.
Zurück zum Zitat M. Tien, Y. Wang, C. Chou, K. Hsieh, W. Chu, J. Wu.: Event detection in tennis matches based on video data mining. Proc. IEEE ICME, 1477–1480 (2008) M. Tien, Y. Wang, C. Chou, K. Hsieh, W. Chu, J. Wu.: Event detection in tennis matches based on video data mining. Proc. IEEE ICME, 1477–1480 (2008)
275.
Zurück zum Zitat Y. Zhang, C. Xu, Y. Rui, J. Wang, H. Lu.: Semantic event extraction from basketball games using multi-modal analysis. Proc. IEEE ICME, 2190–2193 (2007) Y. Zhang, C. Xu, Y. Rui, J. Wang, H. Lu.: Semantic event extraction from basketball games using multi-modal analysis. Proc. IEEE ICME, 2190–2193 (2007)
276.
Zurück zum Zitat X. Tong, H. Lu, Q. Liu.: A three-layer event detection framework and its application in soccer video. Proc. IEEE ICME, 1551–1554 (2004) X. Tong, H. Lu, Q. Liu.: A three-layer event detection framework and its application in soccer video. Proc. IEEE ICME, 1551–1554 (2004)
277.
Zurück zum Zitat T. Mei and X. Hua.: Structure and event mining in sports video with efficient mosaic. Multimedia Tools and Applications, vol. 40, no. 1, 89–110 (2008) T. Mei and X. Hua.: Structure and event mining in sports video with efficient mosaic. Multimedia Tools and Applications, vol. 40, no. 1, 89–110 (2008)
278.
Zurück zum Zitat T. Wang, J. Li, Q. Diao, W. Hu, Y. Zhang, C. Dulong.: Semantic event detection using conditional random fields. Proc. IEEE CVPRW, 109–114 (2006) T. Wang, J. Li, Q. Diao, W. Hu, Y. Zhang, C. Dulong.: Semantic event detection using conditional random fields. Proc. IEEE CVPRW, 109–114 (2006)
279.
Zurück zum Zitat C. Xu, Y. Zhang, G. Zhu, Y. Rui, H. Lu, Q. Huang.: Using webcast text for semantic event detection in broadcast sports video. IEEE Trans. on Multimedia, vol. 10, no. 7, 1342–1355 (2008) C. Xu, Y. Zhang, G. Zhu, Y. Rui, H. Lu, Q. Huang.: Using webcast text for semantic event detection in broadcast sports video. IEEE Trans. on Multimedia, vol. 10, no. 7, 1342–1355 (2008)
280.
Zurück zum Zitat P. Wang, Z. Liu, S. Yang.: Investigation on unsupervised clustering algorithms for video shot categorization. J. of Soft Computing-A Fusion of Foundations, Methodologies and Applications, vol. 11, no. 4, 355–360 (2007) P. Wang, Z. Liu, S. Yang.: Investigation on unsupervised clustering algorithms for video shot categorization. J. of Soft Computing-A Fusion of Foundations, Methodologies and Applications, vol. 11, no. 4, 355–360 (2007)
281.
Zurück zum Zitat L. Zhong, C. Li, H. Li, Z. Xiong.: Unsupervised Clustering Algorithm for Video Shots Using Spectral Division. Proc. ISVC, 782–792 (2008) L. Zhong, C. Li, H. Li, Z. Xiong.: Unsupervised Clustering Algorithm for Video Shots Using Spectral Division. Proc. ISVC, 782–792 (2008)
282.
Zurück zum Zitat L. Duan, M. Xu, Q. Tian.: Semantic shot classification in sports video. Proc. SPIE, 300–313 (2003) L. Duan, M. Xu, Q. Tian.: Semantic shot classification in sports video. Proc. SPIE, 300–313 (2003)
283.
Zurück zum Zitat X. Tong, Q. Liu, H. Lu, H. Jin.: Shot classification in sports video. Proc. ICSP. vol. 2, 1364–1367 (2004) X. Tong, Q. Liu, H. Lu, H. Jin.: Shot classification in sports video. Proc. ICSP. vol. 2, 1364–1367 (2004)
284.
Zurück zum Zitat J. Wang, E. Chng, C. Xu.: Soccer replay detection using scene transition structure analysis. Proc. IEEE ICASSP, 433–437 (2005) J. Wang, E. Chng, C. Xu.: Soccer replay detection using scene transition structure analysis. Proc. IEEE ICASSP, 433–437 (2005)
285.
Zurück zum Zitat M. Kolekar and K. Palaniappan.: Semantic concept mining based on hierarchical event detection for soccer video indexing. J. of Multimedia, vol. 4, no. 5, 298–312 (2009) M. Kolekar and K. Palaniappan.: Semantic concept mining based on hierarchical event detection for soccer video indexing. J. of Multimedia, vol. 4, no. 5, 298–312 (2009)
286.
Zurück zum Zitat R. Benmokhtar, B. Huet, S. Berrani.: Low-level feature fusion models for soccer scene classification. Proc. IEEE ICME, 1329–1332 (2008) R. Benmokhtar, B. Huet, S. Berrani.: Low-level feature fusion models for soccer scene classification. Proc. IEEE ICME, 1329–1332 (2008)
287.
Zurück zum Zitat T. Hofmann.: Learning the similarity of documents: An information-geometric approach to document retrieval and categorization. NIPS, vol. 12, 914–920 (2000) T. Hofmann.: Learning the similarity of documents: An information-geometric approach to document retrieval and categorization. NIPS, vol. 12, 914–920 (2000)
288.
Zurück zum Zitat T. Hofmann.: Probabilistic latent semantic indexing. Proc. ACM SIGIR, 50–57 (1999) T. Hofmann.: Probabilistic latent semantic indexing. Proc. ACM SIGIR, 50–57 (1999)
289.
Zurück zum Zitat C. Chang and C. Lin.: LIBSVM: a library for support vector machines. (2001) C. Chang and C. Lin.: LIBSVM: a library for support vector machines. (2001)
290.
Zurück zum Zitat G. Miao, G. Zhu, S. Jiang, Q. Huang, C. Xu, W. Gao.: A Real-Time Score Detection and Recognition Approach for Broadcast Basketball Video. Proc. IEEE ICME, 1691–1694 (2007) G. Miao, G. Zhu, S. Jiang, Q. Huang, C. Xu, W. Gao.: A Real-Time Score Detection and Recognition Approach for Broadcast Basketball Video. Proc. IEEE ICME, 1691–1694 (2007)
291.
Zurück zum Zitat J. Dai, L. Duan, X. Tong, C. Xu, Q. Tian, H. Lu, J. Jin.: Replay scene classification in soccer video using web broadcast text. Proc. IEEE ICME, 1098–1101 (2005) J. Dai, L. Duan, X. Tong, C. Xu, Q. Tian, H. Lu, J. Jin.: Replay scene classification in soccer video using web broadcast text. Proc. IEEE ICME, 1098–1101 (2005)
292.
Zurück zum Zitat C. Xu, J. Wang, K. Wan, Y. Li, L. Duan.: Live sports event detection based on broadcast video and web-casting text. Proc. ACM MM, 230–237 (2006) C. Xu, J. Wang, K. Wan, Y. Li, L. Duan.: Live sports event detection based on broadcast video and web-casting text. Proc. ACM MM, 230–237 (2006)
293.
Zurück zum Zitat A. Quattoni, S. Wang, L. Morency, M. Collins, T. Darrell, M. Csail.: Hidden-state conditional random fields. IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 29, no. 10, 1848–1852 (2007) A. Quattoni, S. Wang, L. Morency, M. Collins, T. Darrell, M. Csail.: Hidden-state conditional random fields. IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 29, no. 10, 1848–1852 (2007)
294.
Zurück zum Zitat S. Wang, A. Quattoni, L. Morency, D. Demirdjian, T. Darrell.: Hidden conditional random fields for gesture recognition. Proc. IEEE CVPR, 1521–1527 (2006) S. Wang, A. Quattoni, L. Morency, D. Demirdjian, T. Darrell.: Hidden conditional random fields for gesture recognition. Proc. IEEE CVPR, 1521–1527 (2006)
295.
Zurück zum Zitat A. Gunawardana, M. Mahajan, A. Acero, J. Platt.: Hidden conditional random fields for phone classification. Proc. Interspeech, 1117–1120 (2005) A. Gunawardana, M. Mahajan, A. Acero, J. Platt.: Hidden conditional random fields for phone classification. Proc. Interspeech, 1117–1120 (2005)
296.
Zurück zum Zitat Y. Tan, D. Saur, S. Kulkarni, P. Ramadge.: Rapid estimation of camera motion from compressed video with application to video annotation. IEEE Trans. on circuits and systems for video technology. vol. 10, no. 1, 133–146 (2000) Y. Tan, D. Saur, S. Kulkarni, P. Ramadge.: Rapid estimation of camera motion from compressed video with application to video annotation. IEEE Trans. on circuits and systems for video technology. vol. 10, no. 1, 133–146 (2000)
297.
Zurück zum Zitat L. Morency, A. Quattoni, C. Christoudias, S. Wang.: Hidden-state Conditional Random Field Library. (2008) L. Morency, A. Quattoni, C. Christoudias, S. Wang.: Hidden-state Conditional Random Field Library. (2008)
298.
Zurück zum Zitat F. Sha and F. Pereira.: Shallow parsing with conditional random fields. in Proc. of HLT-NAACL, 213–220 (2003) F. Sha and F. Pereira.: Shallow parsing with conditional random fields. in Proc. of HLT-NAACL, 213–220 (2003)
299.
Zurück zum Zitat J. Lafferty, A. McCallum, F. Pereira.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. in Proc. ICML, 282–289 (2001) J. Lafferty, A. McCallum, F. Pereira.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. in Proc. ICML, 282–289 (2001)
300.
Zurück zum Zitat Y. Rubner, C. Tomasi, L. Guibas.: The earth mover’s distance as a metric for image retrieval. Inter. J. of Computer Vision, vol. 40, no. 2, 99–121 (2000) Y. Rubner, C. Tomasi, L. Guibas.: The earth mover’s distance as a metric for image retrieval. Inter. J. of Computer Vision, vol. 40, no. 2, 99–121 (2000)
301.
Zurück zum Zitat R. Duda, P. Hart, D. Stork.: Pattern classification. Wiley-Interscience. (2001) R. Duda, P. Hart, D. Stork.: Pattern classification. Wiley-Interscience. (2001)
302.
Zurück zum Zitat A. Jain, M. Murty, P. Flynn.: Data clustering: a review. ACM computing surveys, vol. 31, no. 3, 264–323 (1999) A. Jain, M. Murty, P. Flynn.: Data clustering: a review. ACM computing surveys, vol. 31, no. 3, 264–323 (1999)
303.
Zurück zum Zitat H. Bay, T. Tuytelaars, L. Van Gool.: Surf: Speeded up robust features. Lecture notes in computer science, vol. 3951, 404–411 (2006) H. Bay, T. Tuytelaars, L. Van Gool.: Surf: Speeded up robust features. Lecture notes in computer science, vol. 3951, 404–411 (2006)
Metadaten
Titel
Scalable Video Genre Classification and Event Detection
verfasst von
Paisarn Muneesawang
Ning Zhang
Ling Guan
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-11782-9_9

Neuer Inhalt