Skip to main content

2016 | OriginalPaper | Buchkapitel

TV Commercial Detection Using Success Based Locally Weighted Kernel Combination

verfasst von : Raghvendra Kannao, Prithwijit Guha

Erschienen in: MultiMedia Modeling

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Classification problems using multiple kernel learning (MKL) algorithms achieve superior performance on account of using a weighted combination of base kernels on feature sub-sets. Each of the base kernels are characterized by the similarity measures defined over the feature sub-sets. Existing works in MKL have mostly used fixed weights which are shown to be related to the overall discriminative capability of corresponding base kernels. We argue that this class discrimination ability of a kernel is a local phenomenon and thus, advocate the necessity of using instance dependent functions for weighing the kernels. We propose a new framework for learning such weighing functions linked to ability of kernels to discriminate in the local regions of the feature space. During training, we first identify the regions of success in the feature sub-spaces, where the base kernels have high likelihood of success. These regions are identified by evaluating the performance of support vector machines (SVM) trained using corresponding (single) base kernels. The weighing functions are then estimated by using support vector regression (SVR). The target for SVRs is set to 1.0 for the successfully classified patterns and to 0.0, otherwise. The second contribution of this work is the construction and public domain release of a commercial detection dataset of 150 hours, acquired from 5 different TV news channels. Empirical results on 8 standard datasets and our own TV commercial detection dataset have shown the superiority of the proposed scheme of multiple kernel learning.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Atrey, P.K., Hossain, M.A., El, S.A., Kankanhalli, M.S.: Multimodal fusion for multimedia analysis: a survey. Multimedia Syst. 16(6), 345–379 (2010)CrossRef Atrey, P.K., Hossain, M.A., El, S.A., Kankanhalli, M.S.: Multimodal fusion for multimedia analysis: a survey. Multimedia Syst. 16(6), 345–379 (2010)CrossRef
2.
Zurück zum Zitat Ben-Hur, A., Noble, W.S.: Kernel methods for predicting protein-protein interactions. Bioinformatics 21(1), i38–i46 (2005)CrossRef Ben-Hur, A., Noble, W.S.: Kernel methods for predicting protein-protein interactions. Bioinformatics 21(1), i38–i46 (2005)CrossRef
3.
Zurück zum Zitat Duygulu, P., yu Chen, M., Hauptmann, A.: Comparison and combination of two novel commercial detection methods. In: International Conference on Multimedia and Expo, vol. 2, pp. 1267–1270 (2004) Duygulu, P., yu Chen, M., Hauptmann, A.: Comparison and combination of two novel commercial detection methods. In: International Conference on Multimedia and Expo, vol. 2, pp. 1267–1270 (2004)
4.
Zurück zum Zitat Gonen, M., Alpaydin, E.: Multiple kernel learning algorithms. J. Mach. Learn. Res. 12, 2211–2268 (2011)MathSciNet Gonen, M., Alpaydin, E.: Multiple kernel learning algorithms. J. Mach. Learn. Res. 12, 2211–2268 (2011)MathSciNet
5.
Zurück zum Zitat Gonen, M., Alpaydin, E.: Localized algorithms for multiple kernel learning. Pattern Recogn. 46, 795–807 (2013)CrossRef Gonen, M., Alpaydin, E.: Localized algorithms for multiple kernel learning. Pattern Recogn. 46, 795–807 (2013)CrossRef
6.
Zurück zum Zitat Hua, X.S., Lu, L., Zhang, H.J.: Robust learning-based TV commercial detection. In: International Conference on Multimedia and Expo (2005) Hua, X.S., Lu, L., Zhang, H.J.: Robust learning-based TV commercial detection. In: International Conference on Multimedia and Expo (2005)
7.
Zurück zum Zitat Jawanpuria, P., Varma, M., Nath, J.S.: On p-norm path following in multiple kernel learning for non-linear feature selection. In: International Conference on Machine Learning, June 2014 Jawanpuria, P., Varma, M., Nath, J.S.: On p-norm path following in multiple kernel learning for non-linear feature selection. In: International Conference on Machine Learning, June 2014
8.
Zurück zum Zitat Jo, T., Japkowicz, N.: Class imbalances versus small disjuncts. SIGKDD Explor. Newslett. 6(1), 40–49 (2004)MathSciNetCrossRef Jo, T., Japkowicz, N.: Class imbalances versus small disjuncts. SIGKDD Explor. Newslett. 6(1), 40–49 (2004)MathSciNetCrossRef
11.
Zurück zum Zitat Liu, N., Zhao, Y., Zhu, Z., Lu, H.: Exploiting visual-audio-textual characteristics for automatic tv commercial block detection and segmentation. IEEE Trans. Multimedia 13(5), 961–973 (2011)CrossRef Liu, N., Zhao, Y., Zhu, Z., Lu, H.: Exploiting visual-audio-textual characteristics for automatic tv commercial block detection and segmentation. IEEE Trans. Multimedia 13(5), 961–973 (2011)CrossRef
12.
Zurück zum Zitat Mühling, M., Ewerth, R., Zhou, J., Freisleben, B.: Multimodal Video Concept Detection via Bag of Auditory Words and Multiple Kernel Learning. In: Schoeffmann, K., Merialdo, B., Hauptmann, A.G., Ngo, C.-W., Andreopoulos, Y., Breiteneder, C. (eds.) MMM 2012. LNCS, vol. 7131, pp. 40–50. Springer, Heidelberg (2012) CrossRef Mühling, M., Ewerth, R., Zhou, J., Freisleben, B.: Multimodal Video Concept Detection via Bag of Auditory Words and Multiple Kernel Learning. In: Schoeffmann, K., Merialdo, B., Hauptmann, A.G., Ngo, C.-W., Andreopoulos, Y., Breiteneder, C. (eds.) MMM 2012. LNCS, vol. 7131, pp. 40–50. Springer, Heidelberg (2012) CrossRef
13.
Zurück zum Zitat Meng, L., Cai, Y., Wang, M., Li, Y.: Tv commercial detection based on shot change and text extraction. In: International Congress on Image and Signal Processing, pp. 1–5 (2009) Meng, L., Cai, Y., Wang, M., Li, Y.: Tv commercial detection based on shot change and text extraction. In: International Congress on Image and Signal Processing, pp. 1–5 (2009)
14.
Zurück zum Zitat Moguerza, J.M., Muñoz, A., de Diego, I.M.: Improving support vector classification via the combination of multiple sources of information. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A.C., de Ridder, D. (eds.) SSPR 2004 and SPR 2004. LNCS, pp. 592–600. Springer, Heidelberg (2004) CrossRef Moguerza, J.M., Muñoz, A., de Diego, I.M.: Improving support vector classification via the combination of multiple sources of information. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A.C., de Ridder, D. (eds.) SSPR 2004 and SPR 2004. LNCS, pp. 592–600. Springer, Heidelberg (2004) CrossRef
15.
Zurück zum Zitat Natarajan, P., Wu, S., Vitaladevuni, S., Zhuang, X., Tsakalidis, S., Park, U., Prasad, R., Natarajan, P.: Multimodal feature fusion for robust event detection in web videos. In: Computer Vision and Pattern Recognition, pp. 1298–1305. IEEE (2012) Natarajan, P., Wu, S., Vitaladevuni, S., Zhuang, X., Tsakalidis, S., Park, U., Prasad, R., Natarajan, P.: Multimodal feature fusion for robust event detection in web videos. In: Computer Vision and Pattern Recognition, pp. 1298–1305. IEEE (2012)
16.
Zurück zum Zitat Rokach, L.: Ensemble-based classifiers. Artif. Intell. Rev. 33(1–2), 1–39 (2010)CrossRef Rokach, L.: Ensemble-based classifiers. Artif. Intell. Rev. 33(1–2), 1–39 (2010)CrossRef
17.
Zurück zum Zitat Shawe-Taylor, N., Kandola, A.: On kernel target alignment. Adv. Neural Inf. Process. Syst. 14, 367 (2002) Shawe-Taylor, N., Kandola, A.: On kernel target alignment. Adv. Neural Inf. Process. Syst. 14, 367 (2002)
18.
Zurück zum Zitat Sonnenburg, S., Ratsch, G., Henschel, S., Widmer, C., Behr, J., Zien, A., de Bona, F., Binder, A., Gehl, C.: The shogun machine learning toolbox. J. Mach. Learn. Res. 11, 1799–1802 (2010)MATH Sonnenburg, S., Ratsch, G., Henschel, S., Widmer, C., Behr, J., Zien, A., de Bona, F., Binder, A., Gehl, C.: The shogun machine learning toolbox. J. Mach. Learn. Res. 11, 1799–1802 (2010)MATH
19.
Zurück zum Zitat Sonnenburg, S., Ratsch, G., Schafer, C., Scholkopf, B.: Large scale multiple kernel learning. J. Mach. Learn. Res. 7, 1531–1565 (2006)MATHMathSciNet Sonnenburg, S., Ratsch, G., Schafer, C., Scholkopf, B.: Large scale multiple kernel learning. J. Mach. Learn. Res. 7, 1531–1565 (2006)MATHMathSciNet
20.
Zurück zum Zitat Tanabe, H., Ho, T.B., Nguyen, C.H., Kawasaki, S.: Simple but effective methods for combining kernels in computational biology. In: RIVF, pp. 71–78. IEEE (2008) Tanabe, H., Ho, T.B., Nguyen, C.H., Kawasaki, S.: Simple but effective methods for combining kernels in computational biology. In: RIVF, pp. 71–78. IEEE (2008)
21.
Zurück zum Zitat Vahdat, A., Cannons, K., Mori, G., Oh, S., Kim, I.: Compositional models for video event detection: a multiple kernel learning latent variable approach. In: International Conference on Computer Vision, pp. 1185–1192. IEEE (2013) Vahdat, A., Cannons, K., Mori, G., Oh, S., Kim, I.: Compositional models for video event detection: a multiple kernel learning latent variable approach. In: International Conference on Computer Vision, pp. 1185–1192. IEEE (2013)
22.
Zurück zum Zitat Wu, X., Satoh, S.: Ultrahigh-speed tv commercial detection, extraction, and matching. IEEE Trans. Circ. Syst. Video Technol. 23(6), 1054–1069 (2013)CrossRef Wu, X., Satoh, S.: Ultrahigh-speed tv commercial detection, extraction, and matching. IEEE Trans. Circ. Syst. Video Technol. 23(6), 1054–1069 (2013)CrossRef
23.
Zurück zum Zitat Wang, X., Guo, Z.: A novel real-time commercial detection scheme. In: International Conference on Innovative Computing Information and Control, pp. 536–536 (2008) Wang, X., Guo, Z.: A novel real-time commercial detection scheme. In: International Conference on Innovative Computing Information and Control, pp. 536–536 (2008)
24.
Zurück zum Zitat Zhang, L., Zhu, Z., Zhao, Y.: Robust commercial detection system. In: International Conference on Multimedia and Expo, pp. 587–590 (2007) Zhang, L., Zhu, Z., Zhao, Y.: Robust commercial detection system. In: International Conference on Multimedia and Expo, pp. 587–590 (2007)
Metadaten
Titel
TV Commercial Detection Using Success Based Locally Weighted Kernel Combination
verfasst von
Raghvendra Kannao
Prithwijit Guha
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-27671-7_66

Neuer Inhalt