Published in: Pattern Analysis and Applications 1/2013

1 February 2013 | Short Paper

Transductive multi-distance learning for video search

Authors: Songhao Zhu, Zhiwei Liang, Yuncai Liu

Abstract

Graph-based semi-supervised learning approaches have proven effective and efficient at addressing the insufficiency of labeled training data in many real-world applications, such as video annotation. However, the pair-wise similarity metric between samples, a key component of these algorithms, has not been fully investigated. Specifically, existing approaches estimate the pair-wise similarity between two samples from the spatial property of video data. The temporal property, an essential characteristic of video data, is not embedded in the pair-wise similarity measure. Accordingly, this paper proposes a novel framework for video annotation called Joint Spatio-Temporal Correlation Learning (JSTCL). The framework takes both the spatial and temporal properties of video data into account simultaneously to improve the estimation of pair-wise similarity. We apply the proposed framework to video annotation and report superior performance compared with key existing approaches on the benchmark TRECVID data set.
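
The abstract does not state how JSTCL combines the two properties, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a hypothetical similarity that multiplies a Gaussian kernel on feature distance (spatial) by a Gaussian kernel on the shot-index gap (temporal), and then runs a generic graph-based label propagation of the form F <- alpha * S * F + (1 - alpha) * Y. The function names, the kernel form, and the parameters `sigma_s`, `sigma_t`, and `alpha` are all assumptions made for illustration.

```python
import math


def combined_similarity(features, times, sigma_s=1.0, sigma_t=2.0):
    """Hypothetical pair-wise similarity (an assumption, not taken from the paper):
    Gaussian kernel on feature distance times Gaussian kernel on temporal gap."""
    n = len(features)
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue  # no self-loops in the graph
            d2 = sum((a - b) ** 2 for a, b in zip(features[i], features[j]))
            dt2 = (times[i] - times[j]) ** 2
            spatial = math.exp(-d2 / (2.0 * sigma_s ** 2))
            temporal = math.exp(-dt2 / (2.0 * sigma_t ** 2))
            W[i][j] = spatial * temporal
    return W


def propagate(W, labels, alpha=0.9, iters=200):
    """Generic label propagation on a graph with affinity matrix W.
    labels: +1 or -1 for labeled samples, 0 for unlabeled ones."""
    n = len(W)
    deg = [sum(row) for row in W]
    # Symmetric normalization S = D^(-1/2) W D^(-1/2); isolated nodes get zero rows.
    S = [[W[i][j] / math.sqrt(deg[i] * deg[j]) if deg[i] > 0 and deg[j] > 0 else 0.0
          for j in range(n)] for i in range(n)]
    Y = [float(y) for y in labels]
    F = list(Y)
    for _ in range(iters):
        F = [alpha * sum(S[i][j] * F[j] for j in range(n)) + (1.0 - alpha) * Y[i]
             for i in range(n)]
    return F


# Toy data: six consecutive shots with 1-D features; only the first and last are labeled.
features = [[0.0], [0.2], [0.1], [5.0], [5.1], [4.9]]
times = [0, 1, 2, 3, 4, 5]
labels = [1, 0, 0, 0, 0, -1]

W = combined_similarity(features, times)
scores = propagate(W, labels)
predicted = [1 if s > 0 else -1 for s in scores]
print(predicted)
```

In this toy setup the unlabeled shots inherit the label of the labeled shot that is close to them in both feature space and time. A product kernel is one of several plausible ways to fuse the two cues; a weighted sum of the two kernels would be an equally reasonable assumption.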

Metadata
Title
Transductive multi-distance learning for video search
Authors
Songhao Zhu
Zhiwei Liang
Yuncai Liu
Publication date
1 February 2013
Publisher
Springer-Verlag
Published in
Pattern Analysis and Applications / Issue 1/2013
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-010-0196-4
