skip to main content
10.1145/1291233.1291430acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Structure-sensitive manifold ranking for video concept detection

Authors Info & Claims
Published:29 September 2007Publication History

ABSTRACT

Pairwise similarity of samples is an essential factor in graph propagation based semi-supervised learning methods. Usually it is estimated based on Euclidean distance. However, the structural assumption, which is a basic assumption in these methods, has not been taken into consideration in the normal pairwise similarity measure. In this paper, we propose a novel graph-based learning approach, named Structure-Sensitive Manifold Ranking (SSMR),based on a structure-sensitive similarity measure. Instead of using distance only, SSMR takes local distribution differences into account to more accurately measure pairwise similarity. Furthermore, we show that SSMR can also be deduced from a partial differential equation based anisotropic diffusion. Experiments conducted on the TRECVID dataset show that this approach significantly outperforms existing graph-based semi-supervised learning methods for video semantic concept detection.

References

  1. Guidelines for the trecvid 2005 evaluation. http://www-nlpir.nist.gov/projects/tv2005/tv2005.html.Google ScholarGoogle Scholar
  2. Trec-10 proceedings appendix on common evaluation measures. http://trec.nist.gov/pubs/trec10/appendices/measures.pdf.Google ScholarGoogle Scholar
  3. M. Belkin, P. Niyogi, and V. Sindhwani. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research, (7):2399--2434, Nov. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Blum and S. Chawla. Learning from labeled and unlabeled data using graph min-cuts. In Proc. 18-th International Conference on Machine Learning, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In Workshop on Computational Learning Theory, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. O. Bousquet, O. Chapelle, and M. Hein. Measure based regularization. In Proc. 17-th Annual Conference on Neural Information Processing Systems, 2003.Google ScholarGoogle Scholar
  7. O. Chapelle, A. Zien, and B. Scholkopf. Semi-supervised Learning. MIT Press, 2006.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. F. Chung. Spectral Graph Theory. American Mathematical Society, 1997.Google ScholarGoogle Scholar
  9. R. Duda, D. Stork, and P. Hart. Pattern Classification. JOHN WILEY, 2nd edition, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Feng, R. Manmatha, and V. Lavrenko. Multiple bernoulli relevance models for image and video annotation. In IEEE Conference on Computer Vision and Pattern Recognition, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Ghoshal, P. Arcing, and S. Khudanpur. Hidden markov models for automatic annotation and content-based retrieval of images and video. In ACM Conference on Research & Development on Information Retrieval, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. He, M. Li, H.-J. Zhang, H. Tong, and C. Zhang. Manifold-ranking based image retrieval. In ACM Multimedia, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. He, M. Li, H.-J. Zhang, H. Tong, and C. Zhang. Generalized manifold-ranking based image retrieval. IEEE Transaction on Image Processing, 15(10), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. A. Horn and C. R. Johnson. Matrix Analysis. Cambridge University Press (Reprint Edition), 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. L. Lavrenko, S. Feng, and R. Manmatha. Statistical models for automatic video annotation and retrieval. In IEEE International Conference on Acoustics, Speech and Signal Processing, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  16. P. Over, T. Ianeva, W. Kraaij, and A. F. Smeaton. Trecvid 2005 - an overview. In TREC Video Retrieval Evaluation Online Proceedings. NIST, 2005.Google ScholarGoogle Scholar
  17. P. Perona and J. Malik. Scale-space and edge detection using anisotropic diffusion. IEEE Transaction on Pattern Analysis and Machine Intelligence, 12(7), 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. R. Rahmani and S. A. Goldman. Missl: Multiple-instance semi-supervised learning. In Proc. 23rd International Conference on Machine Learning, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. C. Rosenberg, M. Heberg, and H. Schneiderman. Semi-supervised self-training of object detection models. In 7-th IEEE Workshop on Applications of Computer Vision, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. G. Sapiro. Geometric Partial Differential Equation and Image Analysis. Cambridge University Press, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. Seeger. Learning with labeled and unlabeled data. Technical Report, Edinburgh University, 2001.Google ScholarGoogle Scholar
  22. Y. Song, X.-S. Hua, L. Dai, and M. Wang. Semi-automatic video annotation based on active learning with multiple complementary predictors. In ACM International Workshop on Multimedia Information Retrieval, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Tang, X.-S. Hua, T. Mei, G.-J. Qi, and X. Wu. Video annotation based on temporally consistent gaussian random field. Electronics Letters, 43(8), 2007.Google ScholarGoogle ScholarCross RefCross Ref
  24. J. Tang, X.-S. Hua, G.-J. Qi, T. Mei, and X. Wu. Anisotropic manifold ranking for video annotation. In Proc. of the IEEE International Conference on Multimedia & Expo, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  25. H. Tong, J. He, M. Li, C. Zhang, and W. Ma. Graph based multi-modality learning. In Proc. ACM Multimedia, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. C. Wang, F. Jing, L. Zhang, and H.-J. Zhang. Image annotation refinement using random walk with restarts. In Proc. ACM Multimedia, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. Wang, X.-S. Hua, Y. Song, X. Yuan, S. Li, and H.-J. Zhang. Automatic video annotation by semi-supervised learning with kernel density estimation. In Proc. ACM Multimedia, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. M. Wang, T. Mei, X. Yuan, Y. Song, and L. Dai. Video annotation by graph-based learning with neighborhood similarity. In Proc. ACM Multimedia, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. R. Yan and M. Naphade. Semi-supervised cross feature learning for semantic concept detection in videos. In IEEE Conference on Computer Vision and Pattern Recognition, July 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. X. Yuan, X.-S. Hua, M. Wang, and X. Wu. Manifold-ranking based video concept detection on large database and feature pool. In Proc. ACM Multimedia, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. T. Zhang and F. Oles. A probability analysis on the value of unlabeled data for classification problems. In Proc. 17-th International Conference on Machine Learning, 2000.Google ScholarGoogle Scholar
  32. D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Scholkopf. Learning with local and global consistency. In Proc. 17-th Annual Conference on Neural Information Processing Systems, 2003.Google ScholarGoogle Scholar
  33. X. Zhu. Semi-supervised Learning with Graphs. PhD Thesis, CMU-LTI-05-192, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. X. Zhu, Z. Ghahramani, and J. Lafferty. Semi-supervised learning using gaussian fields and harmonic function. In Proc. 20-th International Conference on Machine Learning, 2003.Google ScholarGoogle Scholar

Index Terms

  1. Structure-sensitive manifold ranking for video concept detection

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          MM '07: Proceedings of the 15th ACM international conference on Multimedia
          September 2007
          1115 pages
          ISBN:9781595937025
          DOI:10.1145/1291233

          Copyright © 2007 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 29 September 2007

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate995of4,171submissions,24%

          Upcoming Conference

          MM '24
          MM '24: The 32nd ACM International Conference on Multimedia
          October 28 - November 1, 2024
          Melbourne , VIC , Australia

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader