
International Journal of Multimedia Information Retrieval


1 June 2013 | Regular Paper

Content analysis meets viewers: linking concept detection with demographics on YouTube

Authors: Adrian Ulges, Damian Borth, Markus Koch

Published in: International Journal of Multimedia Information Retrieval | Issue 2/2013


Abstract

Social image and video sharing provides an opportunity for user-centric, behavioral auto-understanding of image and video content. We add demographic aspects to this puzzle, i.e., the popularity of content across different ages and genders: using user comments, we calculate demographic viewership profiles for YouTube clips and provide evidence that these profiles are strongly correlated with the semantic concepts appearing in a video. Based on this finding, we outline two approaches that combine video content analysis with demographic aspects. First, we show that concept detection can be used to establish a mapping from content via concepts to viewer demographics (which we refer to as content-based demographics prediction). Second, when sufficient view statistics already give an estimate of a clip’s audience, they can be used as a demographic signal to disambiguate concept detection in cases of visually similar concepts. We validate these statements on a dataset of 14,000 YouTube clips covering 105 concepts and commented on by 1 million users: content-based demographics prediction is shown to provide an accuracy comparable to that of other information sources (such as a video’s tags or uploader data). Also, demographic signals can improve the accuracy of concept detection significantly (by 47% compared to a content-only approach).
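
The two approaches named in the abstract can be illustrated with a small sketch. This is not the authors' model, which the abstract does not specify. It assumes that each concept has a viewership profile P(group | concept), that demographics prediction is a mixture of those profiles weighted by detector scores, and that disambiguation is a naive Bayes fusion of detector scores with observed viewer counts. The group names, the concept names `concept_a` and `concept_b`, and all numbers are hypothetical.

```python
import math

# Hypothetical demographic groups (gender x coarse age band); not from the paper.
GROUPS = ["female_young", "female_older", "male_young", "male_older"]

# Assumed viewership profiles P(group | concept); each row sums to 1.
PROFILES = {
    "concept_a": [0.40, 0.30, 0.20, 0.10],
    "concept_b": [0.10, 0.10, 0.50, 0.30],
}


def normalize(values):
    """Scale non-negative values so that they sum to 1."""
    total = sum(values)
    return [v / total for v in values]


def predict_demographics(concept_scores):
    """Content -> concepts -> demographics.

    Mixes the per-concept profiles, weighted by normalized detector scores:
    P(group | clip) = sum_c P(c | clip) * P(group | c).
    """
    concepts = list(concept_scores)
    weights = normalize([concept_scores[c] for c in concepts])
    return [
        sum(w * PROFILES[c][g] for w, c in zip(weights, concepts))
        for g in range(len(GROUPS))
    ]


def rerank_concepts(concept_scores, viewer_counts):
    """Demographic signal -> concept disambiguation (naive Bayes assumption).

    Scores each concept by log P(c | clip) + sum_g n_g * log P(g | c),
    where n_g is the number of observed viewers in group g, then
    normalizes the result to a posterior over the candidate concepts.
    """
    log_post = {}
    for concept, score in concept_scores.items():
        log_lik = sum(
            n * math.log(p) for n, p in zip(viewer_counts, PROFILES[concept])
        )
        log_post[concept] = math.log(score) + log_lik
    peak = max(log_post.values())  # subtract the maximum for numerical stability
    unnorm = {c: math.exp(v - peak) for c, v in log_post.items()}
    total = sum(unnorm.values())
    return {c: v / total for c, v in unnorm.items()}


# Two visually similar concepts: the detector alone slightly prefers concept_a.
scores = {"concept_a": 0.55, "concept_b": 0.45}
profile = predict_demographics(scores)

# Ten observed viewers, mostly male: the fused posterior prefers concept_b.
fused = rerank_concepts(scores, viewer_counts=[1, 0, 6, 3])
```

In this toy setting the content-only ranking and the fused ranking disagree, which is the kind of case the abstract describes for visually similar concepts. The independence assumption between visual evidence and viewer demographics given the concept is a simplification made for the sketch.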


For the full list of concepts, please refer to http://madm.dfki.de/demo/tubetagger.

See http://www.youtube.com/t/advertising_video_targeting for YouTube’s targeting tool.


Title: Content analysis meets viewers: linking concept detection with demographics on YouTube
Authors: Adrian Ulges, Damian Borth, Markus Koch
Publication date: 1 June 2013
Publisher: Springer-Verlag
Published in: International Journal of Multimedia Information Retrieval / Issue 2/2013
Print ISSN: 2192-6611
Electronic ISSN: 2192-662X
DOI: https://doi.org/10.1007/s13735-012-0029-x

