Skip to main content
Erschienen in: International Journal of Multimedia Information Retrieval 2/2013

01.06.2013 | Regular Paper

Content analysis meets viewers: linking concept detection with demographics on YouTube

verfasst von: Adrian Ulges, Damian Borth, Markus Koch

Erschienen in: International Journal of Multimedia Information Retrieval | Ausgabe 2/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Social image and video sharing provides the opportunity for a user-centric, behavioral auto-understanding of image and video content. We add demographic aspects to this puzzle, i.e. the popularity of content across different ages and genders: employing user comments, we calculate demographic viewership profiles for YouTube clips and provide evidence that these profiles are strongly correlated with semantic concepts appearing in a video. Based on this fact, we outline two approaches that combine video content analysis with demographic aspects: first, we show that concept detection can be used to establish a mapping from content via concepts to viewer demographics (which we refer to as content-based demographics prediction). Second, in case sufficient view statistics already give an estimate of a clip’s audience, they can be used as a demographic signal to disambiguate concept detection in cases of visually similar concepts. We validate the above statements on a dataset of 14,000 YouTube clips covering 105 concepts and commented by 1 mio. users: content-based demographics prediction is shown to provide an accuracy comparable to other information sources (such as a video’s tags or uploader data). Also, demographic signals can improve the accuracy of concept detection significantly (by 47 % compared to a content-only approach).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
For the full list of concepts, please refer to http://​madm.​dfki.​de/​demo/​tubetagger.
 
Literatur
1.
Zurück zum Zitat Atrey P, Hossain M, El Saddik A, Kankanhalli M (2010) Multimodal fusion for multimedia analysis: a survey. Multimed Syst 16(6):345–379CrossRef Atrey P, Hossain M, El Saddik A, Kankanhalli M (2010) Multimodal fusion for multimedia analysis: a survey. Multimed Syst 16(6):345–379CrossRef
2.
Zurück zum Zitat Bozios T, Lekakos G, Skoularidou V (2001) Advanced techniques for personalized advertising in a digital TV environment: the IMedia system. In: Proceedings of the eBusiness and eWork conference Bozios T, Lekakos G, Skoularidou V (2001) Advanced techniques for personalized advertising in a digital TV environment: the IMedia system. In: Proceedings of the eBusiness and eWork conference
3.
Zurück zum Zitat Chen Y, Pavlov D, Canny JF (2009) Behavioral targeting: The art of scaling up simple algorithms. ACM Trans. Knowl. Discov. Data 4(4):17:1–17:31. doi:10.1145/1857947.1857949 Chen Y, Pavlov D, Canny JF (2009) Behavioral targeting: The art of scaling up simple algorithms. ACM Trans. Knowl. Discov. Data 4(4):17:1–17:31. doi:10.​1145/​1857947.​1857949
6.
Zurück zum Zitat Everingham M, Van Gool L, Williams C, Winn J, Zisserman A (2010) The Pascal Visual Object Classes (VOC) Challenge. Int. Journal of Computer Vision 88(2):303–338CrossRef Everingham M, Van Gool L, Williams C, Winn J, Zisserman A (2010) The Pascal Visual Object Classes (VOC) Challenge. Int. Journal of Computer Vision 88(2):303–338CrossRef
7.
Zurück zum Zitat Hofmann T (2001) Unsupervised Learning by Probabilistic Latent Semantic Analysis. Machine Learning 42:177–196MATHCrossRef Hofmann T (2001) Unsupervised Learning by Probabilistic Latent Semantic Analysis. Machine Learning 42:177–196MATHCrossRef
8.
Zurück zum Zitat Hollis (2005) Ten years of learning on how online advertising builds brands. Advert Res 45:255–268 Hollis (2005) Ten years of learning on how online advertising builds brands. Advert Res 45:255–268
9.
Zurück zum Zitat Hrishikesh A, Toderici G, Yagnik J (2009) Video2Text: learning to annotate video content. In: Proceedings of the workshop on internet multimedia, mining Hrishikesh A, Toderici G, Yagnik J (2009) Video2Text: learning to annotate video content. In: Proceedings of the workshop on internet multimedia, mining
10.
Zurück zum Zitat Hu J, Zeng H-J, Li H, Niu C, Chen Z (2007) Demographic prediction based on user’s browsing behavior. In: Proceeidngs of WWW, pp 151–160 Hu J, Zeng H-J, Li H, Niu C, Chen Z (2007) Demographic prediction based on user’s browsing behavior. In: Proceeidngs of WWW, pp 151–160
11.
Zurück zum Zitat Huurnink B, Snoek C, de Rijke M, Smeulders A (2012) Content-Based Analysis Improves Audiovisual Archive Retrieval. Multimedia, IEEE Transactions on 14(4):1166–1178CrossRef Huurnink B, Snoek C, de Rijke M, Smeulders A (2012) Content-Based Analysis Improves Audiovisual Archive Retrieval. Multimedia, IEEE Transactions on 14(4):1166–1178CrossRef
12.
Zurück zum Zitat Jansen BJ, Mullen T (2008) Sponsored Search: An Overview of the Concept, History, and Technology. IJEB 6(2):114–131CrossRef Jansen BJ, Mullen T (2008) Sponsored Search: An Overview of the Concept, History, and Technology. IJEB 6(2):114–131CrossRef
13.
Zurück zum Zitat Kabbur S, Han E-H, Karypis G (2010) Content-based methods for predicting web-site demographic attributes. In: Proceedings of ICDM, pp 863–868 Kabbur S, Han E-H, Karypis G (2010) Content-based methods for predicting web-site demographic attributes. In: Proceedings of ICDM, pp 863–868
14.
Zurück zum Zitat Kennedy L, Chang S-F, Kozintsev I (2006) To search or to label?: predicting the performance of search-based automatic image classifiers. In: Workshop multimedia, information retrieval Kennedy L, Chang S-F, Kozintsev I (2006) To search or to label?: predicting the performance of search-based automatic image classifiers. In: Workshop multimedia, information retrieval
15.
Zurück zum Zitat Koppel M, Argamon S, Shimoni AR (2002) Automatically categorizing written texts by author gender. Lit Linguist Comput 17(4):401–412CrossRef Koppel M, Argamon S, Shimoni AR (2002) Automatically categorizing written texts by author gender. Lit Linguist Comput 17(4):401–412CrossRef
16.
Zurück zum Zitat Li X, Snoek C, Worring M (2008) Learning tag relevance by neighbor voting for social image retrieval. In: Proceedings of MIR, pp 180–187 Li X, Snoek C, Worring M (2008) Learning tag relevance by neighbor voting for social image retrieval. In: Proceedings of MIR, pp 180–187
17.
Zurück zum Zitat Lin H-T, Lin C-J, Weng R (2007) A Note on Platt’s Probabilistic Outputs for Support Vector Machines. Mach. Learn. 68(3): 267–276CrossRef Lin H-T, Lin C-J, Weng R (2007) A Note on Platt’s Probabilistic Outputs for Support Vector Machines. Mach. Learn. 68(3): 267–276CrossRef
18.
Zurück zum Zitat Lowe D (2004) Distinctive Image Features from Scale-Invariant Keypoints. Int. J. Comput. Vis. 60(2):91–110CrossRef Lowe D (2004) Distinctive Image Features from Scale-Invariant Keypoints. Int. J. Comput. Vis. 60(2):91–110CrossRef
19.
Zurück zum Zitat Mei T, Hua X-S, Li S (2008) Contextual in-image advertising. In: Proceedings of ACM multimedia, pp 439–448 Mei T, Hua X-S, Li S (2008) Contextual in-image advertising. In: Proceedings of ACM multimedia, pp 439–448
20.
Zurück zum Zitat Mei T, Hua X-S, Li S (2009) VideoSense: A Contextual In-video Advertising System. IEEE Trans. Cir. and Sys. for Video Technol 19:1866–1879CrossRef Mei T, Hua X-S, Li S (2009) VideoSense: A Contextual In-video Advertising System. IEEE Trans. Cir. and Sys. for Video Technol 19:1866–1879CrossRef
21.
Zurück zum Zitat Naphade M, Smith J, Tesic J, Chang S, Hsu W, Kennedy L, Hauptmann A, Curtis J (2006) Large-scale concept ontology for multimedia. IEEE MultiMed 13(3):86–91CrossRef Naphade M, Smith J, Tesic J, Chang S, Hsu W, Kennedy L, Hauptmann A, Curtis J (2006) Large-scale concept ontology for multimedia. IEEE MultiMed 13(3):86–91CrossRef
22.
Zurück zum Zitat Over P, Awad G, Fiscus J, Antonishek B, Smeaton AF, Kraaij W, Quenot G (2010) TRECVID 2010-an overview of the goals, tasks, data. Evaluation mechanisms and metrics. In: Proceedings of TRECVID workshop Over P, Awad G, Fiscus J, Antonishek B, Smeaton AF, Kraaij W, Quenot G (2010) TRECVID 2010-an overview of the goals, tasks, data. Evaluation mechanisms and metrics. In: Proceedings of TRECVID workshop
23.
Zurück zum Zitat Schler J, Koppel M, Argamon S, Pennebaker J (2006) Effects of age and gender on blogging. In: Proceedings of AAAI spring symposium on computational approaches for analyzing weblogs Schler J, Koppel M, Argamon S, Pennebaker J (2006) Effects of age and gender on blogging. In: Proceedings of AAAI spring symposium on computational approaches for analyzing weblogs
24.
Zurück zum Zitat Schölkopf B, Smola A (2001) Learning with kernels: support vector machines, regularization, optimization, and beyond. MIT Press, Cambridge Schölkopf B, Smola A (2001) Learning with kernels: support vector machines, regularization, optimization, and beyond. MIT Press, Cambridge
25.
Zurück zum Zitat Sengamedu SH, Sawant N, Wadhwa S (2007) vADeo: video advertising system. In: Proceedings of ACM multimedia, pp 455–456 Sengamedu SH, Sawant N, Wadhwa S (2007) vADeo: video advertising system. In: Proceedings of ACM multimedia, pp 455–456
26.
Zurück zum Zitat Silversmith D (2011) Google losing up to 1.65M a day on YouTube. internetevolution.com (retrieved: December 2011) Silversmith D (2011) Google losing up to 1.65M a day on YouTube. internetevolution.com (retrieved: December 2011)
27.
Zurück zum Zitat Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. In: Proceedings of the international conference computer vision, pp 1470–1477 Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. In: Proceedings of the international conference computer vision, pp 1470–1477
28.
Zurück zum Zitat Smeaton A (2005) Large scale evaluations of multimedia information retrieval: the TRECVid experience. In: Proceedings of CIVR, pp 11–17 Smeaton A (2005) Large scale evaluations of multimedia information retrieval: the TRECVid experience. In: Proceedings of CIVR, pp 11–17
29.
Zurück zum Zitat Smeaton A, Over P, Kraaij W (2009) High-level feature detection from video in TRECVid: a 5-year retrospective of achievements. Multimed Content Anal, pp 1–24 Smeaton A, Over P, Kraaij W (2009) High-level feature detection from video in TRECVid: a 5-year retrospective of achievements. Multimed Content Anal, pp 1–24
31.
Zurück zum Zitat Snoek C et al (2011) The mediaMill TRECVID 2011 semantic video search engine. In: Proceedings of TRECVID workshop (unreviewed workshop paper) Snoek C et al (2011) The mediaMill TRECVID 2011 semantic video search engine. In: Proceedings of TRECVID workshop (unreviewed workshop paper)
32.
Zurück zum Zitat Torresani L, Szummer M, Fitzgibbon A (2010) Efficient object category recognition using classemes. Comput Vis ECCV 2010: 776–789 Torresani L, Szummer M, Fitzgibbon A (2010) Efficient object category recognition using classemes. Comput Vis ECCV 2010: 776–789
33.
Zurück zum Zitat Ulges A, Koch M, Borth D, Breuel T (2009) TubeTagger-YouTube-based concept detection. In: Proceedings of workshop on internet multimedia, mining Ulges A, Koch M, Borth D, Breuel T (2009) TubeTagger-YouTube-based concept detection. In: Proceedings of workshop on internet multimedia, mining
34.
Zurück zum Zitat Wang X-J, Yu M, Zhang L, Cai R, Argo W-YMa (2009) Intelligent advertising by mining a user’s interest from his photo collections. In: Proceedings of KDD workshop on data mining and audience intelligence for advertising, pp 18–26 Wang X-J, Yu M, Zhang L, Cai R, Argo W-YMa (2009) Intelligent advertising by mining a user’s interest from his photo collections. In: Proceedings of KDD workshop on data mining and audience intelligence for advertising, pp 18–26
36.
Zurück zum Zitat Wu X, Bolivar A (2008) Keyword extraction for contextual advertisement. In: Proceedings of the 17th international conference on, World Wide Web, pp 1195–1196 Wu X, Bolivar A (2008) Keyword extraction for contextual advertisement. In: Proceedings of the 17th international conference on, World Wide Web, pp 1195–1196
37.
Zurück zum Zitat Yan J, Liu N, Wang G, Zhang W, Jiang Y, Chen Z (2009) How much can behavioral targeting help online advertising? In: Proceedings of the 18th international conference on, World wide web, pp 261–270 Yan J, Liu N, Wang G, Zhang W, Jiang Y, Chen Z (2009) How much can behavioral targeting help online advertising? In: Proceedings of the 18th international conference on, World wide web, pp 261–270
38.
Zurück zum Zitat Yan J et al (2009) How much can behavioral targeting help online advertising? In: Proceedings of WWW, pp 261–270 Yan J et al (2009) How much can behavioral targeting help online advertising? In: Proceedings of WWW, pp 261–270
39.
Zurück zum Zitat Yang J, Hauptmann A (2008) (Un)Reliability of video concept detection. In: Proceedings of the international conference image and video retrieval, pp 85–94 Yang J, Hauptmann A (2008) (Un)Reliability of video concept detection. In: Proceedings of the international conference image and video retrieval, pp 85–94
40.
Zurück zum Zitat Yih W-t, Goodman J, Carvalho VR (2006) Finding advertising keywords on web pages. In: Proceedings of WWW, pp 213–222 Yih W-t, Goodman J, Carvalho VR (2006) Finding advertising keywords on web pages. In: Proceedings of WWW, pp 213–222
42.
Zurück zum Zitat Zelnik-Manor L, Zanetti S, Perona P (2008) A walk through the web’s video clips. In: Proceedings of first internet vision workshop Zelnik-Manor L, Zanetti S, Perona P (2008) A walk through the web’s video clips. In: Proceedings of first internet vision workshop
Metadaten
Titel
Content analysis meets viewers: linking concept detection with demographics on YouTube
verfasst von
Adrian Ulges
Damian Borth
Markus Koch
Publikationsdatum
01.06.2013
Verlag
Springer-Verlag
Erschienen in
International Journal of Multimedia Information Retrieval / Ausgabe 2/2013
Print ISSN: 2192-6611
Elektronische ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-012-0029-x

Weitere Artikel der Ausgabe 2/2013

International Journal of Multimedia Information Retrieval 2/2013 Zur Ausgabe