Skip to main content

2014 | OriginalPaper | Buchkapitel

2. User-Perceptive Multimedia Content Analysis

verfasst von : Jitao Sang

Erschienen in: User-centric Social Multimedia Computing

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Typical social multimedia services allow users as uploaders, viewers, taggers, and commenters to interact and collaborate with each other in a communication dialog. The wisdom of crowds provides a huge resource for understanding social multimedia content. In this chapter, we explicitly model user interaction in the tag generation process and propose a regularized tensor factorization solution to refine the ternary correlations among user, image, and tag. While the traditional social tag analysis work focus on analyzing the image-tag binary correlation, taking user factor into consideration shows superior performance in image tag refinement task.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
 We show a running example consisting of three users, five tags, and four images in Fig. 2.1a.
 
2
 Note in most tag processing work, while tag is contributed by users, user factor is not explicitly considered. We will discuss the difference between our work in this chapter and the existing tag process work in next subsection.
 
3
 In practice, for new images not in the training dataset, we can approximate their positions in the learnt image subspace by using approximated eigenfunctions based on the kernel trick [2].
 
4
 We call triplets like \((u_3, i_2, :)\) and \((u_3, i_4, :)\) as the neutral triplets.
 
5
 Detail of \(W^T\) construction is introduced in next subsection.
 
6
 In the experiment, we choose \(\lambda _c=0.9\) and \(\lambda _s=0.1\).
 
7
 The user factor \(U\) and tag factor\(T\) are the same cases as the image factor \(I\).
 
8
 Due to link failures, the owner ID of some images is unavailable.
 
Literatur
1.
Zurück zum Zitat Acar, E., Yener, B.: Unsupervised multiway data analysis: a literature survey. IEEE Trans. Knowl. Data Eng. 21(1), 6–20 (2009)CrossRef Acar, E., Yener, B.: Unsupervised multiway data analysis: a literature survey. IEEE Trans. Knowl. Data Eng. 21(1), 6–20 (2009)CrossRef
2.
Zurück zum Zitat Bengio, Y., Paiement, J.-F., Vincent, P., Delalleau, O., Roux, N.L., Ouimet, M.: Out-of-sample extensions for lle, isomap, mds, eigenmaps, and spectral clustering. In: NIPS (2003) Bengio, Y., Paiement, J.-F., Vincent, P., Delalleau, O., Roux, N.L., Ouimet, M.: Out-of-sample extensions for lle, isomap, mds, eigenmaps, and spectral clustering. In: NIPS (2003)
3.
Zurück zum Zitat Borghol, Y., Ardon, S., Carlsson, N., Eager, D., Mahanti, A.: The untold story of the clones: content-agnostic factors that impact youtube video popularity. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’12, pp. 1186–1194 (2012) Borghol, Y., Ardon, S., Carlsson, N., Eager, D., Mahanti, A.: The untold story of the clones: content-agnostic factors that impact youtube video popularity. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’12, pp. 1186–1194 (2012)
4.
Zurück zum Zitat Chen, L., Xu, D., Tsang, I.W.-H., Luo, J.: Tag-based web photo retrieval improved by batch mode re-tagging. In: CVPR, pp. 3440–3446 (2010) Chen, L., Xu, D., Tsang, I.W.-H., Luo, J.: Tag-based web photo retrieval improved by batch mode re-tagging. In: CVPR, pp. 3440–3446 (2010)
5.
Zurück zum Zitat Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: Nus-wide: a real-world web image database from national university of singapore. In: CIVR (2009) Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: Nus-wide: a real-world web image database from national university of singapore. In: CIVR (2009)
6.
Zurück zum Zitat Cranshaw, J., Schwartz, R., Hong, J.I., Sadeh, N.M.: The livehoods project: utilizing social media to understand the dynamics of a city. In: ICWSM (2012) Cranshaw, J., Schwartz, R., Hong, J.I., Sadeh, N.M.: The livehoods project: utilizing social media to understand the dynamics of a city. In: ICWSM (2012)
7.
Zurück zum Zitat De Choudhury, M., Sundaram, H., John, A., Seligmann, D.D.: What makes conversations interesting? Themes, participants and consequences of conversations in online social media. In: Proceedings of the 18th International Conference on World Wide Web, WWW’09, pp. 331–340 (2009) De Choudhury, M., Sundaram, H., John, A., Seligmann, D.D.: What makes conversations interesting? Themes, participants and consequences of conversations in online social media. In: Proceedings of the 18th International Conference on World Wide Web, WWW’09, pp. 331–340 (2009)
8.
Zurück zum Zitat Eickhoff, C., Li, W., de Vries, A.P.: Exploiting user comments for audio-visual content indexing and retrieval. In: 34th European Conference on Information Retrieval (ECIR) (2013) Eickhoff, C., Li, W., de Vries, A.P.: Exploiting user comments for audio-visual content indexing and retrieval. In: 34th European Conference on Information Retrieval (ECIR) (2013)
9.
Zurück zum Zitat Fang, Q., Sang, J., Xu, C., Rui, Y.: Topic-sensitive influencer mining in interest-based social media networks via hypergraph learning. IEEE Trans. Multimed. 16(3), 796–812 (2014)CrossRef Fang, Q., Sang, J., Xu, C., Rui, Y.: Topic-sensitive influencer mining in interest-based social media networks via hypergraph learning. IEEE Trans. Multimed. 16(3), 796–812 (2014)CrossRef
10.
Zurück zum Zitat Feng, W., Wang, J.: Incorporating heterogeneous information for personalized tag recommendation in social tagging systems. In: KDD, pp. 1276–1284 (2012) Feng, W., Wang, J.: Incorporating heterogeneous information for personalized tag recommendation in social tagging systems. In: KDD, pp. 1276–1284 (2012)
11.
Zurück zum Zitat Filippova, K., Hall, K.B.: Improved video categorization from text metadata and user comments. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR’11, pp. 835–842 (2011) Filippova, K., Hall, K.B.: Improved video categorization from text metadata and user comments. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR’11, pp. 835–842 (2011)
12.
Zurück zum Zitat He, X., Kan, M.-Y., Xie, P., Chen, X.: Comment-based multi-view clustering of web 2.0 items. In: Proceedings of the 23rd International Conference on World Wide Web, WWW’14, pp. 771–782 (2014) He, X., Kan, M.-Y., Xie, P., Chen, X.: Comment-based multi-view clustering of web 2.0 items. In: Proceedings of the 23rd International Conference on World Wide Web, WWW’14, pp. 771–782 (2014)
13.
Zurück zum Zitat Helic, D.,Strohmaier, M.: Building directories for social tagging systems. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, CIKM’10, pp. 525–534 (2011) Helic, D.,Strohmaier, M.: Building directories for social tagging systems. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, CIKM’10, pp. 525–534 (2011)
14.
Zurück zum Zitat Hu, X., Tang, L., Tang, J., Liu, H.: Exploiting social relations for sentiment analysis in microblogging. In: WSDM, pp. 537–546 (2013) Hu, X., Tang, L., Tang, J., Liu, H.: Exploiting social relations for sentiment analysis in microblogging. In: WSDM, pp. 537–546 (2013)
15.
Zurück zum Zitat Jin, X., Wang, C., Luo, J., Yu, X., Han, J.: Likeminer: a system for mining the power of ‘like’ in social media networks. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’11, pp. 753–756 (2011) Jin, X., Wang, C., Luo, J., Yu, X., Han, J.: Likeminer: a system for mining the power of ‘like’ in social media networks. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’11, pp. 753–756 (2011)
16.
Zurück zum Zitat Jin, Y., Khan, L., Wang, L., Awad, M.: Image annotations by combining multiple evidence & wordnet. In: ACM Multimedia, pp. 706–715 (2005) Jin, Y., Khan, L., Wang, L., Awad, M.: Image annotations by combining multiple evidence & wordnet. In: ACM Multimedia, pp. 706–715 (2005)
17.
Zurück zum Zitat Lappas, T., Punera, K., Sarlos, T.: Mining tags using social endorsement networks. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR’11, pp. 195–204 (2011) Lappas, T., Punera, K., Sarlos, T.: Mining tags using social endorsement networks. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR’11, pp. 195–204 (2011)
18.
Zurück zum Zitat Li, W.-J., Yeung, D.-Y.: Relation regularized matrix factorization. In: IJCAI, pp. 1126–1131 (2009) Li, W.-J., Yeung, D.-Y.: Relation regularized matrix factorization. In: IJCAI, pp. 1126–1131 (2009)
19.
Zurück zum Zitat Li, Z., Liu, J., Zhu, X., Liu, T., Lu, H.: Image annotation using multi-correlation probabilistic matrix factorization. In: ACM Multimedia, pp. 1187–1190 (2010) Li, Z., Liu, J., Zhu, X., Liu, T., Lu, H.: Image annotation using multi-correlation probabilistic matrix factorization. In: ACM Multimedia, pp. 1187–1190 (2010)
20.
Zurück zum Zitat Liu, D., Hua, X.-S., Wang, M., Zhang, H.-J.: Image retagging. In: ACM Multimedia, pp. 491–500 (2010) Liu, D., Hua, X.-S., Wang, M., Zhang, H.-J.: Image retagging. In: ACM Multimedia, pp. 491–500 (2010)
21.
Zurück zum Zitat Liu, D., Hua, X.-S., Yang, L., Wang, M., Zhang, H.-J.: Tag ranking. In: WWW, pp. 351–360 (2009) Liu, D., Hua, X.-S., Yang, L., Wang, M., Zhang, H.-J.: Tag ranking. In: WWW, pp. 351–360 (2009)
22.
Zurück zum Zitat Liu, D., Hua, X.-S., Zhang, H.-J.: Content-based tag processing for internet social images. Multimed. Tool. Appl. 51, 723–738 (2011)CrossRef Liu, D., Hua, X.-S., Zhang, H.-J.: Content-based tag processing for internet social images. Multimed. Tool. Appl. 51, 723–738 (2011)CrossRef
23.
Zurück zum Zitat Liu, D., Yan, S., Rui, Y., Zhang, H.-J.: Unified tag analysis with multi-edge graph. In: ACM Multimedia, pp. 25–34 (2010) Liu, D., Yan, S., Rui, Y., Zhang, H.-J.: Unified tag analysis with multi-edge graph. In: ACM Multimedia, pp. 25–34 (2010)
24.
Zurück zum Zitat Liu, J., Wang, B., Li, M., Li, Z., Ma, W.-Y., Lu, H., Ma, S.: Dual cross-media relevance model for image annotation. In: ACM Multimedia, pp. 605–614 (2007) Liu, J., Wang, B., Li, M., Li, Z., Ma, W.-Y., Lu, H., Ma, S.: Dual cross-media relevance model for image annotation. In: ACM Multimedia, pp. 605–614 (2007)
25.
Zurück zum Zitat Liu, X., Yan, S., Cheng, B., Tang, J., Chua, T.-S., Jin, H.: Label-to-region with continuity-biased bi-layer sparsity priors. ACM Trans. Multimed. Comput. Commun. Appl. (TOMCCAP) 8(4), 50 (2012) Liu, X., Yan, S., Cheng, B., Tang, J., Chua, T.-S., Jin, H.: Label-to-region with continuity-biased bi-layer sparsity priors. ACM Trans. Multimed. Comput. Commun. Appl. (TOMCCAP) 8(4), 50 (2012)
26.
Zurück zum Zitat Lu, C., Hu, X., Chen, X., Park, J.-R., He, T., Li, Z.: The topic-perspective model for social tagging systems. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 683–692 (2010) Lu, C., Hu, X., Chen, X., Park, J.-R., He, T., Li, Z.: The topic-perspective model for social tagging systems. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 683–692 (2010)
27.
Zurück zum Zitat man Au Yeung, C., Gibbins, N., Shadbolt, N.: A study of user profile generation from folksonomies. In: SWKM (2008) man Au Yeung, C., Gibbins, N., Shadbolt, N.: A study of user profile generation from folksonomies. In: SWKM (2008)
28.
Zurück zum Zitat Pinto, H., Almeida, J.M., Gonçalves, M.A.: Using early view patterns to predict the popularity of youtube videos. In: Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, WSDM’13, pp. 365–374 (2013) Pinto, H., Almeida, J.M., Gonçalves, M.A.: Using early view patterns to predict the popularity of youtube videos. In: Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, WSDM’13, pp. 365–374 (2013)
29.
Zurück zum Zitat Plangprasopchok, A., Lerman, K., Getoor, L.: Growing a tree in the forest: Constructing folksonomies by integrating structured metadata. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’10, pp. 949–958 (2010) Plangprasopchok, A., Lerman, K., Getoor, L.: Growing a tree in the forest: Constructing folksonomies by integrating structured metadata. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’10, pp. 949–958 (2010)
30.
Zurück zum Zitat Potthast, M., Stein, B., Becker, S.: Towards comment-based cross-media retrieval. In: Proceedings of the 19th International Conference on World Wide Web, WWW’10, pp. 1169–1170 (2010) Potthast, M., Stein, B., Becker, S.: Towards comment-based cross-media retrieval. In: Proceedings of the 19th International Conference on World Wide Web, WWW’10, pp. 1169–1170 (2010)
31.
Zurück zum Zitat Rendle, S., Marinho, L.B., Nanopoulos, A., Schmidt-Thieme, L.: Learning optimal ranking with tensor factorization for tag recommendation. In: KDD, pp. 727–736 (2009) Rendle, S., Marinho, L.B., Nanopoulos, A., Schmidt-Thieme, L.: Learning optimal ranking with tensor factorization for tag recommendation. In: KDD, pp. 727–736 (2009)
32.
Zurück zum Zitat Rendle, S., Schmidt-Thieme, L.: Pairwise interaction tensor factorization for personalized tag recommendation. In: WSDM, pp. 81–90 (2010) Rendle, S., Schmidt-Thieme, L.: Pairwise interaction tensor factorization for personalized tag recommendation. In: WSDM, pp. 81–90 (2010)
33.
Zurück zum Zitat Sang, J., Liu, J., Xu, C.: Exploiting user information for image tag refinement. In: ACM Multimedia, pp. 1129–1132 (2011) Sang, J., Liu, J., Xu, C.: Exploiting user information for image tag refinement. In: ACM Multimedia, pp. 1129–1132 (2011)
34.
Zurück zum Zitat Sang, J., Xu, C., Liu, J.: User-aware image tag refinement via ternary semantic analysis. IEEE Trans. Multimed. 14(3–2), 883–895 (2012)CrossRef Sang, J., Xu, C., Liu, J.: User-aware image tag refinement via ternary semantic analysis. IEEE Trans. Multimed. 14(3–2), 883–895 (2012)CrossRef
35.
Zurück zum Zitat Sang, J., Xu, C., Lu, D.: Learn to personalized image search from the photo sharing websites. IEEE Trans. Multimed. 14(4), 963–974 (2012)CrossRef Sang, J., Xu, C., Lu, D.: Learn to personalized image search from the photo sharing websites. IEEE Trans. Multimed. 14(4), 963–974 (2012)CrossRef
36.
Zurück zum Zitat Siersdorfer, S., Chelaru, S., Nejdl, W., San Pedro, J.: How useful are your comments? Analyzing and predicting youtube comments and comment ratings. In: Proceedings of the 19th International Conference on World Wide Web, WWW’10, pp. 891–900 (2010) Siersdorfer, S., Chelaru, S., Nejdl, W., San Pedro, J.: How useful are your comments? Analyzing and predicting youtube comments and comment ratings. In: Proceedings of the 19th International Conference on World Wide Web, WWW’10, pp. 891–900 (2010)
37.
Zurück zum Zitat Trevisiol, M., Jégou, H., Delhumeau, J., Gravier, G.: Retrieving geo-location of videos with a divide & conquer hierarchical multimodal approach. In: ICMR, pp. 1–8 (2013) Trevisiol, M., Jégou, H., Delhumeau, J., Gravier, G.: Retrieving geo-location of videos with a divide & conquer hierarchical multimodal approach. In: ICMR, pp. 1–8 (2013)
38.
Zurück zum Zitat von Ahn, L., Dabbish, L.: Esp: Labeling images with a computer game. In: AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors, pp. 91–98 (2005) von Ahn, L., Dabbish, L.: Esp: Labeling images with a computer game. In: AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors, pp. 91–98 (2005)
39.
Zurück zum Zitat Wang, C., Jing, F., Zhang, L., Zhang, H.: Image annotation refinement using random walk with restarts. In: ACM Multimedia, pp. 647–650 (2006) Wang, C., Jing, F., Zhang, L., Zhang, H.: Image annotation refinement using random walk with restarts. In: ACM Multimedia, pp. 647–650 (2006)
40.
Zurück zum Zitat Wang, C., Jing, F., Zhang, L., Zhang, H.-J.: Content-based image annotation refinement. In: CVPR (2007) Wang, C., Jing, F., Zhang, L., Zhang, H.-J.: Content-based image annotation refinement. In: CVPR (2007)
41.
Zurück zum Zitat Xie, L., Natsev, A., Hill, M.L., Smith, J.R., Phillips, A.: The accuracy and value of machine-generated image tags: design and user evaluation of an end-to-end image tagging system. In: CIVR, pp. 58–65 (2010) Xie, L., Natsev, A., Hill, M.L., Smith, J.R., Phillips, A.: The accuracy and value of machine-generated image tags: design and user evaluation of an end-to-end image tagging system. In: CIVR, pp. 58–65 (2010)
42.
Zurück zum Zitat Xu, H., Wang, J., Hua, X.-S., Li, S.: Tag refinement by regularized lda. In: ACM Multimedia, pp. 573–576 (2009) Xu, H., Wang, J., Hua, X.-S., Li, S.: Tag refinement by regularized lda. In: ACM Multimedia, pp. 573–576 (2009)
43.
Zurück zum Zitat Yamamoto, T., Nakamura, S.: Leveraging viewer comments for mood classification of music video clips. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR’13, pp. 797–800 (2013) Yamamoto, T., Nakamura, S.: Leveraging viewer comments for mood classification of music video clips. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR’13, pp. 797–800 (2013)
44.
Zurück zum Zitat Ye, M., Shou, D., Lee, W.-C., Yin, P., Janowicz, K.: On the semantic annotation of places in location-based social networks. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’11, pp. 520–528 (2011) Ye, M., Shou, D., Lee, W.-C., Yin, P., Janowicz, K.: On the semantic annotation of places in location-based social networks. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’11, pp. 520–528 (2011)
45.
Zurück zum Zitat Yu, B., Ma, W.-Y., Nahrstedt, K., Zhang, H.-J.: Video summarization based on user log enhanced link analysis. In: Proceedings of the Eleventh ACM International Conference on Multimedia, MULTIMEDIA’03, pp. 382–391 (2003) Yu, B., Ma, W.-Y., Nahrstedt, K., Zhang, H.-J.: Video summarization based on user log enhanced link analysis. In: Proceedings of the Eleventh ACM International Conference on Multimedia, MULTIMEDIA’03, pp. 382–391 (2003)
46.
Zurück zum Zitat Zhou, Y., Wilkinson, D.M., Schreiber, R., Pan, R.: Large-scale parallel collaborative filtering for the netflix prize. In: AAIM, pp. 337–348 (2008) Zhou, Y., Wilkinson, D.M., Schreiber, R., Pan, R.: Large-scale parallel collaborative filtering for the netflix prize. In: AAIM, pp. 337–348 (2008)
47.
Zurück zum Zitat Zhu, G., Yan, S., Ma, Y.: Image tag refinement towards low-rank, content-tag prior and error sparsity. In: ACM Multimedia, pp. 461–470 (2010) Zhu, G., Yan, S., Ma, Y.: Image tag refinement towards low-rank, content-tag prior and error sparsity. In: ACM Multimedia, pp. 461–470 (2010)
Metadaten
Titel
User-Perceptive Multimedia Content Analysis
verfasst von
Jitao Sang
Copyright-Jahr
2014
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-44671-3_2

Neuer Inhalt