Multimedia Systems

26 July 2023 | Regular Paper

Multimodal heterogeneous graph convolutional network for image recommendation

Authors: Weiyi Wei, Jian Wang, Mengyu Xu, Futong Zhang

Published in: Multimedia Systems | Issue 5/2023

Abstract

To connect people with relevant information more efficiently in specific scenarios, recent work has focused on mining user preferences from interactions. However, with the emergence of multimodal information, user choice in image recommendation is influenced by multiple factors, such as image style, tags, and users' social relationships. To explore user preferences under different modalities, we therefore capture potential user preferences in a multimodal collaborative manner. In this work, we propose a multimodal heterogeneous graph convolutional network model for image recommendation that explores how the representation of user preferences differs across modalities. For each modality, deep propagation networks encode the higher-order connectivity between users' heterogeneous interactions and image, tag, and user preference information. In addition, a dual-channel attention strategy based on the idea of partitioning is employed to refine users' potential preferences. Experiments on public real-world datasets demonstrate the collaborative ability of multimodal information and heterogeneous interaction relations in exploring user preferences.
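The abstract gives no equations, so the following is only a rough sketch of the general idea it describes: propagate embeddings over a user-image interaction graph separately for each modality, then fuse the per-modality results with attention. Everything concrete here is an assumption made for illustration and is not taken from the paper: the weight-free, layer-averaged propagation (in the style of LightGCN), the single-query softmax attention standing in for the dual-channel strategy, and all function names, shapes, and toy data.

```python
import numpy as np


def normalized_adjacency(interactions: np.ndarray) -> np.ndarray:
    """Return D^{-1/2} A D^{-1/2} for the bipartite user-item graph."""
    n_users, n_items = interactions.shape
    n = n_users + n_items
    adj = np.zeros((n, n))
    adj[:n_users, n_users:] = interactions
    adj[n_users:, :n_users] = interactions.T
    deg = adj.sum(axis=1)
    inv_sqrt = np.zeros_like(deg)
    nonzero = deg > 0
    inv_sqrt[nonzero] = deg[nonzero] ** -0.5  # isolated nodes keep weight 0
    return inv_sqrt[:, None] * adj * inv_sqrt[None, :]


def propagate(adj_norm: np.ndarray, embeddings: np.ndarray, n_layers: int = 2) -> np.ndarray:
    """Average node embeddings over 0..n_layers hops (no trainable weights)."""
    layers = [embeddings]
    for _ in range(n_layers):
        layers.append(adj_norm @ layers[-1])
    return np.mean(layers, axis=0)


def attention_fuse(per_modality: np.ndarray, query: np.ndarray):
    """Fuse (M, N, d) modality embeddings with softmax weights from a (d,) query."""
    scores = per_modality @ query                  # shape (M, N)
    scores = scores - scores.max(axis=0, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=0, keepdims=True)
    fused = (weights[..., None] * per_modality).sum(axis=0)
    return fused, weights


# Toy data (hypothetical): 2 users, 3 images, 2 modalities, 4-dim embeddings.
rng = np.random.default_rng(0)
interactions = np.array([[1.0, 0.0, 1.0],
                         [0.0, 1.0, 1.0]])
adj_norm = normalized_adjacency(interactions)
n_nodes, dim, n_modalities = 5, 4, 2

modal_embeddings = np.stack([
    propagate(adj_norm, rng.normal(size=(n_nodes, dim)))
    for _ in range(n_modalities)
])
fused, weights = attention_fuse(modal_embeddings, rng.normal(size=dim))

# Score every (user, image) pair by the inner product of fused embeddings.
user_emb, item_emb = fused[:2], fused[2:]
preference_scores = user_emb @ item_emb.T
```

In a trained model the initial embeddings and the attention query would be learned, for example with a pairwise ranking loss; here they are random so that the sketch stays self-contained.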

Title: Multimodal heterogeneous graph convolutional network for image recommendation
Authors: Weiyi Wei
Jian Wang
Mengyu Xu
Futong Zhang
Publication date: 26 July 2023
Publisher: Springer Berlin Heidelberg
Published in: Multimedia Systems / Issue 5/2023
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI: https://doi.org/10.1007/s00530-023-01136-4