Skip to main content
Top
Published in: Multimedia Systems 5/2023

26-07-2023 | Regular Paper

Multimodal heterogeneous graph convolutional network for image recommendation

Authors: Weiyi Wei, Jian Wang, Mengyu Xu, Futong Zhang

Published in: Multimedia Systems | Issue 5/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

To improve the efficiency of the connection between people and information in specific scenarios, recent work has focused on mining user preferences from interactions. However, with the emergence of multimodal information in recent years, user choice in the image recommendation domain is influenced by multiple factors, such as image style, tags, and user social relationships, etc. Therefore, to explore user preferences under different modalities, we capture potential user preferences in a multimodal collaborative manner. In this work, a multimodal heterogeneous graph convolutional network model for image recommendation is proposed, which explores the differences in the representation of user preferences under different modalities. For different modalities, deep propagation networks are employed to construct higher-order connectivity coding between user heterogeneous interactions and image, tag, and user preference information. In addition, a dual-channel attention strategy with the idea of partitioning is employed to optimize the potential preferences of users. The experiments are conducted on public real-world datasets, the results clearly demonstrate the collaborative ability of multimodal information and heterogeneous interaction relations in exploring user preferences.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Zhang, Y., Dong, Z., Meng, X.: Research on personalized advertisement recommendation system and its application. J. Comput. Sci. 44(3), 531–563 (2021) Zhang, Y., Dong, Z., Meng, X.: Research on personalized advertisement recommendation system and its application. J. Comput. Sci. 44(3), 531–563 (2021)
2.
go back to reference Sun, J.: Personalized music recommendation algorithm based on spark platform. Comput. Intell. Neurosci. 2022, 1–19 (2022) Sun, J.: Personalized music recommendation algorithm based on spark platform. Comput. Intell. Neurosci. 2022, 1–19 (2022)
3.
go back to reference Wang, Z.H., Hou, D.Z.: Research on book recommendation algorithm based on collaborative filtering and interest degree. Wirel. Commun. Mob. Comput. 2021, 1–7 (2021) Wang, Z.H., Hou, D.Z.: Research on book recommendation algorithm based on collaborative filtering and interest degree. Wirel. Commun. Mob. Comput. 2021, 1–7 (2021)
4.
go back to reference Liu, J., Choi, W.-H., Liu, J.: Personalized movie recommendation method based on deep learning. Math. Probl. Eng. 2021, 1–12 (2021) Liu, J., Choi, W.-H., Liu, J.: Personalized movie recommendation method based on deep learning. Math. Probl. Eng. 2021, 1–12 (2021)
5.
go back to reference Leng, Y.L.C., Lu, Q.: A review of collaborative filtering recommendation techniques. Pattern Recognit. Artif. Intell. 2014, 720–734 (2014) Leng, Y.L.C., Lu, Q.: A review of collaborative filtering recommendation techniques. Pattern Recognit. Artif. Intell. 2014, 720–734 (2014)
6.
go back to reference Yaxiong, C.: Research on Retrieval Technology Based on Deep Learning. University of Chinese Academy of Sciences (Xi’an Institute of Optical Precision Machinery, Chinese Academy of Sciences), vol. 2020 (2020) Yaxiong, C.: Research on Retrieval Technology Based on Deep Learning. University of Chinese Academy of Sciences (Xi’an Institute of Optical Precision Machinery, Chinese Academy of Sciences), vol. 2020 (2020)
7.
go back to reference Guo, G., Meng, Y., Zhang, Y., Han, C., Li, Y.: Visual semantic image recommendation. IEEE Access 7, 33424–33433 (2019)CrossRef Guo, G., Meng, Y., Zhang, Y., Han, C., Li, Y.: Visual semantic image recommendation. IEEE Access 7, 33424–33433 (2019)CrossRef
8.
go back to reference Geng, X., Zhang, H., Bian, J., Chua, T.-S.: Learning image and user features for recommendation in social networks. In: ICCV Geng, X., Zhang, H., Bian, J., Chua, T.-S.: Learning image and user features for recommendation in social networks. In: ICCV
9.
go back to reference Wu, L., Chen, L., Hong, R., Fu, Y., Xie, X., Wang, M.: A hierarchical attention model for social contextual image recommendation. IEEE Trans. Knowl. Data Eng. 32(10), 1854–1867 (2019)CrossRef Wu, L., Chen, L., Hong, R., Fu, Y., Xie, X., Wang, M.: A hierarchical attention model for social contextual image recommendation. IEEE Trans. Knowl. Data Eng. 32(10), 1854–1867 (2019)CrossRef
10.
go back to reference Jian, M., Guo, J., Shi, G., Wu, L., Wang, Z.: Multimodal collaborative graph for image recommendation. Appl. Intell. 53, 560–573 (2022)CrossRef Jian, M., Guo, J., Shi, G., Wu, L., Wang, Z.: Multimodal collaborative graph for image recommendation. Appl. Intell. 53, 560–573 (2022)CrossRef
11.
go back to reference Jun, H., Caiqing, Z., Xiaozhen, L., Dehai, Z.: Survey of research on multimodal fusion technology for deep learning. Comput. Eng. 46(5), 1–11 (2020) Jun, H., Caiqing, Z., Xiaozhen, L., Dehai, Z.: Survey of research on multimodal fusion technology for deep learning. Comput. Eng. 46(5), 1–11 (2020)
12.
go back to reference Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:1710.10903 (2017) Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:​1710.​10903 (2017)
13.
go back to reference Tan Xinyuan, P.S.: Research on a heterogeneous graph neural network model for aggregating high-order neighbour nodes. Small Micro. Comput. Syst. 43, 1–8 (2022) Tan Xinyuan, P.S.: Research on a heterogeneous graph neural network model for aggregating high-order neighbour nodes. Small Micro. Comput. Syst. 43, 1–8 (2022)
14.
go back to reference Yuan, J., Cao, M., Cheng, H., Yu, H., Xie, J., Wang, C.: A unified structure learning framework for graph attention networks. Neurocomputing 495, 194–204 (2022)CrossRef Yuan, J., Cao, M., Cheng, H., Yu, H., Xie, J., Wang, C.: A unified structure learning framework for graph attention networks. Neurocomputing 495, 194–204 (2022)CrossRef
15.
go back to reference Zhao, J., Zhuang, F., Ao, X., He, Q., Jiang, H., Ma, L.: Survey of collaborative filtering recommender systems. J. Cyber Secur. 6(5), 17–34 (2021) Zhao, J., Zhuang, F., Ao, X., He, Q., Jiang, H., Ma, L.: Survey of collaborative filtering recommender systems. J. Cyber Secur. 6(5), 17–34 (2021)
16.
go back to reference Cao, D., Miao, L., Rong, H., Qin, Z., Nie, L.: Hashtag our stories: hashtag recommendation for micro-videos via harnessing multiple modalities. Knowl. Based Syst. 203, 106114 (2020)CrossRef Cao, D., Miao, L., Rong, H., Qin, Z., Nie, L.: Hashtag our stories: hashtag recommendation for micro-videos via harnessing multiple modalities. Knowl. Based Syst. 203, 106114 (2020)CrossRef
17.
go back to reference Sun, R., Cao, X., Zhao, Y., Wan, J., Zhou, K., Zhang, F., Wang, Z., Zheng, K.: Multi-modal knowledge graphs for recommender systems. In: CIKM Sun, R., Cao, X., Zhao, Y., Wan, J., Zhou, K., Zhang, F., Wang, Z., Zheng, K.: Multi-modal knowledge graphs for recommender systems. In: CIKM
18.
go back to reference Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556 (2014)
19.
go back to reference Zhang, S., Yao, Y., Xu, F., Tong, H., Yan, X., Lu, J.: Hashtag recommendation for photo sharing services. In: AAAI Zhang, S., Yao, Y., Xu, F., Tong, H., Yan, X., Lu, J.: Hashtag recommendation for photo sharing services. In: AAAI
20.
go back to reference Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al.: Learning transferable visual models from natural language supervision. In: International Conference on Machine Learning, pp. 8748–8763. PMLR (2021) Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al.: Learning transferable visual models from natural language supervision. In: International Conference on Machine Learning, pp. 8748–8763. PMLR (2021)
21.
go back to reference Ren, Y., Cheng, X., Li, X., et al.: Image description and recognition based on concept-level semantics. Comput. Sci. 2008(7), 206–212 (2008) Ren, Y., Cheng, X., Li, X., et al.: Image description and recognition based on concept-level semantics. Comput. Sci. 2008(7), 206–212 (2008)
22.
go back to reference Kim, H.-U., Koh, Y.J., Kim, C.-S.: Pienet: Personalized image enhancement network. In: European Conference on Computer Vision, pp. 374–390. Springer (2020) Kim, H.-U., Koh, Y.J., Kim, C.-S.: Pienet: Personalized image enhancement network. In: European Conference on Computer Vision, pp. 374–390. Springer (2020)
23.
go back to reference Hu, Y., Koren, Y., Volinsky, C.: Collaborative filtering for implicit feedback datasets. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 263–272. IEEE (2008) Hu, Y., Koren, Y., Volinsky, C.: Collaborative filtering for implicit feedback datasets. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 263–272. IEEE (2008)
24.
go back to reference He, J., Zhang, C., Li, X., et al.: A review of research on multimodal fusion techniques for deep learning. Comput. Eng. 46(5):1–11 (2020) He, J., Zhang, C., Li, X., et al.: A review of research on multimodal fusion techniques for deep learning. Comput. Eng. 46(5):1–11 (2020)
25.
go back to reference Ding, Y., Yu, J., Liu, B., Hu, Y., Cui, M., Wu, Q.: Mukea: multimodal knowledge extraction and accumulation for knowledge-based visual question answering. In: CVPR Ding, Y., Yu, J., Liu, B., Hu, Y., Cui, M., Wu, Q.: Mukea: multimodal knowledge extraction and accumulation for knowledge-based visual question answering. In: CVPR
26.
go back to reference Wu, Q., Shen, Q., Luan, J., Wang, Y.: Msdtron: a high-capability multi-speaker speech synthesis system for diverse data using characteristic information. In: ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6327–6331. IEEE (2022) Wu, Q., Shen, Q., Luan, J., Wang, Y.: Msdtron: a high-capability multi-speaker speech synthesis system for diverse data using characteristic information. In: ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6327–6331. IEEE (2022)
27.
go back to reference Liu, Z.-X., Zhang, Z.-H., Zhang, J.: A top-N recommendation method for graph attention based on multi-level and multi-perspective. Comput. Sci. 48(4), 104–110 (2021) Liu, Z.-X., Zhang, Z.-H., Zhang, J.: A top-N recommendation method for graph attention based on multi-level and multi-perspective. Comput. Sci. 48(4), 104–110 (2021)
28.
go back to reference Tang, J., Gao, H., Liu, H.: mtrust: discerning multi-faceted trust in a connected world. In: WSDM Tang, J., Gao, H., Liu, H.: mtrust: discerning multi-faceted trust in a connected world. In: WSDM
29.
go back to reference Ni, J., Li, J., McAuley, J.: Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In: EMNLP-IJCNLP Ni, J., Li, J., McAuley, J.: Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In: EMNLP-IJCNLP
30.
go back to reference Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: Nus-wide: a real-world web image database from national university of Singapore. In: ICMR Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: Nus-wide: a real-world web image database from national university of Singapore. In: ICMR
31.
go back to reference Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: Bpr: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:1205.2618 (2012) Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: Bpr: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:​1205.​2618 (2012)
32.
go back to reference He, X., Liao, L., Zhang, H., Nie, L., Hu, X., Chua, T.-S.: Neural collaborative filtering. In: WWW He, X., Liao, L., Zhang, H., Nie, L., Hu, X., Chua, T.-S.: Neural collaborative filtering. In: WWW
33.
go back to reference Xue, H.-J., Dai, X., Zhang, J., Huang, S., Chen, J.: Deep matrix factorization models for recommender systems. In: IJCAI, vol. 17, pp. 3203–3209. Melbourne (2017) Xue, H.-J., Dai, X., Zhang, J., Huang, S., Chen, J.: Deep matrix factorization models for recommender systems. In: IJCAI, vol. 17, pp. 3203–3209. Melbourne (2017)
34.
go back to reference Zheng, L., Lu, C.-T., Jiang, F., Zhang, J., Yu, P.S.: Spectral collaborative filtering. In: RecSys Zheng, L., Lu, C.-T., Jiang, F., Zhang, J., Yu, P.S.: Spectral collaborative filtering. In: RecSys
36.
go back to reference Ma, J., Zhou, C., Cui, P., Yang, H., Zhu, W.: Learning disentangled representations for recommendation. Adv. Neural Inf. Process. Syst. 32, 1–14 (2019) Ma, J., Zhou, C., Cui, P., Yang, H., Zhu, W.: Learning disentangled representations for recommendation. Adv. Neural Inf. Process. Syst. 32, 1–14 (2019)
37.
go back to reference Wang, X., He, X., Wang, M., Feng, F., Chua, T.-S.: Neural graph collaborative filtering. In: SIGIR Wang, X., He, X., Wang, M., Feng, F., Chua, T.-S.: Neural graph collaborative filtering. In: SIGIR
38.
go back to reference He, X., Deng, K., Wang, X., Li, Y., Zhang, Y., Wang, M.: Lightgcn: simplifying and powering graph convolution network for recommendation. In: SIGIR He, X., Deng, K., Wang, X., Li, Y., Zhang, Y., Wang, M.: Lightgcn: simplifying and powering graph convolution network for recommendation. In: SIGIR
39.
go back to reference Li, Q., Han, Z., Wu, X.-M.: Deeper insights into graph convolutional networks for semi-supervised learning. In: AAAI Li, Q., Han, Z., Wu, X.-M.: Deeper insights into graph convolutional networks for semi-supervised learning. In: AAAI
Metadata
Title
Multimodal heterogeneous graph convolutional network for image recommendation
Authors
Weiyi Wei
Jian Wang
Mengyu Xu
Futong Zhang
Publication date
26-07-2023
Publisher
Springer Berlin Heidelberg
Published in
Multimedia Systems / Issue 5/2023
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-023-01136-4

Other articles of this Issue 5/2023

Multimedia Systems 5/2023 Go to the issue