Skip to main content
Erschienen in: Multimedia Systems 5/2023

26.07.2023 | Regular Paper

Multimodal heterogeneous graph convolutional network for image recommendation

verfasst von: Weiyi Wei, Jian Wang, Mengyu Xu, Futong Zhang

Erschienen in: Multimedia Systems | Ausgabe 5/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

To improve the efficiency of the connection between people and information in specific scenarios, recent work has focused on mining user preferences from interactions. However, with the emergence of multimodal information in recent years, user choice in the image recommendation domain is influenced by multiple factors, such as image style, tags, and user social relationships, etc. Therefore, to explore user preferences under different modalities, we capture potential user preferences in a multimodal collaborative manner. In this work, a multimodal heterogeneous graph convolutional network model for image recommendation is proposed, which explores the differences in the representation of user preferences under different modalities. For different modalities, deep propagation networks are employed to construct higher-order connectivity coding between user heterogeneous interactions and image, tag, and user preference information. In addition, a dual-channel attention strategy with the idea of partitioning is employed to optimize the potential preferences of users. The experiments are conducted on public real-world datasets, the results clearly demonstrate the collaborative ability of multimodal information and heterogeneous interaction relations in exploring user preferences.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhang, Y., Dong, Z., Meng, X.: Research on personalized advertisement recommendation system and its application. J. Comput. Sci. 44(3), 531–563 (2021) Zhang, Y., Dong, Z., Meng, X.: Research on personalized advertisement recommendation system and its application. J. Comput. Sci. 44(3), 531–563 (2021)
2.
Zurück zum Zitat Sun, J.: Personalized music recommendation algorithm based on spark platform. Comput. Intell. Neurosci. 2022, 1–19 (2022) Sun, J.: Personalized music recommendation algorithm based on spark platform. Comput. Intell. Neurosci. 2022, 1–19 (2022)
3.
Zurück zum Zitat Wang, Z.H., Hou, D.Z.: Research on book recommendation algorithm based on collaborative filtering and interest degree. Wirel. Commun. Mob. Comput. 2021, 1–7 (2021) Wang, Z.H., Hou, D.Z.: Research on book recommendation algorithm based on collaborative filtering and interest degree. Wirel. Commun. Mob. Comput. 2021, 1–7 (2021)
4.
Zurück zum Zitat Liu, J., Choi, W.-H., Liu, J.: Personalized movie recommendation method based on deep learning. Math. Probl. Eng. 2021, 1–12 (2021) Liu, J., Choi, W.-H., Liu, J.: Personalized movie recommendation method based on deep learning. Math. Probl. Eng. 2021, 1–12 (2021)
5.
Zurück zum Zitat Leng, Y.L.C., Lu, Q.: A review of collaborative filtering recommendation techniques. Pattern Recognit. Artif. Intell. 2014, 720–734 (2014) Leng, Y.L.C., Lu, Q.: A review of collaborative filtering recommendation techniques. Pattern Recognit. Artif. Intell. 2014, 720–734 (2014)
6.
Zurück zum Zitat Yaxiong, C.: Research on Retrieval Technology Based on Deep Learning. University of Chinese Academy of Sciences (Xi’an Institute of Optical Precision Machinery, Chinese Academy of Sciences), vol. 2020 (2020) Yaxiong, C.: Research on Retrieval Technology Based on Deep Learning. University of Chinese Academy of Sciences (Xi’an Institute of Optical Precision Machinery, Chinese Academy of Sciences), vol. 2020 (2020)
7.
Zurück zum Zitat Guo, G., Meng, Y., Zhang, Y., Han, C., Li, Y.: Visual semantic image recommendation. IEEE Access 7, 33424–33433 (2019)CrossRef Guo, G., Meng, Y., Zhang, Y., Han, C., Li, Y.: Visual semantic image recommendation. IEEE Access 7, 33424–33433 (2019)CrossRef
8.
Zurück zum Zitat Geng, X., Zhang, H., Bian, J., Chua, T.-S.: Learning image and user features for recommendation in social networks. In: ICCV Geng, X., Zhang, H., Bian, J., Chua, T.-S.: Learning image and user features for recommendation in social networks. In: ICCV
9.
Zurück zum Zitat Wu, L., Chen, L., Hong, R., Fu, Y., Xie, X., Wang, M.: A hierarchical attention model for social contextual image recommendation. IEEE Trans. Knowl. Data Eng. 32(10), 1854–1867 (2019)CrossRef Wu, L., Chen, L., Hong, R., Fu, Y., Xie, X., Wang, M.: A hierarchical attention model for social contextual image recommendation. IEEE Trans. Knowl. Data Eng. 32(10), 1854–1867 (2019)CrossRef
10.
Zurück zum Zitat Jian, M., Guo, J., Shi, G., Wu, L., Wang, Z.: Multimodal collaborative graph for image recommendation. Appl. Intell. 53, 560–573 (2022)CrossRef Jian, M., Guo, J., Shi, G., Wu, L., Wang, Z.: Multimodal collaborative graph for image recommendation. Appl. Intell. 53, 560–573 (2022)CrossRef
11.
Zurück zum Zitat Jun, H., Caiqing, Z., Xiaozhen, L., Dehai, Z.: Survey of research on multimodal fusion technology for deep learning. Comput. Eng. 46(5), 1–11 (2020) Jun, H., Caiqing, Z., Xiaozhen, L., Dehai, Z.: Survey of research on multimodal fusion technology for deep learning. Comput. Eng. 46(5), 1–11 (2020)
12.
Zurück zum Zitat Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:1710.10903 (2017) Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:​1710.​10903 (2017)
13.
Zurück zum Zitat Tan Xinyuan, P.S.: Research on a heterogeneous graph neural network model for aggregating high-order neighbour nodes. Small Micro. Comput. Syst. 43, 1–8 (2022) Tan Xinyuan, P.S.: Research on a heterogeneous graph neural network model for aggregating high-order neighbour nodes. Small Micro. Comput. Syst. 43, 1–8 (2022)
14.
Zurück zum Zitat Yuan, J., Cao, M., Cheng, H., Yu, H., Xie, J., Wang, C.: A unified structure learning framework for graph attention networks. Neurocomputing 495, 194–204 (2022)CrossRef Yuan, J., Cao, M., Cheng, H., Yu, H., Xie, J., Wang, C.: A unified structure learning framework for graph attention networks. Neurocomputing 495, 194–204 (2022)CrossRef
15.
Zurück zum Zitat Zhao, J., Zhuang, F., Ao, X., He, Q., Jiang, H., Ma, L.: Survey of collaborative filtering recommender systems. J. Cyber Secur. 6(5), 17–34 (2021) Zhao, J., Zhuang, F., Ao, X., He, Q., Jiang, H., Ma, L.: Survey of collaborative filtering recommender systems. J. Cyber Secur. 6(5), 17–34 (2021)
16.
Zurück zum Zitat Cao, D., Miao, L., Rong, H., Qin, Z., Nie, L.: Hashtag our stories: hashtag recommendation for micro-videos via harnessing multiple modalities. Knowl. Based Syst. 203, 106114 (2020)CrossRef Cao, D., Miao, L., Rong, H., Qin, Z., Nie, L.: Hashtag our stories: hashtag recommendation for micro-videos via harnessing multiple modalities. Knowl. Based Syst. 203, 106114 (2020)CrossRef
17.
Zurück zum Zitat Sun, R., Cao, X., Zhao, Y., Wan, J., Zhou, K., Zhang, F., Wang, Z., Zheng, K.: Multi-modal knowledge graphs for recommender systems. In: CIKM Sun, R., Cao, X., Zhao, Y., Wan, J., Zhou, K., Zhang, F., Wang, Z., Zheng, K.: Multi-modal knowledge graphs for recommender systems. In: CIKM
18.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556 (2014)
19.
Zurück zum Zitat Zhang, S., Yao, Y., Xu, F., Tong, H., Yan, X., Lu, J.: Hashtag recommendation for photo sharing services. In: AAAI Zhang, S., Yao, Y., Xu, F., Tong, H., Yan, X., Lu, J.: Hashtag recommendation for photo sharing services. In: AAAI
20.
Zurück zum Zitat Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al.: Learning transferable visual models from natural language supervision. In: International Conference on Machine Learning, pp. 8748–8763. PMLR (2021) Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al.: Learning transferable visual models from natural language supervision. In: International Conference on Machine Learning, pp. 8748–8763. PMLR (2021)
21.
Zurück zum Zitat Ren, Y., Cheng, X., Li, X., et al.: Image description and recognition based on concept-level semantics. Comput. Sci. 2008(7), 206–212 (2008) Ren, Y., Cheng, X., Li, X., et al.: Image description and recognition based on concept-level semantics. Comput. Sci. 2008(7), 206–212 (2008)
22.
Zurück zum Zitat Kim, H.-U., Koh, Y.J., Kim, C.-S.: Pienet: Personalized image enhancement network. In: European Conference on Computer Vision, pp. 374–390. Springer (2020) Kim, H.-U., Koh, Y.J., Kim, C.-S.: Pienet: Personalized image enhancement network. In: European Conference on Computer Vision, pp. 374–390. Springer (2020)
23.
Zurück zum Zitat Hu, Y., Koren, Y., Volinsky, C.: Collaborative filtering for implicit feedback datasets. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 263–272. IEEE (2008) Hu, Y., Koren, Y., Volinsky, C.: Collaborative filtering for implicit feedback datasets. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 263–272. IEEE (2008)
24.
Zurück zum Zitat He, J., Zhang, C., Li, X., et al.: A review of research on multimodal fusion techniques for deep learning. Comput. Eng. 46(5):1–11 (2020) He, J., Zhang, C., Li, X., et al.: A review of research on multimodal fusion techniques for deep learning. Comput. Eng. 46(5):1–11 (2020)
25.
Zurück zum Zitat Ding, Y., Yu, J., Liu, B., Hu, Y., Cui, M., Wu, Q.: Mukea: multimodal knowledge extraction and accumulation for knowledge-based visual question answering. In: CVPR Ding, Y., Yu, J., Liu, B., Hu, Y., Cui, M., Wu, Q.: Mukea: multimodal knowledge extraction and accumulation for knowledge-based visual question answering. In: CVPR
26.
Zurück zum Zitat Wu, Q., Shen, Q., Luan, J., Wang, Y.: Msdtron: a high-capability multi-speaker speech synthesis system for diverse data using characteristic information. In: ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6327–6331. IEEE (2022) Wu, Q., Shen, Q., Luan, J., Wang, Y.: Msdtron: a high-capability multi-speaker speech synthesis system for diverse data using characteristic information. In: ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6327–6331. IEEE (2022)
27.
Zurück zum Zitat Liu, Z.-X., Zhang, Z.-H., Zhang, J.: A top-N recommendation method for graph attention based on multi-level and multi-perspective. Comput. Sci. 48(4), 104–110 (2021) Liu, Z.-X., Zhang, Z.-H., Zhang, J.: A top-N recommendation method for graph attention based on multi-level and multi-perspective. Comput. Sci. 48(4), 104–110 (2021)
28.
Zurück zum Zitat Tang, J., Gao, H., Liu, H.: mtrust: discerning multi-faceted trust in a connected world. In: WSDM Tang, J., Gao, H., Liu, H.: mtrust: discerning multi-faceted trust in a connected world. In: WSDM
29.
Zurück zum Zitat Ni, J., Li, J., McAuley, J.: Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In: EMNLP-IJCNLP Ni, J., Li, J., McAuley, J.: Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In: EMNLP-IJCNLP
30.
Zurück zum Zitat Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: Nus-wide: a real-world web image database from national university of Singapore. In: ICMR Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: Nus-wide: a real-world web image database from national university of Singapore. In: ICMR
31.
Zurück zum Zitat Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: Bpr: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:1205.2618 (2012) Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: Bpr: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:​1205.​2618 (2012)
32.
Zurück zum Zitat He, X., Liao, L., Zhang, H., Nie, L., Hu, X., Chua, T.-S.: Neural collaborative filtering. In: WWW He, X., Liao, L., Zhang, H., Nie, L., Hu, X., Chua, T.-S.: Neural collaborative filtering. In: WWW
33.
Zurück zum Zitat Xue, H.-J., Dai, X., Zhang, J., Huang, S., Chen, J.: Deep matrix factorization models for recommender systems. In: IJCAI, vol. 17, pp. 3203–3209. Melbourne (2017) Xue, H.-J., Dai, X., Zhang, J., Huang, S., Chen, J.: Deep matrix factorization models for recommender systems. In: IJCAI, vol. 17, pp. 3203–3209. Melbourne (2017)
34.
Zurück zum Zitat Zheng, L., Lu, C.-T., Jiang, F., Zhang, J., Yu, P.S.: Spectral collaborative filtering. In: RecSys Zheng, L., Lu, C.-T., Jiang, F., Zhang, J., Yu, P.S.: Spectral collaborative filtering. In: RecSys
36.
Zurück zum Zitat Ma, J., Zhou, C., Cui, P., Yang, H., Zhu, W.: Learning disentangled representations for recommendation. Adv. Neural Inf. Process. Syst. 32, 1–14 (2019) Ma, J., Zhou, C., Cui, P., Yang, H., Zhu, W.: Learning disentangled representations for recommendation. Adv. Neural Inf. Process. Syst. 32, 1–14 (2019)
37.
Zurück zum Zitat Wang, X., He, X., Wang, M., Feng, F., Chua, T.-S.: Neural graph collaborative filtering. In: SIGIR Wang, X., He, X., Wang, M., Feng, F., Chua, T.-S.: Neural graph collaborative filtering. In: SIGIR
38.
Zurück zum Zitat He, X., Deng, K., Wang, X., Li, Y., Zhang, Y., Wang, M.: Lightgcn: simplifying and powering graph convolution network for recommendation. In: SIGIR He, X., Deng, K., Wang, X., Li, Y., Zhang, Y., Wang, M.: Lightgcn: simplifying and powering graph convolution network for recommendation. In: SIGIR
39.
Zurück zum Zitat Li, Q., Han, Z., Wu, X.-M.: Deeper insights into graph convolutional networks for semi-supervised learning. In: AAAI Li, Q., Han, Z., Wu, X.-M.: Deeper insights into graph convolutional networks for semi-supervised learning. In: AAAI
Metadaten
Titel
Multimodal heterogeneous graph convolutional network for image recommendation
verfasst von
Weiyi Wei
Jian Wang
Mengyu Xu
Futong Zhang
Publikationsdatum
26.07.2023
Verlag
Springer Berlin Heidelberg
Erschienen in
Multimedia Systems / Ausgabe 5/2023
Print ISSN: 0942-4962
Elektronische ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-023-01136-4

Weitere Artikel der Ausgabe 5/2023

Multimedia Systems 5/2023 Zur Ausgabe