Skip to main content
Top
Published in: World Wide Web 4/2023

15-10-2022

How the four-nodes motifs work in heterogeneous node representation?

A case study on aminer

Authors: Siyuan Ye, Qian Li, Guangxu Mei, Shijun Liu, Li Pan

Published in: World Wide Web | Issue 4/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Heterogeneous information networks (HIN), containing different types of entities with various kinds of interaction relations in between, provide richer information than homogeneous networks. Heterogeneous motifs are induced structural subgraph patterns with semantic in HINs. There has been many works using motifs to participate in the representation learning of HINs, but rarely to understand the respective influences of motifs. Due to the rich semantic information contained in heterogeneous motifs, the effects of different structures are inconsistent in network representation. In this paper, we introduce a case study on AMiner dataset, by extracting the heterogeneous motifs with various types of nodes and edges, especially four-node motifs, the relations between those motifs also are explored. During the study process, we first construct a set of motif instances identified by subgraph isomorphism algorithm as a weighted bipartite graph and then use another semantically related node type to extract target node features from pruned adjacency matrix. Next, a series of experiments are designed to evaluate the effect of each motif and the irrelevance of different motifs. Experimental results show that embeddings by our framework achieves excellent results compared with several state-of-the-art alternatives in node classification and clustering tasks.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Footnotes
1
The main symbols used in our framework are given in Table 4.
 
Literature
1.
go back to reference Ahmed, N.K., Rossi, R.A., Lee, J.B., Kong, X., Willke, T.L., Zhou, R., Eldardiry, H.: Learning role-based graph embeddings. stat 1050, 7 (2018) Ahmed, N.K., Rossi, R.A., Lee, J.B., Kong, X., Willke, T.L., Zhou, R., Eldardiry, H.: Learning role-based graph embeddings. stat 1050, 7 (2018)
2.
go back to reference Arora, S.: A survey on graph neural networks for knowledge graph completion. arXiv preprint arXiv:2007.12374 (2020) Arora, S.: A survey on graph neural networks for knowledge graph completion. arXiv preprint arXiv:2007.12374 (2020)
3.
go back to reference Benson, A.R., Gleich, D.F., Leskovec, J.: Higher-order organization of complex networks. Science 353(6295), 163–166 (2016)CrossRef Benson, A.R., Gleich, D.F., Leskovec, J.: Higher-order organization of complex networks. Science 353(6295), 163–166 (2016)CrossRef
5.
go back to reference Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: Smote: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 16, 321–357 (2002)CrossRefMATH Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: Smote: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 16, 321–357 (2002)CrossRefMATH
7.
go back to reference Dong, Y., Chawla, N.V., Swami, A.: Metapath2vec: Scalable representation learning for heterogeneous networks. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’17, p. 135–144. Association for Computing Machinery, New York, NY, USA (2017). https://doi.org/10.1145/3097983.3098036. Dong, Y., Chawla, N.V., Swami, A.: Metapath2vec: Scalable representation learning for heterogeneous networks. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’17, p. 135–144. Association for Computing Machinery, New York, NY, USA (2017). https://​doi.​org/​10.​1145/​3097983.​3098036.​
8.
go back to reference Dong, Y., Hu, Z., Wang, K., Sun, Y., Tang, J.: Heterogeneous network representation learning. In: IJCAI 20, 4861–4867 (2020) Dong, Y., Hu, Z., Wang, K., Sun, Y., Tang, J.: Heterogeneous network representation learning. In: IJCAI 20, 4861–4867 (2020)
9.
go back to reference Fan, S., Zhu, J., Han, X., Shi, C., Hu, L., Ma, B., Li, Y.: Metapath-guided heterogeneous graph neural network for intent recommendation. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2478–2486 (2019) Fan, S., Zhu, J., Han, X., Shi, C., Hu, L., Ma, B., Li, Y.: Metapath-guided heterogeneous graph neural network for intent recommendation. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2478–2486 (2019)
11.
12.
go back to reference Hosseini, A., Chen, T., Wu, W., Sun, Y., Sarrafzadeh, M.: Heteromed: Heterogeneous information network for medical diagnosis. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pp. 763–772 (2018) Hosseini, A., Chen, T., Wu, W., Sun, Y., Sarrafzadeh, M.: Heteromed: Heterogeneous information network for medical diagnosis. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pp. 763–772 (2018)
13.
go back to reference Hou, S., Ye, Y., Song, Y., Abdulhayoglu, M.: Hindroid: An intelligent android malware detection system based on structured heterogeneous information network. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1507–1515 (2017) Hou, S., Ye, Y., Song, Y., Abdulhayoglu, M.: Hindroid: An intelligent android malware detection system based on structured heterogeneous information network. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1507–1515 (2017)
14.
go back to reference Huang, Z., Zheng, Y., Cheng, R., Sun, Y., Mamoulis, N., Li, X.: Meta structure: Computing relevance in large heterogeneous information networks. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016) Huang, Z., Zheng, Y., Cheng, R., Sun, Y., Mamoulis, N., Li, X.: Meta structure: Computing relevance in large heterogeneous information networks. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016)
15.
go back to reference Hulovatyy, Y., Chen, H., Milenković, T.: Exploring the structure and function of temporal networks with dynamic graphlets. Bioinformatics 31(12), i171–i180 (2015)CrossRef Hulovatyy, Y., Chen, H., Milenković, T.: Exploring the structure and function of temporal networks with dynamic graphlets. Bioinformatics 31(12), i171–i180 (2015)CrossRef
16.
go back to reference Kovanen, L., Karsai, M., Kaski, K., Kertész, J., Saramäki, J.: Temporal motifs in time-dependent networks. Journal of Statistical Mechanics: Theory and Experiment 2011(11), P11005 (2011)CrossRef Kovanen, L., Karsai, M., Kaski, K., Kertész, J., Saramäki, J.: Temporal motifs in time-dependent networks. Journal of Statistical Mechanics: Theory and Experiment 2011(11), P11005 (2011)CrossRef
17.
go back to reference Lichtenwalter, R.N., Chawla, N.V.: Vertex collocation profiles: subgraph counting for link analysis and prediction. In: Proceedings of the 21st international conference on World Wide Web, pp. 1019–1028 (2012) Lichtenwalter, R.N., Chawla, N.V.: Vertex collocation profiles: subgraph counting for link analysis and prediction. In: Proceedings of the 21st international conference on World Wide Web, pp. 1019–1028 (2012)
18.
go back to reference Ling, C.X., Li, C.: Data mining for direct marketing: Problems and solutions. In: KDD (1998) Ling, C.X., Li, C.: Data mining for direct marketing: Problems and solutions. In: KDD (1998)
20.
go back to reference Milenković, T., Pržulj, N.: Uncovering biological network function via graphlet degree signatures. Cancer informatics 6, CIN–S680 (2008) Milenković, T., Pržulj, N.: Uncovering biological network function via graphlet degree signatures. Cancer informatics 6, CIN–S680 (2008)
22.
go back to reference Palla, G., Derényi, I., Farkas, I., Vicsek, T.: Uncovering the overlapping community structure of complex networks in nature and society. nature 435(7043), 814–818 (2005) Palla, G., Derényi, I., Farkas, I., Vicsek, T.: Uncovering the overlapping community structure of complex networks in nature and society. nature 435(7043), 814–818 (2005)
23.
go back to reference Radicchi, F., Castellano, C., Cecconi, F., Loreto, V., Parisi, D.: Defining and identifying communities in networks. Proceedings of the national academy of sciences 101(9), 2658–2663 (2004)CrossRef Radicchi, F., Castellano, C., Cecconi, F., Loreto, V., Parisi, D.: Defining and identifying communities in networks. Proceedings of the national academy of sciences 101(9), 2658–2663 (2004)CrossRef
25.
go back to reference Rossi, R.A., Ahmed, N.K., Carranza, A., Arbour, D., Rao, A., Kim, S., Koh, E.: Heterogeneous graphlets. ACM Transactions on Knowledge Discovery from Data (TKDD) 15(1), 1–43 (2020)CrossRef Rossi, R.A., Ahmed, N.K., Carranza, A., Arbour, D., Rao, A., Kim, S., Koh, E.: Heterogeneous graphlets. ACM Transactions on Knowledge Discovery from Data (TKDD) 15(1), 1–43 (2020)CrossRef
26.
go back to reference Rossi, R.A., Ahmed, N.K., Koh, E., Kim, S., Rao, A., Yadkori, Y.A.: Hone: Higher-order network embeddings. arXiv preprint arXiv:1801.09303 (2018) Rossi, R.A., Ahmed, N.K., Koh, E., Kim, S., Rao, A., Yadkori, Y.A.: Hone: Higher-order network embeddings. arXiv preprint arXiv:1801.09303 (2018)
27.
go back to reference Sankar, A., Zhang, X., Chang, K.C.C.: Meta-gnn: Metagraph neural network for semi-supervised learning in attributed heterogeneous information networks. In: Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM ’19, p. 137–144. Association for Computing Machinery, New York, NY, USA (2019). https://doi.org/10.1145/3341161.3342859 Sankar, A., Zhang, X., Chang, K.C.C.: Meta-gnn: Metagraph neural network for semi-supervised learning in attributed heterogeneous information networks. In: Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM ’19, p. 137–144. Association for Computing Machinery, New York, NY, USA (2019). https://​doi.​org/​10.​1145/​3341161.​3342859
28.
go back to reference Shervashidze, N., Vishwanathan, S., Petri, T., Mehlhorn, K., Borgwardt, K.: Efficient graphlet kernels for large graph comparison. In: Artificial intelligence and statistics, pp. 488–495. PMLR (2009) Shervashidze, N., Vishwanathan, S., Petri, T., Mehlhorn, K., Borgwardt, K.: Efficient graphlet kernels for large graph comparison. In: Artificial intelligence and statistics, pp. 488–495. PMLR (2009)
29.
go back to reference Shi, C., Zhang, Z., Luo, P., Yu, P.S., Yue, Y., Wu, B.: Semantic path based personalized recommendation on weighted heterogeneous information networks. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 453–462 (2015) Shi, C., Zhang, Z., Luo, P., Yu, P.S., Yue, Y., Wu, B.: Semantic path based personalized recommendation on weighted heterogeneous information networks. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 453–462 (2015)
30.
go back to reference Solava, R.W., Michaels, R.P., Milenković, T.: Graphlet-based edge clustering reveals pathogen-interacting proteins. Bioinformatics 28(18), i480–i486 (2012)CrossRef Solava, R.W., Michaels, R.P., Milenković, T.: Graphlet-based edge clustering reveals pathogen-interacting proteins. Bioinformatics 28(18), i480–i486 (2012)CrossRef
31.
go back to reference Sorokin, D., Gurevych, I.: Modeling semantics with gated graph neural networks for knowledge base question answering. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 3306–3317. Association for Computational Linguistics, Santa Fe, New Mexico, USA (2018). https://aclanthology.org/C18-1280. Accessed Feb 2022 Sorokin, D., Gurevych, I.: Modeling semantics with gated graph neural networks for knowledge base question answering. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 3306–3317. Association for Computational Linguistics, Santa Fe, New Mexico, USA (2018). https://​aclanthology.​org/​C18-1280. Accessed  Feb 2022
32.
go back to reference Vishwanathan, S.V.N., Schraudolph, N.N., Kondor, R., Borgwardt, K.M.: Graph kernels. Journal of Machine Learning Research 11, 1201–1242 (2010)MathSciNetMATH Vishwanathan, S.V.N., Schraudolph, N.N., Kondor, R., Borgwardt, K.M.: Graph kernels. Journal of Machine Learning Research 11, 1201–1242 (2010)MathSciNetMATH
33.
go back to reference Wang, X., Bo, D., Shi, C., Fan, S., Ye, Y., Yu, P.S.: A survey on heterogeneous graph embedding: methods, techniques, applications and sources. arXiv preprint arXiv:2011.14867 (2020) Wang, X., Bo, D., Shi, C., Fan, S., Ye, Y., Yu, P.S.: A survey on heterogeneous graph embedding: methods, techniques, applications and sources. arXiv preprint arXiv:2011.14867 (2020)
34.
go back to reference Xiao, Y., Zhang, J., Deng, L.: Prediction of lncrna-protein interactions using hetesim scores based on heterogeneous networks. Scientific Reports 7(1), 3664 (2017)CrossRef Xiao, Y., Zhang, J., Deng, L.: Prediction of lncrna-protein interactions using hetesim scores based on heterogeneous networks. Scientific Reports 7(1), 3664 (2017)CrossRef
37.
go back to reference Zhao, J., Wang, X., Shi, C., Liu, Z., Ye, Y.: Network schema preserving heterogeneous information network embedding. In: C. Bessiere (ed.) Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20, pp. 1366–1372. International Joint Conferences on Artificial Intelligence Organization (2020). https://doi.org/10.24963/ijcai.2020/190. Main track Zhao, J., Wang, X., Shi, C., Liu, Z., Ye, Y.: Network schema preserving heterogeneous information network embedding. In: C. Bessiere (ed.) Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20, pp. 1366–1372. International Joint Conferences on Artificial Intelligence Organization (2020). https://​doi.​org/​10.​24963/​ijcai.​2020/​190.​ Main track
39.
go back to reference Zhou, Z.H., Liu, X.Y.: On multi-class cost-sensitive learning. Computational Intelligence 26 (2010) Zhou, Z.H., Liu, X.Y.: On multi-class cost-sensitive learning. Computational Intelligence 26 (2010)
Metadata
Title
How the four-nodes motifs work in heterogeneous node representation?
A case study on aminer
Authors
Siyuan Ye
Qian Li
Guangxu Mei
Shijun Liu
Li Pan
Publication date
15-10-2022
Publisher
Springer US
Published in
World Wide Web / Issue 4/2023
Print ISSN: 1386-145X
Electronic ISSN: 1573-1413
DOI
https://doi.org/10.1007/s11280-022-01115-1

Other articles of this Issue 4/2023

World Wide Web 4/2023 Go to the issue

Premium Partner