Skip to main content
Top
Published in: Data Mining and Knowledge Discovery 6/2023

08-08-2023

Reciprocity in directed hypergraphs: measures, findings, and generators

Authors: Sunwoo Kim, Minyoung Choe, Jaemin Yoo, Kijung Shin

Published in: Data Mining and Knowledge Discovery | Issue 6/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Group interactions are prevalent in a variety of areas. Many of them, including email exchanges, chemical reactions, and bitcoin transactions, are directional, and thus they are naturally modeled as directed hypergraphs, where each hyperarc consists of the set of source nodes and the set of destination nodes. For directed graphs, which are a special case of directed hypergraphs, reciprocity has played a key role as a fundamental graph statistic in revealing organizing principles of graphs and in solving graph learning tasks. For general directed hypergraphs, however, even no systematic measure of reciprocity has been developed. In this work, we investigate the reciprocity of 11 real-world hypergraphs. To this end, we first introduce eight axioms that any reasonable measure of reciprocity should satisfy. Second, we propose HyperRec, a family of principled measures of hypergraph reciprocity that satisfy all the axioms. Third, we develop FastHyperRec, a fast and exact algorithm for computing the measures. Fourth, using them, we examine 11 real-world hypergraphs and discover patterns that distinguish them from random hypergraphs. Lastly, we propose ReDi, an intuitive generative model for directed hypergraphs exhibiting the patterns.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
1
This work is an extended version of Kim et al. (2022), which was presented at the 22nd IEEE International Conference on Data Mining (ICDM 2022). In the extended version, we introduce several theoretical extensions: (a) generalized versions of the axioms in Sect. :3.1 and a proof of Theorem 1 for the generalized versions (Appendix 1), (b) seven baseline hypergraph reciprocity measures (Sect. 3.3), (c) a proof that none of the baseline measures satisfies all the axioms (Appendix 2), and (d) proofs of Theorem 2 and Corollary 1 (Appendix 1). In addition, we conduct additional experiments regarding (a) the efficiency of FastHyperRec (Fig. 4 and Table 3 in Sect. 3.4), (b) the statistical significance of Observation 1 (Table 8 in Sect. 4.2), (c) the robustness of HyperRec with respect to the choice of \(\alpha\) (Tables 7 and 9 in Sect. 4.2), and (d) the verification of Observation 2 in 12 more real and synthetic hypergraphs (Fig. 5 in Sect. 4.2 and Fig. 7 in Sect. 5.2). At last, we provide one additional reciprocal pattern in real-world hypergraph (Observation 3: Fig. 6 in Sect. 4.2) and verify whether ReDi can reproduce this pattern.
 
2
Note that all arcs in \(R_{i}\) are used in computing the reciprocity of \(e_{i}\), and thus it does not correspond to a search space.
 
3
Note that, to improve legibility, we remove data points that lie outside the interquartile range (i.e., \([Q_{1} - 1.5(Q_{3} - Q_{1}), Q_{3} + 1.5(Q_{3} - Q_{1})]\), where \(Q_{3}\) and \(Q_{1}\) denote the third and first quantile of the corresponding distribution) from the box plots.
 
4
The search space of \(\beta _{1}\) is (a) \([0.05, 0.1, \ldots , 0.6]\) for the small datasets where \(|V \vert \le 10^{4}\), and (b) \([0.001, 0.0015, \ldots ,0.005]\) for the dense large datasets where \(|V \vert > 10^{4}\) and \({|E \vert }/{|V \vert } \ge 3\), and (c) \([0.01, 0.02, \ldots 0.15]\) for the other sparse large datasets. The search space of \(\beta _{2}\) is fixed to \(\in [0.1, 0.1, \ldots , 0.5]\) for all datasets.
 
5
As bitcoin transactions are made among randomly chosen accounts, the repetition of (partial) group interactions is rarely observed. Due to this intrinsic characteristic of the datasets, we use the degrees of individual nodes instead of the degrees of groups when ReDi and the baseline model are given the statistics from bitcoin datasets. The same strategy is also used for the baseline model when the input statistics are from the q &a server dataset. Without the strategy, the baseline model takes more than 12 hours.
 
6
Note that \((1+\frac{1}{x})^{x}\) is a well-known increasing function whose limit as \(x\rightarrow \infty\) is e. Thus, \(\log (1+\frac{1}{x})^{x}=x\log (1+\frac{1}{x})\) is also an increasing function, and since \(x'=1/x\) is decreasing at \(x>0\), \(x'\log (1+\frac{1}{x'})=\frac{1}{x}\log (1+x)\) is decreasing at \(x>0\).
 
7
We use the venues listed at Wikipedia (2022).
 
Literature
go back to reference Akoglu L, Vaz de Melo PO, Faloutsos C (2012) Quantifying reciprocity in large weighted communication networks. In: Advances in knowledge discovery and data mining: 16th Pacific-Asia conference, PAKDD 2012, Kuala Lumpur, Malaysia, May 29–June 1, 2012, Proceedings, Part II 16. Springer, pp 85–96. https://doi.org/10.1007/978-3-642-30220-6_8 Akoglu L, Vaz de Melo PO, Faloutsos C (2012) Quantifying reciprocity in large weighted communication networks. In: Advances in knowledge discovery and data mining: 16th Pacific-Asia conference, PAKDD 2012, Kuala Lumpur, Malaysia, May 29–June 1, 2012, Proceedings, Part II 16. Springer, pp 85–96. https://​doi.​org/​10.​1007/​978-3-642-30220-6_​8
go back to reference Leskovec J (2008) Dynamics of large networks. Carnegie Mellon University, Pittsburgh Leskovec J (2008) Dynamics of large networks. Carnegie Mellon University, Pittsburgh
go back to reference Ranshous S, Joslyn CA, Kreyling S et al (2017) Exchange pattern mining in the bitcoin transaction directed hypergraph. In: Financial cryptography and data security: FC 2017 international workshops, WAHC, BITCOIN, VOTING, WTSC, and TA, Sliema, Malta, April 7, 2017, Revised Selected Papers 21. Springer, pp 248–263. https://doi.org/10.1007/978-3-319-70278-0_16 Ranshous S, Joslyn CA, Kreyling S et al (2017) Exchange pattern mining in the bitcoin transaction directed hypergraph. In: Financial cryptography and data security: FC 2017 international workshops, WAHC, BITCOIN, VOTING, WTSC, and TA, Sliema, Malta, April 7, 2017, Revised Selected Papers 21. Springer, pp 248–263. https://​doi.​org/​10.​1007/​978-3-319-70278-0_​16
go back to reference Yadati N, Gao T, Asoodeh S et al (2021) Graph neural networks for soft semi-supervised learning on hypergraphs. In: Advances in knowledge discovery and data mining: 25th Pacific-Asia conference, PAKDD 2021, Virtual Event, May 11–14, 2021, Proceedings, Part I. Springer, pp 447–458. https://doi.org/10.1007/978-3-030-75762-5_36 Yadati N, Gao T, Asoodeh S et al (2021) Graph neural networks for soft semi-supervised learning on hypergraphs. In: Advances in knowledge discovery and data mining: 25th Pacific-Asia conference, PAKDD 2021, Virtual Event, May 11–14, 2021, Proceedings, Part I. Springer, pp 447–458. https://​doi.​org/​10.​1007/​978-3-030-75762-5_​36
Metadata
Title
Reciprocity in directed hypergraphs: measures, findings, and generators
Authors
Sunwoo Kim
Minyoung Choe
Jaemin Yoo
Kijung Shin
Publication date
08-08-2023
Publisher
Springer US
Published in
Data Mining and Knowledge Discovery / Issue 6/2023
Print ISSN: 1384-5810
Electronic ISSN: 1573-756X
DOI
https://doi.org/10.1007/s10618-023-00955-3

Other articles of this Issue 6/2023

Data Mining and Knowledge Discovery 6/2023 Go to the issue

Premium Partner