Skip to main content

2023 | OriginalPaper | Buchkapitel

DC-RST: A Parallel Algorithm for Random Spanning Trees in Network Analytics

verfasst von : Lucas Henke, Dinesh Mehta

Erschienen in: Complex Networks and Their Applications XI

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The Mantel Test, discovered in the 1960s, determines whether two distance metrics on a graph are related. We describe DC-RST, an algorithm to accelerate a key step of a network science statistical computation associated with DimeCost, an approach that is faster the Mantel Test. DC-RST is a parallel, divide-and-conquer algorithm to compute a random spanning tree of a complete graph on n vertices. Relative to an implementation of Wilson’s sequential random-walk algorithm, on a system with 48 cores, DC-RST was up to 4X faster when first creating random partitions and up to 20X faster without this sub-step. DC-RST is shown to be a suitable replacement for Wilson’s sequential algorithm through a combination of theoretical and statistical results.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Note: distance metrics are assumed to be symmetric.
 
2
The \(L_p\)-norm for \(p \ge 1\) of a vector \(\vec {x}\) is a commonly used measure of “distance” in machine learning for clustering, and is defined by \(||\vec {x}||_p = (|x_1|^p + |x_2|^p + \cdots + |x_n|^p)^\frac{1}{p}\). Note that \(L_\infty (\vec {x}) = \max \{|x_1|, |x_2|, ..., |x_n| \}\) is the limit of the \(L_p\) norm as \(p \longrightarrow \infty \).
 
Literatur
1.
Zurück zum Zitat Mantel, N.: The detection of disease clustering and a generalized regression approach. Cancer Res. 27 (1967) Mantel, N.: The detection of disease clustering and a generalized regression approach. Cancer Res. 27 (1967)
2.
Zurück zum Zitat Bourbour, S., Mehta, D.P., Navidi, W.C.: Improved methods to compare distance metrics in networks using uniform random spanning trees (dimecost). Networks 76 (2020) Bourbour, S., Mehta, D.P., Navidi, W.C.: Improved methods to compare distance metrics in networks using uniform random spanning trees (dimecost). Networks 76 (2020)
3.
Zurück zum Zitat Anari, N., Hu, N., Saberi, A., Schild, A.: Sampling arborescences in parallel. In: Innovations in Theoretical Computer Science (2021) Anari, N., Hu, N., Saberi, A., Schild, A.: Sampling arborescences in parallel. In: Innovations in Theoretical Computer Science (2021)
4.
Zurück zum Zitat Dray, S., Dufour, A.-B.: The ade4 package: implementing the duality diagram for ecologists. J. Stat. Softw. 22(4), 1–20 (2007)CrossRef Dray, S., Dufour, A.-B.: The ade4 package: implementing the duality diagram for ecologists. J. Stat. Softw. 22(4), 1–20 (2007)CrossRef
5.
Zurück zum Zitat Schneider, J.W., Borlund, P.: Matrix comparison, part 2: measuring the resemblance between proximity measures or ordination results by use of the mantel and Procrustes statistics. J. Am. Soc. Inf. Sci. Technol. 58 (2007) Schneider, J.W., Borlund, P.: Matrix comparison, part 2: measuring the resemblance between proximity measures or ordination results by use of the mantel and Procrustes statistics. J. Am. Soc. Inf. Sci. Technol. 58 (2007)
6.
Zurück zum Zitat Sokal, R.R., Rohlf, F.J.: The comparison of dendrograms by objective methods. TAXON 11 (1962) Sokal, R.R., Rohlf, F.J.: The comparison of dendrograms by objective methods. TAXON 11 (1962)
7.
Zurück zum Zitat Corliss, J.O., Sneath, P.H.A., Sokal, R.R.: Numerical taxonomy: the principles and practice of numerical classification. Trans. Am. Microsc. Soc. 93 (1974) Corliss, J.O., Sneath, P.H.A., Sokal, R.R.: Numerical taxonomy: the principles and practice of numerical classification. Trans. Am. Microsc. Soc. 93 (1974)
8.
Zurück zum Zitat Cooper-Ellis, S., Pielou, E.C.: The interpretation of ecological data. A primer on classification and ordination. Bryologist 97 (1994) Cooper-Ellis, S., Pielou, E.C.: The interpretation of ecological data. A primer on classification and ordination. Bryologist 97 (1994)
9.
Zurück zum Zitat Ricaut, F.X., Auriol, V., Cramon-Taubadel, N.V., Keyser, C., Murail, P., Ludes, B., Crubézy, E.: Comparison between morphological and genetic data to estimate biological relationship: the case of the Egyin Gol necropolis (Mongolia). Am. J. Phys. Anthropol. 143 (2010) Ricaut, F.X., Auriol, V., Cramon-Taubadel, N.V., Keyser, C., Murail, P., Ludes, B., Crubézy, E.: Comparison between morphological and genetic data to estimate biological relationship: the case of the Egyin Gol necropolis (Mongolia). Am. J. Phys. Anthropol. 143 (2010)
10.
Zurück zum Zitat Smouse, P.E., Long, J.C., Sokal, R.R.: Multiple regression and correlation extensions of the mantel test of matrix correspondence. Syst. Zool. 35 (1986) Smouse, P.E., Long, J.C., Sokal, R.R.: Multiple regression and correlation extensions of the mantel test of matrix correspondence. Syst. Zool. 35 (1986)
11.
Zurück zum Zitat Kouri, T.M., Awale, M., Slyby, J.K., Reymond, J.L., Mehta, D.P.: “Social” network of isomers based on bond count distance: algorithms. J. Chem. Inf. Model. 54 (2014) Kouri, T.M., Awale, M., Slyby, J.K., Reymond, J.L., Mehta, D.P.: “Social” network of isomers based on bond count distance: algorithms. J. Chem. Inf. Model. 54 (2014)
12.
Zurück zum Zitat Aldous, D.J.: The random walk construction of uniform spanning trees and uniform labelled trees. SIAM J. Discr. Math. 3 (1990) Aldous, D.J.: The random walk construction of uniform spanning trees and uniform labelled trees. SIAM J. Discr. Math. 3 (1990)
13.
Zurück zum Zitat Broder, A.: Generating random spanning trees (1989) Broder, A.: Generating random spanning trees (1989)
14.
Zurück zum Zitat Lovász, L.: Random walks on graphs: a survey. Combinatorics 2 (1993) Lovász, L.: Random walks on graphs: a survey. Combinatorics 2 (1993)
15.
Zurück zum Zitat Wilson, D.B.: Generating Random Spanning Trees More Quickly Than the Cover Time, vol. Part F129452 (1996) Wilson, D.B.: Generating Random Spanning Trees More Quickly Than the Cover Time, vol. Part F129452 (1996)
16.
Zurück zum Zitat Bacher, A., Bodini, O., Hollender, A., Lumbroso, J.: Mergeshuffle: A Very Fast, Parallel Random Permutation Algorithm, vol. 2113 (2018) Bacher, A., Bodini, O., Hollender, A., Lumbroso, J.: Mergeshuffle: A Very Fast, Parallel Random Permutation Algorithm, vol. 2113 (2018)
Metadaten
Titel
DC-RST: A Parallel Algorithm for Random Spanning Trees in Network Analytics
verfasst von
Lucas Henke
Dinesh Mehta
Copyright-Jahr
2023
DOI
https://doi.org/10.1007/978-3-031-21131-7_4

Premium Partner