Skip to main content
Erschienen in: Data Mining and Knowledge Discovery 3/2017

05.12.2016

Tiers for peers: a practical algorithm for discovering hierarchy in weighted networks

verfasst von: Nikolaj Tatti

Erschienen in: Data Mining and Knowledge Discovery | Ausgabe 3/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Interactions in many real-world phenomena can be explained by a strong hierarchical structure. Typically, this structure or ranking is not known; instead we only have observed outcomes of the interactions, and the goal is to infer the hierarchy from these observations. Discovering a hierarchy in the context of directed networks can be formulated as follows: given a graph, partition vertices into levels such that, ideally, there are only edges from upper levels to lower levels. The ideal case can only happen if the graph is acyclic. Consequently, in practice we have to introduce a penalty function that penalizes edges violating the hierarchy. A practical variant for such penalty is agony, where each violating edge is penalized based on the severity of the violation. Hierarchy minimizing agony can be discovered in https://static-content.springer.com/image/art%3A10.1007%2Fs10618-016-0485-7/MediaObjects/10618_2016_485_IEq1_HTML.gif time, and much faster in practice. In this paper we introduce several extensions to agony. We extend the definition for weighted graphs and allow a cardinality constraint that limits the number of levels. While, these are conceptually trivial extensions, current algorithms cannot handle them, nor they can be easily extended. We solve the problem by showing the connection to the capacitated circulation problem, and we demonstrate that we can compute the exact solution fast in practice for large datasets. We also introduce a provably fast heuristic algorithm that produces rankings with competitive scores. In addition, we show that we can compute agony in polynomial time for any convex penalty, and, to complete the picture, we show that minimizing hierarchy with any concave penalty is an NP-hard problem.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
The reason for the minus sign is that we expressed Circulation as a minimization problem and Capacitated circulation as a maximization problem.
 
2
Here \(\alpha \) is a fixed parameter \(1/2< \alpha < 1\), we use \(\alpha = 3/4\).
 
3
The edge can participate later when we delete edges.
 
4
Taking into account the \(\epsilon \) trick.
 
5
we will systematically denote the vertices in T with Greek letters
 
Literatur
Zurück zum Zitat Aggarwal A, Klawe M, Moran S, Shor P, Wilber R (1987) Geometric applications of a matrix-searching algorithm. Algorithmica 2(1–4):195–208MathSciNetCrossRef Aggarwal A, Klawe M, Moran S, Shor P, Wilber R (1987) Geometric applications of a matrix-searching algorithm. Algorithmica 2(1–4):195–208MathSciNetCrossRef
Zurück zum Zitat Edmonds J, Karp RM (1972) Theoretical improvements in algorithmic efficiency for network flow problems. J ACM 19(2):248–264CrossRef Edmonds J, Karp RM (1972) Theoretical improvements in algorithmic efficiency for network flow problems. J ACM 19(2):248–264CrossRef
Zurück zum Zitat Elo AE (1978) The rating of chessplayers, past and present. Arco Pub, New York Elo AE (1978) The rating of chessplayers, past and present. Arco Pub, New York
Zurück zum Zitat Even G, Naor J, Schieber B, Sudan M (1998) Approximating minimum feedback sets and multicuts in directed graphs. Algorithmica 20(2):151–174MathSciNetCrossRef Even G, Naor J, Schieber B, Sudan M (1998) Approximating minimum feedback sets and multicuts in directed graphs. Algorithmica 20(2):151–174MathSciNetCrossRef
Zurück zum Zitat Garey M, Johnson D (1979) Computers and intractability: a guide to the theory of NP-completeness. WH Freeman & Co, New YorkMATH Garey M, Johnson D (1979) Computers and intractability: a guide to the theory of NP-completeness. WH Freeman & Co, New YorkMATH
Zurück zum Zitat Gupte M, Shankar P, Li J, Muthukrishnan S, Iftode L (2011) Finding hierarchy in directed online social networks. In: Proceedings of the 20th international conference on World Wide Web, pp. 557–566 Gupte M, Shankar P, Li J, Muthukrishnan S, Iftode L (2011) Finding hierarchy in directed online social networks. In: Proceedings of the 20th international conference on World Wide Web, pp. 557–566
Zurück zum Zitat Henderson K, Gallagher B, Eliassi-Rad T, Tong H, Basu S, Akoglu L, Koutra D, Faloutsos C, Li L (2012) Rolx: structural role extraction & mining in large graphs. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1231–1239 Henderson K, Gallagher B, Eliassi-Rad T, Tong H, Basu S, Akoglu L, Koutra D, Faloutsos C, Li L (2012) Rolx: structural role extraction & mining in large graphs. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1231–1239
Zurück zum Zitat Jameson KA, Appleby MC, Freeman LC (1999) Finding an appropriate order for a hierarchy based on probabilistic dominance. Anim Behav 57:991–998CrossRef Jameson KA, Appleby MC, Freeman LC (1999) Finding an appropriate order for a hierarchy based on probabilistic dominance. Anim Behav 57:991–998CrossRef
Zurück zum Zitat Macchia L, Bonchi F, Gullo F, Chiarandini L (2013) Mining summaries of propagations. In: IEEE 13th international conference on data mining, pp. 498–507 Macchia L, Bonchi F, Gullo F, Chiarandini L (2013) Mining summaries of propagations. In: IEEE 13th international conference on data mining, pp. 498–507
Zurück zum Zitat Maiya AS, Berger-Wolf TY (2009) Inferring the maximum likelihood hierarchy in social networks. In: Proceedings IEEE CSE’09, 12th IEEE international conference on computational science and engineering, pp. 245–250 Maiya AS, Berger-Wolf TY (2009) Inferring the maximum likelihood hierarchy in social networks. In: Proceedings IEEE CSE’09, 12th IEEE international conference on computational science and engineering, pp. 245–250
Zurück zum Zitat McCallum A, Wang X, Corrada-Emmanuel A (2007) Topic and role discovery in social networks with experiments on enron and academic email. J Artif Int Res 30(1):249–272 McCallum A, Wang X, Corrada-Emmanuel A (2007) Topic and role discovery in social networks with experiments on enron and academic email. J Artif Int Res 30(1):249–272
Zurück zum Zitat Papadimitriou CH, Steiglitz K (1982) Combinatorial optimization: algorithms and complexity. Prentice-Hall, Inc, Upper Saddle RiverMATH Papadimitriou CH, Steiglitz K (1982) Combinatorial optimization: algorithms and complexity. Prentice-Hall, Inc, Upper Saddle RiverMATH
Zurück zum Zitat Ramalingam G, Reps T (1996) On the computational complexity of dynamic graph problems. Theor Comput Sci 158:233–277MathSciNetCrossRef Ramalingam G, Reps T (1996) On the computational complexity of dynamic graph problems. Theor Comput Sci 158:233–277MathSciNetCrossRef
Zurück zum Zitat Roopnarine PD, Hertog R (2013) Detailed food web networks of three Greater Antillean coral Reef systems: the Cayman Islands, Cuba, and Jamaica. Dataset Papers Ecol Roopnarine PD, Hertog R (2013) Detailed food web networks of three Greater Antillean coral Reef systems: the Cayman Islands, Cuba, and Jamaica. Dataset Papers Ecol
Zurück zum Zitat Roopnarine PD, Hertog R (2012) Data from: detailed food web networks of three Greater Antillean coral reef systems: the Cayman Islands, Cuba, and Jamaica. Dryad Digit Repos. doi:10.5061/dryad.c213h Roopnarine PD, Hertog R (2012) Data from: detailed food web networks of three Greater Antillean coral reef systems: the Cayman Islands, Cuba, and Jamaica. Dryad Digit Repos. doi:10.​5061/​dryad.​c213h
Zurück zum Zitat Tatti N (2014) Faster way to agony—discovering hierarchies in directed graphs. In: Proceeding of European conference of machine learning and knowledge discovery in databases, ECML PKDD, pp. 163–178 Tatti N (2014) Faster way to agony—discovering hierarchies in directed graphs. In: Proceeding of European conference of machine learning and knowledge discovery in databases, ECML PKDD, pp. 163–178
Zurück zum Zitat Tatti N (2015) Hierarchies in directed networks. In: Proceedings of the 15th IEEE international conference on data mining (ICDM 2015) Tatti N (2015) Hierarchies in directed networks. In: Proceedings of the 15th IEEE international conference on data mining (ICDM 2015)
Metadaten
Titel
Tiers for peers: a practical algorithm for discovering hierarchy in weighted networks
verfasst von
Nikolaj Tatti
Publikationsdatum
05.12.2016
Verlag
Springer US
Erschienen in
Data Mining and Knowledge Discovery / Ausgabe 3/2017
Print ISSN: 1384-5810
Elektronische ISSN: 1573-756X
DOI
https://doi.org/10.1007/s10618-016-0485-7

Weitere Artikel der Ausgabe 3/2017

Data Mining and Knowledge Discovery 3/2017 Zur Ausgabe

Premium Partner