Skip to main content
Erschienen in: Knowledge and Information Systems 3/2016

01.03.2016 | Regular Paper

Discovery of “comet” communities in temporal and labeled graphs Com \(^2\) 

verfasst von: Miguel Araujo, Stephan Günnemann, Spiros Papadimitriou, Christos Faloutsos, Prithwish Basu, Ananthram Swami, Evangelos E. Papalexakis, Danai Koutra

Erschienen in: Knowledge and Information Systems | Ausgabe 3/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

While the analysis of unlabeled networks has been studied extensively in the past, finding patterns in different kinds of labeled graphs is still an open challenge. Given a large edge-labeled network, e.g., a time-evolving network, how can we find interesting patterns? We propose Com \(^2\) , a novel, fast and incremental tensor analysis approach which can discover communities appearing over subsets of the labels. The method is (a) scalable, being linear on the input size, (b) general, (c) needs no user-defined parameters and (d) effective, returning results that agree with intuition. We apply our method to real datasets, including a phone call network, a computer-traffic network and a flight information network. The phone call network consists of 4 million mobile users, with 51 million edges (phone calls), over 14 days, while the flights dataset consists of 7733 airports and 5995 airline companies flying 67,663 different routes. We show that Com \(^2\)  spots intuitive patterns regarding edge labels that carry temporal or other discrete information. Our findings include large “star”-like patterns, near-bipartite cores, as well as tiny groups (five users), calling each other hundreds of times within a few days. We also show that we are able to automatically identify competing airline companies.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
We tested different methods with no significant differences found in the results since the subsequent steps of growing and shrinking lead to the selection of the most relevant edges and the removal of irrelevant ones. Selecting the edge (ijk) with highest \(min(a_i, b_j, c_k)\) provides a good initial seed.
 
Literatur
1.
Zurück zum Zitat Aggarwal C, Subbian K (2014) Evolutionary network analysis: a survey. ACM Comput Surv 47(1):10:1–10:36CrossRef Aggarwal C, Subbian K (2014) Evolutionary network analysis: a survey. ACM Comput Surv 47(1):10:1–10:36CrossRef
2.
Zurück zum Zitat Araujo M, Günnemann S, Mateos G, Faloutsos C (2014) Beyond blocks: hyperbolic community detection. ECML PKDD 8724:50–65 Araujo M, Günnemann S, Mateos G, Faloutsos C (2014) Beyond blocks: hyperbolic community detection. ECML PKDD 8724:50–65
3.
Zurück zum Zitat Araujo M, Papadimitriou S, Günnemann S, Faloutsos C, Basu P, Swami A, Papalexakis EE, Koutra D (2014) Com2: fast automatic discovery of temporal (‘comet’) communities. PAKDD 8444:271–283 Araujo M, Papadimitriou S, Günnemann S, Faloutsos C, Basu P, Swami A, Papalexakis EE, Koutra D (2014) Com2: fast automatic discovery of temporal (‘comet’) communities. PAKDD 8444:271–283
4.
Zurück zum Zitat Boden B, Günnemann S, Hoffmann H, Seidl T (2012) Mining coherent subgraphs in multi-layer graphs with edge labels. In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining 1258–1266 Boden B, Günnemann S, Hoffmann H, Seidl T (2012) Mining coherent subgraphs in multi-layer graphs with edge labels. In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining 1258–1266
5.
Zurück zum Zitat Boden B, Günnemann S, Hoffmann H, Seidl T (2013) RMiCS: a robust approach for mining coherent subgraphs in edge-labeled multi-layer graphs. In: Proceedings of the 25th international conference on scientific and statistical database management 1–23 Boden B, Günnemann S, Hoffmann H, Seidl T (2013) RMiCS: a robust approach for mining coherent subgraphs in edge-labeled multi-layer graphs. In: Proceedings of the 25th international conference on scientific and statistical database management 1–23
6.
Zurück zum Zitat Carroll J, Chang J-J (1970) Analysis of individual differences in multidimensional scaling via an n-way generalization of “eckart-young” decomposition. Psychometrika 35(3):283–319CrossRefMATH Carroll J, Chang J-J (1970) Analysis of individual differences in multidimensional scaling via an n-way generalization of “eckart-young” decomposition. Psychometrika 35(3):283–319CrossRefMATH
7.
Zurück zum Zitat Dunlavy DM, Kolda TG, Acar E (2011) Temporal link prediction using matrix and tensor factorizations. TKDD 5(2):10CrossRef Dunlavy DM, Kolda TG, Acar E (2011) Temporal link prediction using matrix and tensor factorizations. TKDD 5(2):10CrossRef
8.
Zurück zum Zitat Flake GW, Lawrence S, Giles CL (2000) Efficient identification of web communities. KDD 150–160 Flake GW, Lawrence S, Giles CL (2000) Efficient identification of web communities. KDD 150–160
10.
Zurück zum Zitat Gkantsidis C, Mihail M, Zegura EW (2003) Spectral analysis of internet topologies. INFOCOM 1:364–374 Gkantsidis C, Mihail M, Zegura EW (2003) Spectral analysis of internet topologies. INFOCOM 1:364–374
11.
Zurück zum Zitat Grünwald PD (2007) The minimum description length principle. The MIT Press, Cambridge Grünwald PD (2007) The minimum description length principle. The MIT Press, Cambridge
12.
Zurück zum Zitat Günnemann S, Färber I, Boden B, Seidl T (2014) Gamer: a synthesis of subspace clustering and dense subgraph mining. Knowl Inf Syst 40(2):243–278CrossRef Günnemann S, Färber I, Boden B, Seidl T (2014) Gamer: a synthesis of subspace clustering and dense subgraph mining. Knowl Inf Syst 40(2):243–278CrossRef
13.
Zurück zum Zitat Günnemann S, Färber I, Raubach S, Seidl T (2013) Spectral subspace clustering for graphs with feature vectors. In: IEEE 13th international conferance on data mining 231–240 Günnemann S, Färber I, Raubach S, Seidl T (2013) Spectral subspace clustering for graphs with feature vectors. In: IEEE 13th international conferance on data mining 231–240
14.
Zurück zum Zitat Harshman R (1970) Foundations of the PARAFAC procedure: models and conditions for an “explanatory” multimodal factor analysis. UCLA Work Pap Phon 16:1–84 Harshman R (1970) Foundations of the PARAFAC procedure: models and conditions for an “explanatory” multimodal factor analysis. UCLA Work Pap Phon 16:1–84
15.
Zurück zum Zitat Johnson DS, Krishnan S, Chhugani J, Kumar S, Venkatasubramanian S (2004) Compressing large boolean matrices using reordering techniques. VLDB 30:13–23 Johnson DS, Krishnan S, Chhugani J, Kumar S, Venkatasubramanian S (2004) Compressing large boolean matrices using reordering techniques. VLDB 30:13–23
16.
Zurück zum Zitat Karypis G, Kumar V (1995) Metis: unstructured graph partitioning and sparse matrix ordering system. Tech Rep Karypis G, Kumar V (1995) Metis: unstructured graph partitioning and sparse matrix ordering system. Tech Rep
17.
Zurück zum Zitat Kolda T, Bader B (2009) Tensor decompositions and applications. SIAM Rev 51(3):455–500 Kolda T, Bader B (2009) Tensor decompositions and applications. SIAM Rev 51(3):455–500
18.
Zurück zum Zitat Kolda TG, Bader BW, Kenny JP (2005) Higher-order web link analysis using multilinear algebra. In: Fifth IEEE international conference on data mining 242–249 Kolda TG, Bader BW, Kenny JP (2005) Higher-order web link analysis using multilinear algebra. In: Fifth IEEE international conference on data mining 242–249
19.
Zurück zum Zitat Koutra D, Kang U, Vreeken J, Faloutsos C (2014) VoG: summarizing and understanding large graphs. In: Proceedings of the 2014 SIAM international conference on data mining 91–99 Koutra D, Kang U, Vreeken J, Faloutsos C (2014) VoG: summarizing and understanding large graphs. In: Proceedings of the 2014 SIAM international conference on data mining 91–99
20.
Zurück zum Zitat Koutra D, Papalexakis E, Faloutsos C (2012) Tensorsplat: spotting latent anomalies in time. In: 16th Panhellenic conference on informatics (PCI) Koutra D, Papalexakis E, Faloutsos C (2012) Tensorsplat: spotting latent anomalies in time. In: 16th Panhellenic conference on informatics (PCI)
21.
Zurück zum Zitat Kumar R, Novak J, Raghavan P, Tomkins A (2003) On the bursty evolution of blogspace. WWW, pp 568–576 Kumar R, Novak J, Raghavan P, Tomkins A (2003) On the bursty evolution of blogspace. WWW, pp 568–576
22.
Zurück zum Zitat Leskovec J, Kleinberg J, Faloutsos C (2007) Graph evolution: densification and shrinking diameters. IEEE TKDD 1(1):917–922 Leskovec J, Kleinberg J, Faloutsos C (2007) Graph evolution: densification and shrinking diameters. IEEE TKDD 1(1):917–922
23.
Zurück zum Zitat Liu Z, Yu J, Ke Y, Lin X, Chen L (2008) Spotting significant changing subgraphs in evolving graphs. In: ICDM, pp 917–922 Liu Z, Yu J, Ke Y, Lin X, Chen L (2008) Spotting significant changing subgraphs in evolving graphs. In: ICDM, pp 917–922
24.
Zurück zum Zitat Maruhashi K, Guo F, Faloutsos C (2011) Multiaspectforensics: pattern mining on large-scale heterogeneous networks with tensor analysis. In: Proceedings of the 2011 international conference on advances in social networks analysis and mining 203–210 Maruhashi K, Guo F, Faloutsos C (2011) Multiaspectforensics: pattern mining on large-scale heterogeneous networks with tensor analysis. In: Proceedings of the 2011 international conference on advances in social networks analysis and mining 203–210
25.
Zurück zum Zitat Papalexakis E, Akoglu L, Ience D (2013) Do more views of a graph help? Community detection and clustering in multi-graphs. In: International conference on information FUSION, pp 899–905 Papalexakis E, Akoglu L, Ience D (2013) Do more views of a graph help? Community detection and clustering in multi-graphs. In: International conference on information FUSION, pp 899–905
26.
Zurück zum Zitat Papalexakis EE, Faloutsos C, Sidiropoulos ND (2012) Parcube: sparse parallelizable tensor decompositions. ECML/PKDD 1:521–536 Papalexakis EE, Faloutsos C, Sidiropoulos ND (2012) Parcube: sparse parallelizable tensor decompositions. ECML/PKDD 1:521–536
27.
Zurück zum Zitat Papalexakis EE, Sidiropoulos ND, Bro R (2013) From k-means to higher-way co-clustering: multilinear decomposition with sparse latent factors. IEEE Trans Signal Process 61(2):493–506 Papalexakis EE, Sidiropoulos ND, Bro R (2013) From k-means to higher-way co-clustering: multilinear decomposition with sparse latent factors. IEEE Trans Signal Process 61(2):493–506
28.
Zurück zum Zitat Paxson V, Floyd S (1995) Wide-area traffic: the failure of poisson modeling. IEEE/ACM Trans Netw 3:226–244CrossRef Paxson V, Floyd S (1995) Wide-area traffic: the failure of poisson modeling. IEEE/ACM Trans Netw 3:226–244CrossRef
29.
Zurück zum Zitat Prakash BA, Sridharan A, Seshadri M, Machiraju S, Faloutsos C (2010) Eigenspokes: surprising patterns and scalable community chipping in large graphs. PAKDD 6119:435–448 Prakash BA, Sridharan A, Seshadri M, Machiraju S, Faloutsos C (2010) Eigenspokes: surprising patterns and scalable community chipping in large graphs. PAKDD 6119:435–448
30.
Zurück zum Zitat Rissanen J (1983) A universal prior for integers and estimation by minimum description length. Ann Stat 11:416–431 Rissanen J (1983) A universal prior for integers and estimation by minimum description length. Ann Stat 11:416–431
31.
Zurück zum Zitat Rosvall M, Bergstrom CT (2007) An information-theoretic framework for resolving community structure in complex networks. Proc Nat Acad Sci 104(18):7327–7331CrossRef Rosvall M, Bergstrom CT (2007) An information-theoretic framework for resolving community structure in complex networks. Proc Nat Acad Sci 104(18):7327–7331CrossRef
32.
Zurück zum Zitat Sen T, Kloczkowski A, Jernigan R (2006) Functional clustering of yeast proteins from the protein-protein interaction network. BMC Bioinf 7:355–367CrossRef Sen T, Kloczkowski A, Jernigan R (2006) Functional clustering of yeast proteins from the protein-protein interaction network. BMC Bioinf 7:355–367CrossRef
33.
Zurück zum Zitat Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE PAMI 22(8):888–905CrossRef Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE PAMI 22(8):888–905CrossRef
34.
Zurück zum Zitat Sun J, Papadimitriou S, Faloutsos C, Yu PS (2007) Graphscope: parameter-free mining of large time-evolving graphs. KDD 687–696 Sun J, Papadimitriou S, Faloutsos C, Yu PS (2007) Graphscope: parameter-free mining of large time-evolving graphs. KDD 687–696
35.
Zurück zum Zitat Sun J, Tao D, Faloutsos C (2006) Beyond streams and graphs: dynamic tensor analysis. KDD, pp 374–383 Sun J, Tao D, Faloutsos C (2006) Beyond streams and graphs: dynamic tensor analysis. KDD, pp 374–383
36.
Zurück zum Zitat Tang L, Wang X, Liu H (2009) Uncovering groups via heterogeneous interaction analysis. In: Ninth IEEE international conference on data mining 503–512 Tang L, Wang X, Liu H (2009) Uncovering groups via heterogeneous interaction analysis. In: Ninth IEEE international conference on data mining 503–512
37.
Zurück zum Zitat Tantipathananandh C, Berger-Wolf TY (2011) Finding communities in dynamic social networks. ICDM, pp 1236–1241 Tantipathananandh C, Berger-Wolf TY (2011) Finding communities in dynamic social networks. ICDM, pp 1236–1241
38.
Zurück zum Zitat Wasserman S (1994) Social network analysis: methods and applications. cambridge University Press, CambridgeCrossRef Wasserman S (1994) Social network analysis: methods and applications. cambridge University Press, CambridgeCrossRef
39.
Zurück zum Zitat Wu Z, Yin W, Cao J, Xu G, Cuzzocrea A (2013) Community detection in multi-relational social networks. WISE 8181:43–56 Wu Z, Yin W, Cao J, Xu G, Cuzzocrea A (2013) Community detection in multi-relational social networks. WISE 8181:43–56
40.
Zurück zum Zitat Yang J, Leskovec J (2012) Community-affiliation graph model for overlapping network community detection. In: 12th IEEE International Conference on Data Mining 1170–1175 Yang J, Leskovec J (2012) Community-affiliation graph model for overlapping network community detection. In: 12th IEEE International Conference on Data Mining 1170–1175
Metadaten
Titel
Discovery of “comet” communities in temporal and labeled graphs Com  
verfasst von
Miguel Araujo
Stephan Günnemann
Spiros Papadimitriou
Christos Faloutsos
Prithwish Basu
Ananthram Swami
Evangelos E. Papalexakis
Danai Koutra
Publikationsdatum
01.03.2016
Verlag
Springer London
Erschienen in
Knowledge and Information Systems / Ausgabe 3/2016
Print ISSN: 0219-1377
Elektronische ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-015-0847-2

Weitere Artikel der Ausgabe 3/2016

Knowledge and Information Systems 3/2016 Zur Ausgabe

Premium Partner