Skip to main content
Top
Published in: The Journal of Supercomputing 3/2021

02-07-2020

GS4: Graph stream summarization based on both the structure and semantics

Authors: Nosratali Ashrafi-Payaman, Mohammad Reza Kangavari, Saeid Hosseini, Amir Mohammad Fander

Published in: The Journal of Supercomputing | Issue 3/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Nowadays internet-based applications collect and distribute large datasets, which are mostly modeled by pertinent massive graphs. One solution to process such massive graphs is summarization. There are two kinds of graphs, stationary and stream. There are several algorithms to summarize stationary graphs; however, no comprehensive method has been devised to summarize stream graphs. This is because of the challenges of the graph stream, which are the high data volume and the continuous changes of data over time. To tackle such challenges, we propose a novel method based on the sliding window model that performs summarization using both the structure and vertex attributes of the input graph stream. We devise a new structure for a summary graph by considering the structural and semantical attributes that can better elucidate every heterogeneous summary graph. Moreover, our framework comprises innovative components for comparing hybrid summary graphs. To the best of our knowledge, this is the first method that summarizes a graph stream using both the structure and vertex attributes with varying contributions. Our approach also takes user directions and ontology into account. Aiming to study the efficiency and effectiveness of our proposed method, we conduct extensive experiments on two real-life datasets: American political web-logs and Amazon co-purchasing products. The experimental results confirm that compared to the existing approaches the proposed method generates graph summaries with better quality. The expected time of our proposed method in this paper (\(O(n^3)\)) has significantly enhanced the efficiency compared to the current best complexity which is \(O(n^5)\).

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Cheng H, Zhou Y, Yu JX (2011) Clustering large attributed graphs: a balance between structural and attribute similarities. ACM Trans Knowl Discov Data (TKDD) 5(2):12MathSciNet Cheng H, Zhou Y, Yu JX (2011) Clustering large attributed graphs: a balance between structural and attribute similarities. ACM Trans Knowl Discov Data (TKDD) 5(2):12MathSciNet
2.
go back to reference LeFevre K, Terzi E (2010) GraSS: Graph structure summarization. In: Proceedings of the 2010 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp 454–465 LeFevre K, Terzi E (2010) GraSS: Graph structure summarization. In: Proceedings of the 2010 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp 454–465
3.
go back to reference Shah N, Koutra D, Zou T, Gallagher B, Faloutsos C (2015) Timecrunch: Interpretable dynamic graph summarization. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, pp 1055–1064 Shah N, Koutra D, Zou T, Gallagher B, Faloutsos C (2015) Timecrunch: Interpretable dynamic graph summarization. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, pp 1055–1064
5.
go back to reference Liu Y, Safavi T, Dighe A, Koutra D (2018) Graph summarization methods and applications: a survey. ACM Comput Surv (CSUR) 51(3):62CrossRef Liu Y, Safavi T, Dighe A, Koutra D (2018) Graph summarization methods and applications: a survey. ACM Comput Surv (CSUR) 51(3):62CrossRef
6.
go back to reference Koutra D, Kang U, Vreeken J, Faloutsos C (2015) Summarizing and understanding large graphs. Stat Anal Min ASA Data Sci J 8(3):183–202MathSciNetCrossRef Koutra D, Kang U, Vreeken J, Faloutsos C (2015) Summarizing and understanding large graphs. Stat Anal Min ASA Data Sci J 8(3):183–202MathSciNetCrossRef
7.
go back to reference Navlakha S, Schatz MC, Kingsford C (2009) Revealing biological modules via graph summarization. J Comput Biol 16(2):253–264CrossRef Navlakha S, Schatz MC, Kingsford C (2009) Revealing biological modules via graph summarization. J Comput Biol 16(2):253–264CrossRef
8.
go back to reference Thor A, Anderson P, Raschid L, Navlakha S, Saha B, Khuller S, Zhang XN (2011) Link prediction for annotation graphs using graph summarization. In: International Semantic Web Conference. Springer, Berlin, pp 714–729 Thor A, Anderson P, Raschid L, Navlakha S, Saha B, Khuller S, Zhang XN (2011) Link prediction for annotation graphs using graph summarization. In: International Semantic Web Conference. Springer, Berlin, pp 714–729
9.
go back to reference Riondato M, García-Soriano D, Bonchi F (2017) Graph summarization with quality guarantees. Data Min Knowl Discov 31(2):314–349MathSciNetCrossRef Riondato M, García-Soriano D, Bonchi F (2017) Graph summarization with quality guarantees. Data Min Knowl Discov 31(2):314–349MathSciNetCrossRef
10.
go back to reference Navlakha S, Rastogi R, Shrivastava N (2008) Graph summarization with bounded error. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. ACM, pp 419–432 Navlakha S, Rastogi R, Shrivastava N (2008) Graph summarization with bounded error. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. ACM, pp 419–432
11.
go back to reference Wu Y, Yang S, Srivatsa M, Iyengar A, Yan X (2013) Summarizing answer graphs induced by keyword queries. Proc VLDB Endow 6(14):1774–1785CrossRef Wu Y, Yang S, Srivatsa M, Iyengar A, Yan X (2013) Summarizing answer graphs induced by keyword queries. Proc VLDB Endow 6(14):1774–1785CrossRef
12.
go back to reference Bei Y, Lin Z, Chen D (2016) Summarizing scale-free networks based on virtual and real links. Phys A Stat Mech Appl 444:360–372CrossRef Bei Y, Lin Z, Chen D (2016) Summarizing scale-free networks based on virtual and real links. Phys A Stat Mech Appl 444:360–372CrossRef
13.
go back to reference Tian Y, Hankins RA, Patel JM (2008) Efficient aggregation for graph summarization. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. ACM, pp 567–580 Tian Y, Hankins RA, Patel JM (2008) Efficient aggregation for graph summarization. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. ACM, pp 567–580
14.
go back to reference Zhang N, Tian Y, Patel JM (2010) Discovery-driven graph summarization. In: IEEE 26th International Conference on Data Engineering (ICDE 2010). IEEE, pp 880–891 Zhang N, Tian Y, Patel JM (2010) Discovery-driven graph summarization. In: IEEE 26th International Conference on Data Engineering (ICDE 2010). IEEE, pp 880–891
16.
go back to reference Feigenbaum J, Kannan S, McGregor A, Suri S, Zhang J (2008) Graph distances in the data-stream model. SIAM J Comput 38(5):1709–1727MathSciNetCrossRef Feigenbaum J, Kannan S, McGregor A, Suri S, Zhang J (2008) Graph distances in the data-stream model. SIAM J Comput 38(5):1709–1727MathSciNetCrossRef
17.
go back to reference Aggarwal CC, Zhao Y, Yu PS (2010) On clustering graph streams. In: Proceedings of the 2010 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp 478–489 Aggarwal CC, Zhao Y, Yu PS (2010) On clustering graph streams. In: Proceedings of the 2010 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp 478–489
18.
go back to reference Gou X, Zou L, Zhao C, Yang T (2019) Fast and accurate graph stream summarization. In: IEEE 35th International Conference on Data Engineering (ICDE). IEEE, pp 1118–1129 Gou X, Zou L, Zhao C, Yang T (2019) Fast and accurate graph stream summarization. In: IEEE 35th International Conference on Data Engineering (ICDE). IEEE, pp 1118–1129
19.
go back to reference Hosseini S, Yin H, Zhang M, Elovici Y, Zhou X (2018) Mining subgraphs from propagation networks through temporal dynamic analysis. In: 19th IEEE International Conference on Mobile Data Management (MDM). IEEE, pp 66–75 Hosseini S, Yin H, Zhang M, Elovici Y, Zhou X (2018) Mining subgraphs from propagation networks through temporal dynamic analysis. In: 19th IEEE International Conference on Mobile Data Management (MDM). IEEE, pp 66–75
20.
go back to reference Hosseini S, Yin H, Cheung NM, Leng KP, Elovici Y, Zhou X (2018) Exploiting reshaping subgraphs from bilateral propagation graphs. In: International Conference on Database Systems for Advanced Applications. Springer, Cham, pp 342–351 Hosseini S, Yin H, Cheung NM, Leng KP, Elovici Y, Zhou X (2018) Exploiting reshaping subgraphs from bilateral propagation graphs. In: International Conference on Database Systems for Advanced Applications. Springer, Cham, pp 342–351
21.
go back to reference Ashrafi-Payaman N, Kangavari MR, Fander AM (2017) A new method for graph stream summarization based on both the structure and concepts. Open Eng 9(1):500–511CrossRef Ashrafi-Payaman N, Kangavari MR, Fander AM (2017) A new method for graph stream summarization based on both the structure and concepts. Open Eng 9(1):500–511CrossRef
22.
go back to reference Datar M, Motwani R (2007) The sliding-window computation model and results. In: Data streams. Springer, Boston, pp 149–167 Datar M, Motwani R (2007) The sliding-window computation model and results. In: Data streams. Springer, Boston, pp 149–167
23.
go back to reference Liu X, Tian Y, He Q, Lee WC, McPherson J (2014) Distributed graph summarization. In: Proceedings of the 23rd ACM International Conference on Information and Knowledge Management. ACM, pp 799–808 Liu X, Tian Y, He Q, Lee WC, McPherson J (2014) Distributed graph summarization. In: Proceedings of the 23rd ACM International Conference on Information and Knowledge Management. ACM, pp 799–808
24.
go back to reference Chen C, Lin CX, Fredrikson M, Christodorescu M, Yan X, Han J (2009) Mining graph patterns efficiently via randomized summaries. Proc VLDB Endow 2(1):742–753CrossRef Chen C, Lin CX, Fredrikson M, Christodorescu M, Yan X, Han J (2009) Mining graph patterns efficiently via randomized summaries. Proc VLDB Endow 2(1):742–753CrossRef
26.
go back to reference Tang N, Chen Q, Mitra P (2016) Graph stream summarization: from big bang to big crunch. In: Proceedings of the 2016 International Conference on Management of Data. ACM, pp 1481–1496 Tang N, Chen Q, Mitra P (2016) Graph stream summarization: from big bang to big crunch. In: Proceedings of the 2016 International Conference on Management of Data. ACM, pp 1481–1496
27.
go back to reference Khan A, Bhowmick SS, Bonchi F (2017) Summarizing static and dynamic big graphs. Proc VLDB Endow 10(12):1981–1984CrossRef Khan A, Bhowmick SS, Bonchi F (2017) Summarizing static and dynamic big graphs. Proc VLDB Endow 10(12):1981–1984CrossRef
28.
go back to reference Tsalouchidou I, Bonchi F, Morales GDF, Baeza-Yates R (2018) Scalable dynamic graph summarization. IEEE Trans Knowl Data Eng 32:360–373CrossRef Tsalouchidou I, Bonchi F, Morales GDF, Baeza-Yates R (2018) Scalable dynamic graph summarization. IEEE Trans Knowl Data Eng 32:360–373CrossRef
29.
go back to reference Lim Y, Kang U, Faloutsos C (2014) Slashburn: graph compression and mining beyond caveman communities. IEEE Trans Knowl Data Eng 26(12):3077–3089CrossRef Lim Y, Kang U, Faloutsos C (2014) Slashburn: graph compression and mining beyond caveman communities. IEEE Trans Knowl Data Eng 26(12):3077–3089CrossRef
30.
go back to reference Boldi P, Vigna S (2004) The webgraph framework I: compression techniques. In: Proceedings of the 13th International Conference on World Wide Web. ACM, pp 595–602 Boldi P, Vigna S (2004) The webgraph framework I: compression techniques. In: Proceedings of the 13th International Conference on World Wide Web. ACM, pp 595–602
31.
go back to reference Seo H, Kim H, Park K, Han Y, Lee YK (2015) Summarization technique on a compressed graph for massive graph analysis. Korean Soc Big Data Serv 2(1):25–35 Seo H, Kim H, Park K, Han Y, Lee YK (2015) Summarization technique on a compressed graph for massive graph analysis. Korean Soc Big Data Serv 2(1):25–35
32.
go back to reference Jouili S, Mili I, Tabbone S (2009) Attributed graph matching using local descriptions. International Conference on Advanced Concepts for Intelligent Vision Systems. Springer, Berlin, pp 89–99CrossRef Jouili S, Mili I, Tabbone S (2009) Attributed graph matching using local descriptions. International Conference on Advanced Concepts for Intelligent Vision Systems. Springer, Berlin, pp 89–99CrossRef
33.
go back to reference Duchenne O, Joulin A, Ponce J (2011) A graph-matching kernel for object categorization. In: International Conference on Computer Vision. IEEE, pp 1792–1799 Duchenne O, Joulin A, Ponce J (2011) A graph-matching kernel for object categorization. In: International Conference on Computer Vision. IEEE, pp 1792–1799
34.
go back to reference Ashrafi-Payaman N, Kangavari M (2017) GSSC: Graph summarization based on both structure and concepts. Int J Inf Commun Technol Res 9(1):33–44 Ashrafi-Payaman N, Kangavari M (2017) GSSC: Graph summarization based on both structure and concepts. Int J Inf Commun Technol Res 9(1):33–44
35.
go back to reference White S, Smyth P (2005) A spectral clustering approach to finding communities in graphs. In: Proceedings of the 2005 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp 274–285 White S, Smyth P (2005) A spectral clustering approach to finding communities in graphs. In: Proceedings of the 2005 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp 274–285
37.
go back to reference Dhillon IS, Guan Y, Kulis B (2004) A unified view of kernel k-means, spectral clustering and graph cuts. Computer Science Department University of Texas at Austin, Austin Dhillon IS, Guan Y, Kulis B (2004) A unified view of kernel k-means, spectral clustering and graph cuts. Computer Science Department University of Texas at Austin, Austin
38.
go back to reference Van Dongen SM (2000) Graph clustering by flow simulation. Doctoral dissertation Van Dongen SM (2000) Graph clustering by flow simulation. Doctoral dissertation
39.
go back to reference Ng AY, Jordan MI, Weiss Y (2002) On spectral clustering: analysis and an algorithm. In: Advances in neural information processing systems, pp 849–856 Ng AY, Jordan MI, Weiss Y (2002) On spectral clustering: analysis and an algorithm. In: Advances in neural information processing systems, pp 849–856
40.
go back to reference Zhou D, Burges CJ (2007) Spectral clustering and transductive learning with multiple views. In: Proceedings of the 24th International Conference on Machine Learning. ACM, pp 1159–1166 Zhou D, Burges CJ (2007) Spectral clustering and transductive learning with multiple views. In: Proceedings of the 24th International Conference on Machine Learning. ACM, pp 1159–1166
41.
go back to reference Liu J, Wang C, Danilevsky M, Han J (2013) Large-scale spectral clustering on graphs. In: Twenty-Third International Joint Conference on Artificial Intelligence Liu J, Wang C, Danilevsky M, Han J (2013) Large-scale spectral clustering on graphs. In: Twenty-Third International Joint Conference on Artificial Intelligence
42.
go back to reference Wang CD, Lai JH, Yu PS (2013) Dynamic community detection in weighted graph streams. In: Proceedings of the 2013 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp 151–161 Wang CD, Lai JH, Yu PS (2013) Dynamic community detection in weighted graph streams. In: Proceedings of the 2013 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp 151–161
43.
go back to reference Prabavathi MG, Thiagarasu V (2013) Overlapping community detection algorithms in dynamic networks: an overview Prabavathi MG, Thiagarasu V (2013) Overlapping community detection algorithms in dynamic networks: an overview
44.
go back to reference Lancichinetti A, Fortunato S (2009) Community detection algorithms: a comparative analysis. Phys Rev E 80(5):056117CrossRef Lancichinetti A, Fortunato S (2009) Community detection algorithms: a comparative analysis. Phys Rev E 80(5):056117CrossRef
45.
go back to reference Wang W, Street WN (2014) A novel algorithm for community detection and influence ranking in social networks. In: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014). IEEE, pp 555–560 Wang W, Street WN (2014) A novel algorithm for community detection and influence ranking in social networks. In: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014). IEEE, pp 555–560
46.
go back to reference Benyahia O, Largeron C, Jeudy B (2017) Community detection in dynamic graphs with missing edges. In: 11th International Conference on Research Challenges in Information Science (RCIS). IEEE, pp 372–381 Benyahia O, Largeron C, Jeudy B (2017) Community detection in dynamic graphs with missing edges. In: 11th International Conference on Research Challenges in Information Science (RCIS). IEEE, pp 372–381
47.
go back to reference Ashrafi-Payaman N, Kangavari MR (2018) Graph hybrid summarization. J AI Data Min 6(2):335–340 Ashrafi-Payaman N, Kangavari MR (2018) Graph hybrid summarization. J AI Data Min 6(2):335–340
Metadata
Title
GS4: Graph stream summarization based on both the structure and semantics
Authors
Nosratali Ashrafi-Payaman
Mohammad Reza Kangavari
Saeid Hosseini
Amir Mohammad Fander
Publication date
02-07-2020
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 3/2021
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03290-2

Other articles of this Issue 3/2021

The Journal of Supercomputing 3/2021 Go to the issue

Premium Partner