Skip to main content
Top
Published in: Arabian Journal for Science and Engineering 9/2021

22-04-2021 | Research Article-Computer Engineering and Computer Science

D-SAG: A Distributed Sort-Based Algorithm for Graph Clustering

Authors: Yaman A. Saeed, Raed A. Ismail, Sherenaz W. Al-Haj Baddar

Published in: Arabian Journal for Science and Engineering | Issue 9/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Graph clustering has become a mainstream branch of computing due to its necessity for solving a wide range of problems nowadays. Thus, harnessing the capabilities of parallel and distributed computing has become instrumental. In this work, we introduce SAG, a quasilinear sort-based algorithm for graph clustering that maps naturally to distributed and/or parallel architectures. The main idea behind SAG is that nodes within a cluster naturally have similar adjacent nodes. Experiments on graphs with varying sizes compared SAG to its distributed counter-part, D-SAG, in terms of execution time, space, speedup, efficiency, and cost. Results showed the superiority of D-SAG in terms of execution time for graphs with more than 0.2 × 106 nodes. Moreover, the best speedup D-SAG achieved was 3.7-fold for synthetic graphs and 3.96-fold for real-world graphs, both using 6 computers.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Kurasova, O.; Cius, V.; Medvedev, V.; Cka, R.; Stefanovi, P.: Strategies for big data clustering. In: IEEE 26th International Conference on Tools with Artificial, Limassol (2014) Kurasova, O.; Cius, V.; Medvedev, V.; Cka, R.; Stefanovi, P.: Strategies for big data clustering. In: IEEE 26th International Conference on Tools with Artificial, Limassol (2014)
2.
go back to reference Lin, S.; Li, F.; Tian, E.; Fu, Y.; Li, D.: Clustering load profiles for demand response applications. IEEE Trans. Smart Grid 10(2), 1599–1607 (2019)CrossRef Lin, S.; Li, F.; Tian, E.; Fu, Y.; Li, D.: Clustering load profiles for demand response applications. IEEE Trans. Smart Grid 10(2), 1599–1607 (2019)CrossRef
3.
go back to reference Saxena, A.; Prasard, M.; Gupta, A.; Bharill, N.; Patel, O.P.; Tiwari, A.; Er, M.J.; Ding, W.; Lin, C.-T.: A review of clustering techniques and developments. Neurocomputing 267, 664–681 (2017)CrossRef Saxena, A.; Prasard, M.; Gupta, A.; Bharill, N.; Patel, O.P.; Tiwari, A.; Er, M.J.; Ding, W.; Lin, C.-T.: A review of clustering techniques and developments. Neurocomputing 267, 664–681 (2017)CrossRef
4.
go back to reference Sibaldo, M.; Carvalho, T.D.; Ren, T., Cavalcanti, G.: Recommender system based on modularity. In: Proceedings of the 2014 Recommender Systems Challenge, Foster City (2014) Sibaldo, M.; Carvalho, T.D.; Ren, T., Cavalcanti, G.: Recommender system based on modularity. In: Proceedings of the 2014 Recommender Systems Challenge, Foster City (2014)
5.
go back to reference Al-Adwan, A.; Zaghloul, R.; Mahafzah, B.A.; Sharieh, A.: Parallel quicksort algorithm on OTIS hyper hexa-cell optoelectronic architecture. J. Parallel Distrib. Comput. 141, 61–73 (2020)CrossRef Al-Adwan, A.; Zaghloul, R.; Mahafzah, B.A.; Sharieh, A.: Parallel quicksort algorithm on OTIS hyper hexa-cell optoelectronic architecture. J. Parallel Distrib. Comput. 141, 61–73 (2020)CrossRef
6.
go back to reference Al-Haj Baddar, S.W.; Mahafzah, B.A.: Bitonic sort on a chained-cubic tree interconnection network. J. Parallel Distrib. Comput. 74(1), 1744–1761 (2014)CrossRef Al-Haj Baddar, S.W.; Mahafzah, B.A.: Bitonic sort on a chained-cubic tree interconnection network. J. Parallel Distrib. Comput. 74(1), 1744–1761 (2014)CrossRef
7.
go back to reference Chen, Z.; Ji, H.: Graph-based clustering for computational linguistics, a survey. In: Proceedings of the 2010 Workshop on Graph-based Methods for Natural Language Processing, Vancouver, July 16th 2010 Chen, Z.; Ji, H.: Graph-based clustering for computational linguistics, a survey. In: Proceedings of the 2010 Workshop on Graph-based Methods for Natural Language Processing, Vancouver, July 16th 2010
8.
go back to reference Shiokawa, H.; Fujiwara, Y.; Onizuka, M.: Fast algorithm for modularity-based graph clustering. In: Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, Washington (2013) Shiokawa, H.; Fujiwara, Y.; Onizuka, M.: Fast algorithm for modularity-based graph clustering. In: Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, Washington (2013)
9.
go back to reference Karypis, G.; Eui-Hong, S.; Vipin, K.: Chameleon: hierarchical clustering using dynamic modeling. Computer 32(8), 68–75 (1999)CrossRef Karypis, G.; Eui-Hong, S.; Vipin, K.: Chameleon: hierarchical clustering using dynamic modeling. Computer 32(8), 68–75 (1999)CrossRef
10.
go back to reference Barton, T.; Bruna, T.; Kordik, P.: Chameleon 2: an improved graph-based clustering algorithm. ACM Trans. Knowl. Discov. 13(1), 10:1-10:27 (2019) Barton, T.; Bruna, T.; Kordik, P.: Chameleon 2: an improved graph-based clustering algorithm. ACM Trans. Knowl. Discov. 13(1), 10:1-10:27 (2019)
11.
go back to reference Mendes, R.; De Santiago, R.; Lamb, L.: Novel parallel anytime A* for graph and network clustering. In: IEEE Congress on Evolutionary Computation (CEC), Rio de Janeiro (2018) Mendes, R.; De Santiago, R.; Lamb, L.: Novel parallel anytime A* for graph and network clustering. In: IEEE Congress on Evolutionary Computation (CEC), Rio de Janeiro (2018)
12.
go back to reference Li, Z.; Zhang, S.; Wang, R.; Zhang, X.; Chen, L.: Quantitative function for community detection. Phys. Rev. 77(3), 036109–036109 (2008) Li, Z.; Zhang, S.; Wang, R.; Zhang, X.; Chen, L.: Quantitative function for community detection. Phys. Rev. 77(3), 036109–036109 (2008)
13.
go back to reference Shun, J.; Roosta-Khorasani, F.; Fountoulakis, K.; Mahoney, M.: Parallel local graph clustering. Proc. VLDB Endow. 9(12), 1041–1052 (2016)CrossRef Shun, J.; Roosta-Khorasani, F.; Fountoulakis, K.; Mahoney, M.: Parallel local graph clustering. Proc. VLDB Endow. 9(12), 1041–1052 (2016)CrossRef
14.
go back to reference Staudt, C.; Meyerhenke, H.: Engineering parallel algorithms for community detection in massive networks. IEEE Trans. Parallel Distrib. Syst. 27(1), 171–184 (2016)CrossRef Staudt, C.; Meyerhenke, H.: Engineering parallel algorithms for community detection in massive networks. IEEE Trans. Parallel Distrib. Syst. 27(1), 171–184 (2016)CrossRef
15.
go back to reference Bader, D.A.; Meyerhenke, H.; Sanders, P.; Wagner, D. Eds. Graph Partitioning and Graph Clustering, Providence, RI, USA: AMS,: ser. Contemporary Mathematics (2013) Bader, D.A.; Meyerhenke, H.; Sanders, P.; Wagner, D. Eds. Graph Partitioning and Graph Clustering, Providence, RI, USA: AMS,: ser. Contemporary Mathematics (2013)
16.
go back to reference Zeng, J.; Yu, H.: Parallel modularity-based community detection on large-scale graphs. In: IEEE International Conference on Cluster Computing, Chicago (2015) Zeng, J.; Yu, H.: Parallel modularity-based community detection on large-scale graphs. In: IEEE International Conference on Cluster Computing, Chicago (2015)
17.
go back to reference Jaccard, P.: The distribution of the flora of the alpine zone. New Phytol. 11(2), 37–50 (1912)CrossRef Jaccard, P.: The distribution of the flora of the alpine zone. New Phytol. 11(2), 37–50 (1912)CrossRef
18.
go back to reference Kumar, V.; Grama, A.; Gupta, A.; Karypis, G.: Introduction to Parallel Computing, 2nd edn. Addison Wesley, New York (2003)MATH Kumar, V.; Grama, A.; Gupta, A.; Karypis, G.: Introduction to Parallel Computing, 2nd edn. Addison Wesley, New York (2003)MATH
20.
go back to reference Shafi, A.; Carpenter, B.; Baker, M.: Nested parallelism for multi-core HPC systems using Java. J. Parallel Distrib. Comput. 69(6), 532–545 (2009)CrossRef Shafi, A.; Carpenter, B.; Baker, M.: Nested parallelism for multi-core HPC systems using Java. J. Parallel Distrib. Comput. 69(6), 532–545 (2009)CrossRef
22.
go back to reference Gorke, R.; Kappes, A.; Wagner, D.: Experiments on density-constrained graph clustering. ACM J. Exp. Algorithmics 19, 1.1-1.31 (2014)MathSciNetMATH Gorke, R.; Kappes, A.; Wagner, D.: Experiments on density-constrained graph clustering. ACM J. Exp. Algorithmics 19, 1.1-1.31 (2014)MathSciNetMATH
23.
go back to reference Newman, M.E.J.: Community detection in networks: modularity optimization and maximum likelihood are equivalent, 2016. [Online]. M. E. J. Newman. Accessed 25 July 2019 Newman, M.E.J.: Community detection in networks: modularity optimization and maximum likelihood are equivalent, 2016. [Online]. M. E. J. Newman. Accessed 25 July 2019
24.
go back to reference Abdullah, M.; Abuelrub, E.; Mahafzah, B.: The chained-cubic tree interconnection network. Int. Arab J. Inf. Technol. 8(3), 334–343 (2011) Abdullah, M.; Abuelrub, E.; Mahafzah, B.: The chained-cubic tree interconnection network. Int. Arab J. Inf. Technol. 8(3), 334–343 (2011)
25.
go back to reference Mahafzah, B.; Alshraideh, M.; Abu-Kabeer, T.; Ahmad, E.F.; Hamad, N.A.: The optical chained-cubic tree interconnection network: topological structure and properties. Comput. Electr. Eng. 38(2), 330–345 (2012)CrossRef Mahafzah, B.; Alshraideh, M.; Abu-Kabeer, T.; Ahmad, E.F.; Hamad, N.A.: The optical chained-cubic tree interconnection network: topological structure and properties. Comput. Electr. Eng. 38(2), 330–345 (2012)CrossRef
26.
go back to reference Mahafzah, B.; Al-Zoubi, I.: Broadcast communication operations for hyper hexa-cell interconnection network. Telecommun. Syst. 67, 73–93 (2018)CrossRef Mahafzah, B.; Al-Zoubi, I.: Broadcast communication operations for hyper hexa-cell interconnection network. Telecommun. Syst. 67, 73–93 (2018)CrossRef
27.
go back to reference Mahafzah, B.; Sleit, A.; Hamad, N.A.; Ahmad, E.F.; Abu-Kabeer, T.M.: The OTIS hyper hexa-cell optoelectronic architecture. Computing 94, 411–432 (2012)MathSciNetCrossRef Mahafzah, B.; Sleit, A.; Hamad, N.A.; Ahmad, E.F.; Abu-Kabeer, T.M.: The OTIS hyper hexa-cell optoelectronic architecture. Computing 94, 411–432 (2012)MathSciNetCrossRef
Metadata
Title
D-SAG: A Distributed Sort-Based Algorithm for Graph Clustering
Authors
Yaman A. Saeed
Raed A. Ismail
Sherenaz W. Al-Haj Baddar
Publication date
22-04-2021
Publisher
Springer Berlin Heidelberg
Published in
Arabian Journal for Science and Engineering / Issue 9/2021
Print ISSN: 2193-567X
Electronic ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-021-05664-x

Other articles of this Issue 9/2021

Arabian Journal for Science and Engineering 9/2021 Go to the issue

Research Article-Computer Engineering and Computer Science

A New Set of Mutation Operators for Dragonfly Algorithm

Research Article-Computer Engineering and Computer Science

P-STORE: Extension of STORE Methodology to Elicit Privacy Requirements

Research Article-Computer Engineering and Computer Science

JayaL: A Novel Jaya Algorithm Based on Elite Local Search for Optimization Problems

Research Article-Computer Engineering and Computer Science

Fake News Detection Using BERT Model with Joint Learning

Research Article-Computer Engineering and Computer Science

Quantum Algorithm to Identify Division Property of a Multiset

Premium Partners