Skip to main content
Top
Published in: Computing 2/2023

27-11-2022 | Regular Paper

HaSGP: an effective graph partition method for heterogeneous-aware

Authors: Ying Zhong, Chenze Huang, Qingbiao Zhou

Published in: Computing | Issue 2/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Graph partitioning is an effective way to improve the performance of parallel computing. However, the research of graph partitioning is driven by the demand of practical applications. In this paper, we propose a streaming graph partitioning algorithm based on heterogeneous-aware for distributed heterogeneous computing environment. It not only considers the difference of network bandwidth and the compute ability of computer nodes, but also considers the shared resources competition between cores in a high-speed network. Taking breadth first search, single source shortest path and PageRank algorithms as examples, compared with the streaming algorithms without considering the heterogeneous environment, the efficiency of graph computing is improved on average by 38%, 45.7% and 61.8% respectively. Meanwhile, in view of the low efficiency of streaming graph partitioning based on adjacent vertex structure, we design a cache management mechanism based on adjacent edge structure for streaming graph partitioning, which effectively improves the partitioning efficiency. The experimental results show that our method is suitable for graph vertex assignment in distributed heterogeneous cluster environment.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Abbas Z, Kalavri V, Carbone P, Vlassov V (2018) Streaming graph partitioning: an experimental study. Very Large Data Bases 11(11):1590–1603 Abbas Z, Kalavri V, Carbone P, Vlassov V (2018) Streaming graph partitioning: an experimental study. Very Large Data Bases 11(11):1590–1603
2.
go back to reference Buluc A, Meyerhenke H, Safro I, Sanders P, Schulz C (2013) Recent advances in graph partitioning. arXiv: Data Structures and Algorithms Buluc A, Meyerhenke H, Safro I, Sanders P, Schulz C (2013) Recent advances in graph partitioning. arXiv:​ Data Structures and Algorithms
3.
4.
go back to reference Chen Q, Yao J, Li B, Xiao Z (2019) Pisces: optimizing multi-job application execution in mapreduce. IEEE Trans Cloud Comput 7(1):273–286CrossRef Chen Q, Yao J, Li B, Xiao Z (2019) Pisces: optimizing multi-job application execution in mapreduce. IEEE Trans Cloud Comput 7(1):273–286CrossRef
5.
go back to reference Chen R, Shi J, Chen Y, Chen H (2015) Powerlyra: differentiated graph computation and partitioning on skewed graphs. ACM Trans Parallel Comput (TOPC) 5:1–39 Chen R, Shi J, Chen Y, Chen H (2015) Powerlyra: differentiated graph computation and partitioning on skewed graphs. ACM Trans Parallel Comput (TOPC) 5:1–39
6.
go back to reference Chen R, Yang M, Weng X, Choi B, He BJ, Li X (2012) Improving large graph processing on partitioned graphs in the cloud, p 3 Chen R, Yang M, Weng X, Choi B, He BJ, Li X (2012) Improving large graph processing on partitioned graphs in the cloud, p 3
7.
go back to reference Choi D, Han J, Lim J, Han J, Bok K, Yoo J (2021) Dynamic graph partitioning scheme for supporting load balancing in distributed graph environments. IEEE Access 9:65254–65265CrossRef Choi D, Han J, Lim J, Han J, Bok K, Yoo J (2021) Dynamic graph partitioning scheme for supporting load balancing in distributed graph environments. IEEE Access 9:65254–65265CrossRef
8.
go back to reference Dathathri R, Gill G, Hoang L, Dang HV, Pingali K (2018) Gluon: a communication-optimizing substrate for distributed heterogeneous graph analytics. In: Acm Sigplan conference Dathathri R, Gill G, Hoang L, Dang HV, Pingali K (2018) Gluon: a communication-optimizing substrate for distributed heterogeneous graph analytics. In: Acm Sigplan conference
9.
go back to reference El Moussawi A, Seghouani NB, Bugiotti F (2021) B-grap: balanced graph partitioning algorithm for large graphs. J Data Intell 2(2):116–135CrossRef El Moussawi A, Seghouani NB, Bugiotti F (2021) B-grap: balanced graph partitioning algorithm for large graphs. J Data Intell 2(2):116–135CrossRef
10.
go back to reference Fernandezmusoles C, Coca D, Richmond P (2019) Communication sparsity in distributed spiking neural network simulations to improve scalability. Front Neuroinform 13:19CrossRef Fernandezmusoles C, Coca D, Richmond P (2019) Communication sparsity in distributed spiking neural network simulations to improve scalability. Front Neuroinform 13:19CrossRef
11.
go back to reference Hua Q, Li Y, Yu D, Jin H (2019) Quasi-streaming graph partitioning: a game theoretical approach. IEEE Trans Parallel Distrib Syst 30(7):1643–1656CrossRef Hua Q, Li Y, Yu D, Jin H (2019) Quasi-streaming graph partitioning: a game theoretical approach. IEEE Trans Parallel Distrib Syst 30(7):1643–1656CrossRef
12.
go back to reference Ji S, Bu C, Li L, Wu X (2021) Local graph edge partitioning. ACM Trans Intell Syst Technol (TIST) 12(5):1–25CrossRef Ji S, Bu C, Li L, Wu X (2021) Local graph edge partitioning. ACM Trans Intell Syst Technol (TIST) 12(5):1–25CrossRef
13.
go back to reference Kalavri V, Vlassov V, Haridi S (2018) High-level programming abstractions for distributed graph processing. IEEE Trans Knowl Data Eng 30(2):305–324CrossRef Kalavri V, Vlassov V, Haridi S (2018) High-level programming abstractions for distributed graph processing. IEEE Trans Knowl Data Eng 30(2):305–324CrossRef
14.
go back to reference Karypis G, Kumar V (1998) A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J Sci Comput 20(1):359–392CrossRefMATH Karypis G, Kumar V (1998) A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J Sci Comput 20(1):359–392CrossRefMATH
15.
go back to reference Kokosiński Z, Bała M (2018) Solving graph partitioning problems with parallel metaheuristics. Springer, BerlinCrossRefMATH Kokosiński Z, Bała M (2018) Solving graph partitioning problems with parallel metaheuristics. Springer, BerlinCrossRefMATH
16.
go back to reference Kumar D, Raj A, Dharanipragada J (2017) Graphsteal: dynamic re-partitioning for efficient graph processing in heterogeneous clusters, pp 439–446 Kumar D, Raj A, Dharanipragada J (2017) Graphsteal: dynamic re-partitioning for efficient graph processing in heterogeneous clusters, pp 439–446
17.
go back to reference Liu S, Chen P, Hero AO (2018) Accelerated distributed dual averaging over evolving networks of growing connectivity. IEEE Trans Signal Process 66(7):1845–1859CrossRefMATH Liu S, Chen P, Hero AO (2018) Accelerated distributed dual averaging over evolving networks of growing connectivity. IEEE Trans Signal Process 66(7):1845–1859CrossRefMATH
18.
go back to reference Liu WL, Gong YJ, Chen WN, Liu Z, Wang H, Zhang J (2019) Coordinated charging scheduling of electric vehicles: a mixed-variable differential evolution approach. IEEE Trans Intell Transp Syst 21(12):5094–5109CrossRef Liu WL, Gong YJ, Chen WN, Liu Z, Wang H, Zhang J (2019) Coordinated charging scheduling of electric vehicles: a mixed-variable differential evolution approach. IEEE Trans Intell Transp Syst 21(12):5094–5109CrossRef
19.
go back to reference Liu X, Zhou Y, Hu C, Guan X (2016) Miracle: a multiple independent random walks community parallel detection algorithm for big graphs. J Netw Comput Appl 70:89–101CrossRef Liu X, Zhou Y, Hu C, Guan X (2016) Miracle: a multiple independent random walks community parallel detection algorithm for big graphs. J Netw Comput Appl 70:89–101CrossRef
20.
go back to reference Masood S, Sheng B, Li P, Shen R, Fang R, Wu Q (2018) Automatic choroid layer segmentation using normalized graph cut. IET Image Proc 12(1):53–59CrossRef Masood S, Sheng B, Li P, Shen R, Fang R, Wu Q (2018) Automatic choroid layer segmentation using normalized graph cut. IET Image Proc 12(1):53–59CrossRef
21.
go back to reference Muttipati AS, Padmaja P (2015) Analysis of large graph partitioning and frequent subgraph mining on graph data. Int J Adv Res Comput Sci 6(7):29–40 Muttipati AS, Padmaja P (2015) Analysis of large graph partitioning and frequent subgraph mining on graph data. Int J Adv Res Comput Sci 6(7):29–40
22.
go back to reference Nishimura J, Ugander J (2013) Restreaming graph partitioning: simple versatile algorithms for advanced balancing, pp 1106–1114 (2013) Nishimura J, Ugander J (2013) Restreaming graph partitioning: simple versatile algorithms for advanced balancing, pp 1106–1114 (2013)
23.
go back to reference Rahimian F, Payberah AH, Girdzijauskas S, Jelasity M, Haridi S (2013) Ja-be-ja: a distributed algorithm for balanced graph partitioning, pp 51–60 Rahimian F, Payberah AH, Girdzijauskas S, Jelasity M, Haridi S (2013) Ja-be-ja: a distributed algorithm for balanced graph partitioning, pp 51–60
24.
go back to reference Sajjad HP, Payberah AH, Rahimian F, Vlassov V, Haridi S (2016) Boosting vertex-cut partitioning for streaming graphs, pp 1–8 Sajjad HP, Payberah AH, Rahimian F, Vlassov V, Haridi S (2016) Boosting vertex-cut partitioning for streaming graphs, pp 1–8
25.
go back to reference Schulz C, Strash D (2018) Graph partitioning: formulations and applications to big data[M]. In: Encyclopedia of big data technologies. Springer, Cham, pp 1–7 Schulz C, Strash D (2018) Graph partitioning: formulations and applications to big data[M]. In: Encyclopedia of big data technologies. Springer, Cham, pp 1–7
26.
go back to reference Shi Z, Li J, Guo P, Li S, Feng D, Su Y (2017) Partitioning dynamic graph asynchronously with distributed fennel. Future Gener Comput Syst 71:32–42CrossRef Shi Z, Li J, Guo P, Li S, Feng D, Su Y (2017) Partitioning dynamic graph asynchronously with distributed fennel. Future Gener Comput Syst 71:32–42CrossRef
27.
go back to reference Stanton I, Kliot G (2012) Streaming graph partitioning for large distributed graphs, pp 1222–1230 Stanton I, Kliot G (2012) Streaming graph partitioning for large distributed graphs, pp 1222–1230
28.
go back to reference Stanton I, Kliot G (2012) Streaming graph partitioning for large distributed graphs. In: KDD Stanton I, Kliot G (2012) Streaming graph partitioning for large distributed graphs. In: KDD
29.
go back to reference Testa A, Rucco A, Notarstefano G (2018) Distributed mixed-integer linear programming via cut generation and constraint exchange. IEEE Trans Autom Control 65:1456–1467CrossRefMATH Testa A, Rucco A, Notarstefano G (2018) Distributed mixed-integer linear programming via cut generation and constraint exchange. IEEE Trans Autom Control 65:1456–1467CrossRefMATH
30.
go back to reference Tsourakakis CE, Gkantsidis C, Radunovic B, Vojnovic M (2014) Fennel: streaming graph partitioning for massive scale graphs, pp 333–342 Tsourakakis CE, Gkantsidis C, Radunovic B, Vojnovic M (2014) Fennel: streaming graph partitioning for massive scale graphs, pp 333–342
31.
go back to reference Wang G, Ng TSE (2010) The impact of virtualization on network performance of amazon ec2 data center, pp 1163–1171 Wang G, Ng TSE (2010) The impact of virtualization on network performance of amazon ec2 data center, pp 1163–1171
32.
go back to reference Wang W, Du S, Guo Z, Luo L (2015) Polygonal clustering analysis using multilevel graph-partition. Trans GIS 19(5):716–736CrossRef Wang W, Du S, Guo Z, Luo L (2015) Polygonal clustering analysis using multilevel graph-partition. Trans GIS 19(5):716–736CrossRef
33.
go back to reference Washburne AD, Silverman JD, Morton JT, Becker DJ, Crowley DE, Mukherjee S, David LA, Plowright RK (2019) Phylofactorization: a graph partitioning algorithm to identify phylogenetic scales of ecological data. Ecol Monogr 89(2):e01353CrossRef Washburne AD, Silverman JD, Morton JT, Becker DJ, Crowley DE, Mukherjee S, David LA, Plowright RK (2019) Phylofactorization: a graph partitioning algorithm to identify phylogenetic scales of ecological data. Ecol Monogr 89(2):e01353CrossRef
34.
go back to reference Xiaofeng Y, Yumei Z, Yang W (2013) The innovation of e-commerce financial service product based on cloud computing-taking alibaba finance as an example, pp 259–261 Xiaofeng Y, Yumei Z, Yang W (2013) The innovation of e-commerce financial service product based on cloud computing-taking alibaba finance as an example, pp 259–261
35.
go back to reference Xu H, Li B (2014) Repflow: minimizing flow completion times with replicated flows in data centers. In: International conference on computer communications, pp 1581–1589 Xu H, Li B (2014) Repflow: minimizing flow completion times with replicated flows in data centers. In: International conference on computer communications, pp 1581–1589
36.
go back to reference Xu N, Cui B, Chen L, Huang Z, Shao Y (2014) Heterogeneous environment aware streaming graph partitioning. IEEE Trans Knowl Data Eng 27(6):1560–1572CrossRef Xu N, Cui B, Chen L, Huang Z, Shao Y (2014) Heterogeneous environment aware streaming graph partitioning. IEEE Trans Knowl Data Eng 27(6):1560–1572CrossRef
37.
go back to reference Yi S, Kondo D, Andrzejak A (2010) Reducing costs of spot instances via checkpointing in the amazon elastic compute cloud, pp 236–243 Yi S, Kondo D, Andrzejak A (2010) Reducing costs of spot instances via checkpointing in the amazon elastic compute cloud, pp 236–243
38.
go back to reference Zhang X, Wu Y, Zhao C (2016) Mrheter: improving mapreduce performance in heterogeneous environments. Clust Comput 19(4):1691–1701CrossRef Zhang X, Wu Y, Zhao C (2016) Mrheter: improving mapreduce performance in heterogeneous environments. Clust Comput 19(4):1691–1701CrossRef
39.
go back to reference Zhao F, He X, Wang L (2020) A two-stage cooperative evolutionary algorithm with problem-specific knowledge for energy-efficient scheduling of no-wait flow-shop problem. IEEE Trans Cybern 51(11):5291–5303CrossRef Zhao F, He X, Wang L (2020) A two-stage cooperative evolutionary algorithm with problem-specific knowledge for energy-efficient scheduling of no-wait flow-shop problem. IEEE Trans Cybern 51(11):5291–5303CrossRef
40.
go back to reference Zhao F, Ma R, Wang L (2022) A self-learning discrete jaya algorithm for multiobjective energy-efficient distributed no-idle flow-shop scheduling problem in heterogeneous factory system. In: IEEE transactions on cybernetics, vol 52, no 12, pp 12675–12686 Zhao F, Ma R, Wang L (2022) A self-learning discrete jaya algorithm for multiobjective energy-efficient distributed no-idle flow-shop scheduling problem in heterogeneous factory system. In: IEEE transactions on cybernetics, vol 52, no 12, pp 12675–12686
41.
go back to reference Zheng A, Labrinidis A, Chrysanthis PK (2016) Planar: parallel lightweight architecture-aware adaptive graph repartitioning, pp 121–132 Zheng A, Labrinidis A, Chrysanthis PK (2016) Planar: parallel lightweight architecture-aware adaptive graph repartitioning, pp 121–132
42.
go back to reference Zhou S, Xing L, Zheng X, Du N, Wang L, Zhang Q (2019) A self-adaptive differential evolution algorithm for scheduling a single batch-processing machine with arbitrary job sizes and release times. IEEE Trans Cybern 51(3):1430–1442CrossRef Zhou S, Xing L, Zheng X, Du N, Wang L, Zhang Q (2019) A self-adaptive differential evolution algorithm for scheduling a single batch-processing machine with arbitrary job sizes and release times. IEEE Trans Cybern 51(3):1430–1442CrossRef
Metadata
Title
HaSGP: an effective graph partition method for heterogeneous-aware
Authors
Ying Zhong
Chenze Huang
Qingbiao Zhou
Publication date
27-11-2022
Publisher
Springer Vienna
Published in
Computing / Issue 2/2023
Print ISSN: 0010-485X
Electronic ISSN: 1436-5057
DOI
https://doi.org/10.1007/s00607-022-01132-y

Other articles of this Issue 2/2023

Computing 2/2023 Go to the issue

Premium Partner