Skip to main content
Top

2017 | OriginalPaper | Chapter

Optimization of Cloud Workflow Scheduling Based on Balanced Clustering

Authors : Lei Zhang, Dongjin Yu, Hongsheng Zheng

Published in: Cyberspace Safety and Security

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Scientific workflow applications consist of many fine-grained computational tasks with dependencies, whose runtime varies widely. When executing these fine-grained tasks in a cloud computing environment, significant scheduling overheads are generated. Task clustering is a key technology to reduce scheduling overhead and optimize process execution time. Unfortunately, the attempts of task clustering often cause the problems of runtime and dependency imbalance. However, the existing task clustering strategies mainly focus on how to avoid the runtime imbalance, but rarely deal with the data dependency between tasks. Without considering the data dependency, task clustering will lead to the poor degree of parallelism during task execution due to the introduced data locality. In order to address the problem of dependency imbalance, we propose Dependency Balance Clustering Algorithm (DBCA), which defines the concept of dependency correlation to measure the similarity between tasks in terms of data dependencies. The tasks with high dependency correlation are clustered together so as to avoid the dependency imbalance. We conducted the experiments on the WorkflowSim platform and compared our method with the existing task clustering method. The results showed that it significantly reduced the execution time of the whole workflow.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Da Silva, R.F., Juve, G., Deelman, E., Glatard, T., Desprez, F., Thain, D., Tovar, B., Livny, M.: Toward fine-grained online task characteristics estimation in scientific workflows. In: WORKS@ SC, pp. 58–67 (2013) Da Silva, R.F., Juve, G., Deelman, E., Glatard, T., Desprez, F., Thain, D., Tovar, B., Livny, M.: Toward fine-grained online task characteristics estimation in scientific workflows. In: WORKS@ SC, pp. 58–67 (2013)
2.
go back to reference Stratan, C., Iosup, A., Epema, D.H.: A performance study of grid workflow engines. In: Proceedings of IEEE/ACM 9th International Conference on Grid Computing, pp. 25–32. IEEE Computer Society (2008) Stratan, C., Iosup, A., Epema, D.H.: A performance study of grid workflow engines. In: Proceedings of IEEE/ACM 9th International Conference on Grid Computing, pp. 25–32. IEEE Computer Society (2008)
3.
go back to reference Chen, W., Deelman, E.: Workflow overhead analysis and optimizations. In: Proceedings of the 6th workshop on Workflows in Support of Large-Scale Science, pp. 11–20. ACM (2011) Chen, W., Deelman, E.: Workflow overhead analysis and optimizations. In: Proceedings of the 6th workshop on Workflows in Support of Large-Scale Science, pp. 11–20. ACM (2011)
4.
go back to reference Muthuvelu, N., Liu, J., Soe, N.L., Venugopal, S., Sulistio, A., Buyya, R.: A dynamic job grouping-based scheduling for deploying applications with fine-grained tasks on global grids. In: Proceedings of the 2005 Australasian Workshop on Grid Computing and e-Research, vol. 44, pp. 41–48. Australian Computer Society, Inc. (2005) Muthuvelu, N., Liu, J., Soe, N.L., Venugopal, S., Sulistio, A., Buyya, R.: A dynamic job grouping-based scheduling for deploying applications with fine-grained tasks on global grids. In: Proceedings of the 2005 Australasian Workshop on Grid Computing and e-Research, vol. 44, pp. 41–48. Australian Computer Society, Inc. (2005)
5.
go back to reference Muthuvelu, N., Chai, I., Eswaran, C.: An adaptive and parameterized job grouping algorithm for scheduling grid jobs. In: Proceedings of ICACT 10th International Conference on Advanced Communication Technology, pp. 975–980. IEEE (2008) Muthuvelu, N., Chai, I., Eswaran, C.: An adaptive and parameterized job grouping algorithm for scheduling grid jobs. In: Proceedings of ICACT 10th International Conference on Advanced Communication Technology, pp. 975–980. IEEE (2008)
6.
go back to reference Muthuvelu, N., Vecchiola, C., Chai, I., Chikkannan, E., Buyya, R.: Task granularity policies for deploying bag-of-task applications on global grids. Future Gener. Comput. Syst. 29, 170–181 (2013)CrossRef Muthuvelu, N., Vecchiola, C., Chai, I., Chikkannan, E., Buyya, R.: Task granularity policies for deploying bag-of-task applications on global grids. Future Gener. Comput. Syst. 29, 170–181 (2013)CrossRef
7.
go back to reference Ang, T., Ng, W., Ling, T., Por, L., Liew, C.: A bandwidth-aware job grouping-based scheduling on grid environment. Inf. Technol. J. 8, 372–377 (2009)CrossRef Ang, T., Ng, W., Ling, T., Por, L., Liew, C.: A bandwidth-aware job grouping-based scheduling on grid environment. Inf. Technol. J. 8, 372–377 (2009)CrossRef
8.
go back to reference Liu, Q., Liao, Y.: Grouping-based fine-grained job scheduling in grid computing. In: Proceedings of the 1st International Workshop on Education Technology and Computer Science, ETCS 2009, pp. 556–559. IEEE (2009) Liu, Q., Liao, Y.: Grouping-based fine-grained job scheduling in grid computing. In: Proceedings of the 1st International Workshop on Education Technology and Computer Science, ETCS 2009, pp. 556–559. IEEE (2009)
9.
go back to reference Zhao, E., Qi, Y., Xiang, X., Chen, Y.: A data placement strategy based on genetic algorithm for scientific workflows. In: Proceedings of 2012 8th International Conference on Computational Intelligence and Security (CIS), pp. 146–149. IEEE (2012) Zhao, E., Qi, Y., Xiang, X., Chen, Y.: A data placement strategy based on genetic algorithm for scientific workflows. In: Proceedings of 2012 8th International Conference on Computational Intelligence and Security (CIS), pp. 146–149. IEEE (2012)
10.
go back to reference Deng, K., Ren, K., Song, J., Yuan, D., Xiang, Y., Chen, J.: A clustering based coscheduling strategy for efficient scientific workflow execution in cloud computing. Concurrency Comput. Pract. Exp. 25, 2523–2539 (2013)CrossRef Deng, K., Ren, K., Song, J., Yuan, D., Xiang, Y., Chen, J.: A clustering based coscheduling strategy for efficient scientific workflow execution in cloud computing. Concurrency Comput. Pract. Exp. 25, 2523–2539 (2013)CrossRef
11.
go back to reference Li, X., Zhang, L., Wu, Y., Liu, X., Zhu, E., Yi, H., Wang, F., Zhang, C., Yang, Y.: A novel workflow-level data placement strategy for data-sharing scientific cloud workflows. IEEE Trans. Serv. Comput., 1 (2016) Li, X., Zhang, L., Wu, Y., Liu, X., Zhu, E., Yi, H., Wang, F., Zhang, C., Yang, Y.: A novel workflow-level data placement strategy for data-sharing scientific cloud workflows. IEEE Trans. Serv. Comput., 1 (2016)
12.
go back to reference Chen, W., da Silva, R.F., Deelman, E., Sakellariou, R.: Using imbalance metrics to optimize task clustering in scientific workflow executions. Future Gener. Comput. Syst. 46, 69–84 (2015)CrossRef Chen, W., da Silva, R.F., Deelman, E., Sakellariou, R.: Using imbalance metrics to optimize task clustering in scientific workflow executions. Future Gener. Comput. Syst. 46, 69–84 (2015)CrossRef
13.
go back to reference Sahni, J., Vidyarthi, D.P.: Workflow-and-platform aware task clustering for scientific workflow execution in cloud environment. Future Gener. Comput. Syst. 64, 61–74 (2016)CrossRef Sahni, J., Vidyarthi, D.P.: Workflow-and-platform aware task clustering for scientific workflow execution in cloud environment. Future Gener. Comput. Syst. 64, 61–74 (2016)CrossRef
14.
go back to reference Chen, W., Da Silva, R.F., Deelman, E., Sakellariou, R.: Balanced task clustering in scientific workflows. In: Proceedings of IEEE 9th International Conference on E-Science (e-Science), pp. 188–195. IEEE (2013) Chen, W., Da Silva, R.F., Deelman, E., Sakellariou, R.: Balanced task clustering in scientific workflows. In: Proceedings of IEEE 9th International Conference on E-Science (e-Science), pp. 188–195. IEEE (2013)
15.
go back to reference Chen, W., Deelman, E.: Workflowsim: a toolkit for simulating scientific workflows in distributed environments. In: Proceedings of IEEE 8th International Conference on E-Science (e-Science), pp. 1–8. IEEE (2012) Chen, W., Deelman, E.: Workflowsim: a toolkit for simulating scientific workflows in distributed environments. In: Proceedings of IEEE 8th International Conference on E-Science (e-Science), pp. 1–8. IEEE (2012)
16.
go back to reference Bharathi, S., Chervenak, A., Deelman, E., Mehta, G., Su, M.-H., Vahi, K.: Characterization of scientific workflows. In: Proceedings of the third Workshop on Workflows in Support of Large-Scale Science, pp. 1–10. IEEE (2008) Bharathi, S., Chervenak, A., Deelman, E., Mehta, G., Su, M.-H., Vahi, K.: Characterization of scientific workflows. In: Proceedings of the third Workshop on Workflows in Support of Large-Scale Science, pp. 1–10. IEEE (2008)
Metadata
Title
Optimization of Cloud Workflow Scheduling Based on Balanced Clustering
Authors
Lei Zhang
Dongjin Yu
Hongsheng Zheng
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-69471-9_26

Premium Partner