Skip to main content
Erschienen in: Cluster Computing 2/2024

11.06.2023

A novel seq2seq-based prediction approach for workflow scheduling

verfasst von: Zhongguo Yang, Mingzhu Zhang, Han Li, Weilong Ding

Erschienen in: Cluster Computing | Ausgabe 2/2024

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Workflow scheduling problems have been widely studied in cloud computing and edge computing, which aim to exploit cloud-edge resources to execute workflow tasks considering several constraints and optimization goals. However, in the era of Internet of things, the load of each computing task and the amount of data transferred between computing tasks will fluctuate, which changes the original workflow and needs for a new scheduling plan correspondingly. Existing methods are difficult to quickly cope with these dynamic changes and there are few studies applying neural networks to solve problems in workflow scheduling. To bridge the gap, we propose an innovative supervised learning method which leverages function-fitting strategy of neural networks to link the workflow environment and its optimal scheduling plan. Specifically, our approach can be divided into two steps, the first one is to generate dataset and train a seq2seq-based prediction models. In this step, we develop an algorithm for generating a significant amount of workflow instances while ensuring dataset diversity based on complexity estimation. Then we apply GA, NSGA, NSGA-NN three different types GA-based optimization methods to search optimal solutions. Finally, we construct dataset which includes {workflow, environment configurations \(\rightarrow\) obtained optimal solution} and train a seq2seq-based model. The other part is real-time generation of scheduling plans based on trained seq2seq-based model. Simulation experiments have confirmed that our method is both effective and efficient, demonstrating its ability to adapt to changes in the execution environment, workflow task load, and task data transmission, and effectively schedule tasks in real-time. The simulation results show that the seq2seq-based prediction method can approach 90% of the optimal scheme.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Meena, J., Kumar, M., Vardham, M.: Cost effective genetic algorithm for workflow scheduling in cloud under deadline constraint. IEEE Access 4, 1–1 (2016)CrossRef Meena, J., Kumar, M., Vardham, M.: Cost effective genetic algorithm for workflow scheduling in cloud under deadline constraint. IEEE Access 4, 1–1 (2016)CrossRef
2.
Zurück zum Zitat Suraj, P., Linlin, W., Siddeswara, G., Rajkumar, B.: A Particle Swarm Optimization-Based Heuristic for Scheduling Workflow Applications in Cloud Computing Environments, pp. 400–407. IEEE, New York (2010) Suraj, P., Linlin, W., Siddeswara, G., Rajkumar, B.: A Particle Swarm Optimization-Based Heuristic for Scheduling Workflow Applications in Cloud Computing Environments, pp. 400–407. IEEE, New York (2010)
3.
Zurück zum Zitat Topcuoglu, H., Hariri, S., Min-You, W.: Performance-effective and low-complexity task scheduling forheterogeneous computing. IEEE Trans. Parallel Distrib. Syst. 13, 260–274 (2002)CrossRef Topcuoglu, H., Hariri, S., Min-You, W.: Performance-effective and low-complexity task scheduling forheterogeneous computing. IEEE Trans. Parallel Distrib. Syst. 13, 260–274 (2002)CrossRef
4.
Zurück zum Zitat Bittencourt, L.F., Sakellariou, R., Madeira, M.: DAG Scheduling using a lookahead variant of the heterogeneous earliest finish time algorithm. In: 2010 18th Euromicro Conference on Parallel, Distributed and Network-Based Processing, pp. 27–34. IEEE, Italy (2010)CrossRef Bittencourt, L.F., Sakellariou, R., Madeira, M.: DAG Scheduling using a lookahead variant of the heterogeneous earliest finish time algorithm. In: 2010 18th Euromicro Conference on Parallel, Distributed and Network-Based Processing, pp. 27–34. IEEE, Italy (2010)CrossRef
5.
Zurück zum Zitat Arabnejad, H., Barbosa, J.G.: List scheduling algorithm for heterogeneous systems by an optimistic cost table. IEEE Trans. Parallel Distrib. Syst. 25(3), 682–694 (2014)CrossRef Arabnejad, H., Barbosa, J.G.: List scheduling algorithm for heterogeneous systems by an optimistic cost table. IEEE Trans. Parallel Distrib. Syst. 25(3), 682–694 (2014)CrossRef
6.
Zurück zum Zitat Panda, S.K., Jana, P.K.: Uncertainty-based QoS Min-Min algorithm for heterogeneous multi-cloud environment. Arab. J. Sci. Eng. 41(8), 3003–3025 (2016)CrossRef Panda, S.K., Jana, P.K.: Uncertainty-based QoS Min-Min algorithm for heterogeneous multi-cloud environment. Arab. J. Sci. Eng. 41(8), 3003–3025 (2016)CrossRef
7.
Zurück zum Zitat Liu, H., Ma, Y., Chen, P., Xia, Y., Ma, Y., Zheng, W., Xiaobo L.: Scheduling Multi-workflows over Edge Computing Resources with Time-Varying Performance. A Novel Probability-Mass Function and DQN-Based Approach, pp. 197–209. Springer, Cham (2020) Liu, H., Ma, Y., Chen, P., Xia, Y., Ma, Y., Zheng, W., Xiaobo L.: Scheduling Multi-workflows over Edge Computing Resources with Time-Varying Performance. A Novel Probability-Mass Function and DQN-Based Approach, pp. 197–209. Springer, Cham (2020)
8.
Zurück zum Zitat Kintsakis, A., Psomopoulos, F., Mitkas, P.: Reinforcement learning based scheduling in a workflow management system. Eng. Appl. Artif. Intell. 81, 94–106 (2019)CrossRef Kintsakis, A., Psomopoulos, F., Mitkas, P.: Reinforcement learning based scheduling in a workflow management system. Eng. Appl. Artif. Intell. 81, 94–106 (2019)CrossRef
9.
Zurück zum Zitat Pan, Y., Sun, X., Xia, Y., Chen, P., Pang, S., Li, X., Ma, Y.: A Stochastic-Performance-Distribution-Based Approach to Cloud Workflow Scheduling with Fluctuating Performance, pp. 33–48. Springer, Cham (2020) Pan, Y., Sun, X., Xia, Y., Chen, P., Pang, S., Li, X., Ma, Y.: A Stochastic-Performance-Distribution-Based Approach to Cloud Workflow Scheduling with Fluctuating Performance, pp. 33–48. Springer, Cham (2020)
10.
Zurück zum Zitat Li, W., Xia, Y., Zhou, M., Sun, X., Zhu, Q.: Fluctuation-aware and predictive workflow scheduling in cost-effective infrastructure-as-a-service clouds. IEEE Access. 2018, 1 (2018) Li, W., Xia, Y., Zhou, M., Sun, X., Zhu, Q.: Fluctuation-aware and predictive workflow scheduling in cost-effective infrastructure-as-a-service clouds. IEEE Access. 2018, 1 (2018)
11.
Zurück zum Zitat Liu, L., Huang, H., Tan, H., Cao, W., Yang, P., Li, X.Y.: Online DAG Scheduling with On-Demand Function Configuration in Edge Computing, pp 213–224. Springer, Cham (2019) Liu, L., Huang, H., Tan, H., Cao, W., Yang, P., Li, X.Y.: Online DAG Scheduling with On-Demand Function Configuration in Edge Computing, pp 213–224. Springer, Cham (2019)
12.
Zurück zum Zitat Ismayilov, G., Topcuoglu, H.: Neural network based multi-objective evolutionary algorithm for dynamic workflow scheduling in cloud computing. Future Gener. Comput. Syst. 102, 10 (2019) Ismayilov, G., Topcuoglu, H.: Neural network based multi-objective evolutionary algorithm for dynamic workflow scheduling in cloud computing. Future Gener. Comput. Syst. 102, 10 (2019)
15.
Zurück zum Zitat Arabnejad, V., Bubendorfer, K., Ng, B.: Scheduling deadline constrained scientific workflows on dynamically provisioned cloud resources. Future Gener. Comput. Syst. 75, 20 (2017) Arabnejad, V., Bubendorfer, K., Ng, B.: Scheduling deadline constrained scientific workflows on dynamically provisioned cloud resources. Future Gener. Comput. Syst. 75, 20 (2017)
16.
Zurück zum Zitat Dong, T., Xue, F., Xiao, C., Zhang, J.: Deep reinforcement learning for dynamic workflow scheduling in cloud environment. In: 2021 IEEE International Conference on Services Computing (SCC), pp. 107–115. IEEE, New York (2021) Dong, T., Xue, F., Xiao, C., Zhang, J.: Deep reinforcement learning for dynamic workflow scheduling in cloud environment. In: 2021 IEEE International Conference on Services Computing (SCC), pp. 107–115. IEEE, New York (2021)
17.
Zurück zum Zitat Xiaolong, X., Cao, H., Geng, Q., Liu, X., Dai, F., Wang, C.: Dynamic resource provisioning for workflow scheduling under uncertainty in edge computing environment. Concurr. Comput. Practice Exp. 34(14), e5674 (2022)CrossRef Xiaolong, X., Cao, H., Geng, Q., Liu, X., Dai, F., Wang, C.: Dynamic resource provisioning for workflow scheduling under uncertainty in edge computing environment. Concurr. Comput. Practice Exp. 34(14), e5674 (2022)CrossRef
18.
Zurück zum Zitat Barika, M., Garg, S., Ranjan, R.: Cost effective stream workflow scheduling to handle application structural changes. Future Gener. Comput. Syst. 112, 348–361 (2020)CrossRef Barika, M., Garg, S., Ranjan, R.: Cost effective stream workflow scheduling to handle application structural changes. Future Gener. Comput. Syst. 112, 348–361 (2020)CrossRef
19.
Zurück zum Zitat Arabnejad, V., Bubendorfer, K., Ng, B.: Dynamic multi-workflow scheduling: a deadline and cost-aware approach for commercial clouds. Future Gener. Comput. Syst. 100, 98–108 (2019)CrossRef Arabnejad, V., Bubendorfer, K., Ng, B.: Dynamic multi-workflow scheduling: a deadline and cost-aware approach for commercial clouds. Future Gener. Comput. Syst. 100, 98–108 (2019)CrossRef
20.
Zurück zum Zitat Yi, P., Wang S., Wu, L., Xia, Y., Wanbo, Z., Shanchen, P., Ziyang, Z., Peng, C., Yawen, L.: A novel approach to scheduling workflows upon cloud resources with fluctuating performance. Mobile Netw. Appl. 25(2), 690–700 (2020)CrossRef Yi, P., Wang S., Wu, L., Xia, Y., Wanbo, Z., Shanchen, P., Ziyang, Z., Peng, C., Yawen, L.: A novel approach to scheduling workflows upon cloud resources with fluctuating performance. Mobile Netw. Appl. 25(2), 690–700 (2020)CrossRef
21.
Zurück zum Zitat Shaw, R., Howley, E., Barrett, E.: Predicting the available bandwidth on intra cloud network links for deadline constrained workflow scheduling in public clouds. In: Michael, M., Antonio, V., Jianmin, W., Marc O. (ed.) Service-Oriented Computing, Lecture Notes in Computer Science, pp. 221–228. Springer International Publishing, Cham (2017) Shaw, R., Howley, E., Barrett, E.: Predicting the available bandwidth on intra cloud network links for deadline constrained workflow scheduling in public clouds. In: Michael, M., Antonio, V., Jianmin, W., Marc O. (ed.) Service-Oriented Computing, Lecture Notes in Computer Science, pp. 221–228. Springer International Publishing, Cham (2017)
22.
Zurück zum Zitat Jairam Naik, K., Pedagandam, M., Mishra, A.: Workflow scheduling optimisation for distributed environment using artificial neural networks and reinforcement learning. Int. J. Comput. Sci. Eng. 24(6), 653–670 (2021)CrossRef Jairam Naik, K., Pedagandam, M., Mishra, A.: Workflow scheduling optimisation for distributed environment using artificial neural networks and reinforcement learning. Int. J. Comput. Sci. Eng. 24(6), 653–670 (2021)CrossRef
23.
Zurück zum Zitat Huang, J.: The workflow task scheduling algorithm based on the ga model in the cloud computing environment. J. Softw. 9(4), 873–880 (2014)CrossRef Huang, J.: The workflow task scheduling algorithm based on the ga model in the cloud computing environment. J. Softw. 9(4), 873–880 (2014)CrossRef
24.
Zurück zum Zitat Hafsi, H., Gharsellaoui, H., Bouamama, S.: Towards a Novel NSGAII-based Approach for Multi-objectives Scientific Workflow Scheduling on Hybrid Clouds. GECCO (2019) Hafsi, H., Gharsellaoui, H., Bouamama, S.: Towards a Novel NSGAII-based Approach for Multi-objectives Scientific Workflow Scheduling on Hybrid Clouds. GECCO (2019)
25.
Zurück zum Zitat Chen, W., Deelman, E.: Workflowsim: a toolkit for simulating scientific workflows in distributed environments. In: 2012 IEEE 8th International Conference on E-science, pp. 1–8. IEEE, New York (2012) Chen, W., Deelman, E.: Workflowsim: a toolkit for simulating scientific workflows in distributed environments. In: 2012 IEEE 8th International Conference on E-science, pp. 1–8. IEEE, New York (2012)
26.
Zurück zum Zitat Juve, G., Chervenak, A., Deelman, E., Bharathi, S., Mehta, G., Vahi, K.: Characterizing and profiling scientific workflows. Future Gener. Comput. Syst. 29, 682–692 (2013)CrossRef Juve, G., Chervenak, A., Deelman, E., Bharathi, S., Mehta, G., Vahi, K.: Characterizing and profiling scientific workflows. Future Gener. Comput. Syst. 29, 682–692 (2013)CrossRef
27.
Zurück zum Zitat Gupta, I., Choudhary, A., Jana, P.: Generation and Proliferation of Random Directed Acyclic Graphs for Workflow Scheduling Problem, pp 123–127. ACM, New York (2017) Gupta, I., Choudhary, A., Jana, P.: Generation and Proliferation of Random Directed Acyclic Graphs for Workflow Scheduling Problem, pp 123–127. ACM, New York (2017)
29.
Zurück zum Zitat Cordeiro, D., Mounié, G., Perarnau, S., Trystram, D., Vincent, J.M., Wagner, F.: Random Graph Generation for Scheduling Simulations. ICST (2010) Cordeiro, D., Mounié, G., Perarnau, S., Trystram, D., Vincent, J.M., Wagner, F.: Random Graph Generation for Scheduling Simulations. ICST (2010)
30.
Zurück zum Zitat Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to Sequence Learning with Neural Networks. NIPS, Montreal (2014) Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to Sequence Learning with Neural Networks. NIPS, Montreal (2014)
Metadaten
Titel
A novel seq2seq-based prediction approach for workflow scheduling
verfasst von
Zhongguo Yang
Mingzhu Zhang
Han Li
Weilong Ding
Publikationsdatum
11.06.2023
Verlag
Springer US
Erschienen in
Cluster Computing / Ausgabe 2/2024
Print ISSN: 1386-7857
Elektronische ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-023-04061-3

Weitere Artikel der Ausgabe 2/2024

Cluster Computing 2/2024 Zur Ausgabe

Premium Partner