Skip to main content
Top

2023 | OriginalPaper | Chapter

Resource Management in Cloud Computing Using Deep Reinforcement Learning: A Survey

Authors : Yuxin Feng, Feiyang Liu

Published in: Proceedings of the 10th Chinese Society of Aeronautics and Astronautics Youth Forum

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Next generation aircrafts will not only require high-performance and intelligent computing capabilities, but also a fast design-developing-integration-update time cycle. Cloud computing technology provides a platform with large amounts of hardware resources and software services, making applications development and deployment much more convenient. Thus, airborne cloud computing becomes an important design methodology for next generation avionics system. However, the dynamic and uncertain cloud environment makes efficient resource management very complicated. Due to the characteristics of dynamic autonomous decision-making, deep reinforcement learning has become a promising resource scheduling algorithm. This paper firstly analyzes the requirements of airborne cloud computing systems, then studies the basic theory of cloud resources management and scheduling strategies. The resource management algorithms based on deep reinforcement learning (DRL), some common DRL models, experimental platforms, and evaluation parameters are introduced in details. Finally, some critical problems and challenges in the design of DRL-based resource management algorithm are summarized. This paper can provide some technique supports for the airborne cloud computing system.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Jianchun, X., Zhonghua, W., Yahui, L.: The distributed computing framework research for avionics cloud[C]. In: IEEE International Conference on Networking and Network Applications, pp. 390–393 (2018) Jianchun, X., Zhonghua, W., Yahui, L.: The distributed computing framework research for avionics cloud[C]. In: IEEE International Conference on Networking and Network Applications, pp. 390–393 (2018)
2.
go back to reference Qin, Y., Wang, H., Yi, S., et al.: Virtual machine placement based on multi-objective reinforcement learning[J]. Appl. Intell. 50(8), 2370–2383 (2020)CrossRef Qin, Y., Wang, H., Yi, S., et al.: Virtual machine placement based on multi-objective reinforcement learning[J]. Appl. Intell. 50(8), 2370–2383 (2020)CrossRef
3.
go back to reference Dasjupta, P., Leblanc, R.J., Apple, W.F.: The clouds distributed operating system: Functional description, implementation details and related work[C]. In: The 8th International Conference on Distributed, pp. 2–3 (1988) Dasjupta, P., Leblanc, R.J., Apple, W.F.: The clouds distributed operating system: Functional description, implementation details and related work[C]. In: The 8th International Conference on Distributed, pp. 2–3 (1988)
4.
go back to reference Li, L.: An optimistic differentiated service job scheduling system for cloud computing service users and providers[C] 2009. In: Third International Conference on Multimedia and Ubiquitous Engineering, pp. 295–299 (2009) Li, L.: An optimistic differentiated service job scheduling system for cloud computing service users and providers[C] 2009. In: Third International Conference on Multimedia and Ubiquitous Engineering, pp. 295–299 (2009)
5.
go back to reference Gomathi, B., Krishnasamy, K., Bslaji, B.S.: Epsilon-fuzzy dominance sort-based composite discrete artificial bee colony optimisation for multi-objective cloud task scheduling problem[J]. Int. J. Bus. Intell. Data Min. 13(1–3), 247–266 (2018) Gomathi, B., Krishnasamy, K., Bslaji, B.S.: Epsilon-fuzzy dominance sort-based composite discrete artificial bee colony optimisation for multi-objective cloud task scheduling problem[J]. Int. J. Bus. Intell. Data Min. 13(1–3), 247–266 (2018)
6.
go back to reference Wang, B., Li, J.: Load balancing task scheduling based on multi-population genetic algorithm in cloud computing[C]. In: 2016 35th Chinese Control Conference (CCC), pp. 5261–5266 (2016) Wang, B., Li, J.: Load balancing task scheduling based on multi-population genetic algorithm in cloud computing[C]. In: 2016 35th Chinese Control Conference (CCC), pp. 5261–5266 (2016)
7.
go back to reference Mao, H., Alizadeh, M., Menache, I. et al.: Resource management with deep reinforcement learning[C]. In: Proceedings of the 15th ACM Workshop on Hot Topics in Networks, pp. 50–56 (2016) Mao, H., Alizadeh, M., Menache, I. et al.: Resource management with deep reinforcement learning[C]. In: Proceedings of the 15th ACM Workshop on Hot Topics in Networks, pp. 50–56 (2016)
8.
go back to reference Zhang, Y., Yao, J., Guan, H.: Intelligent cloud resource management with deep reinforcement learning[J]. IEEE Cloud Comput. 4(6), 60–69 (2017)CrossRef Zhang, Y., Yao, J., Guan, H.: Intelligent cloud resource management with deep reinforcement learning[J]. IEEE Cloud Comput. 4(6), 60–69 (2017)CrossRef
9.
go back to reference Mao, H., Schwarzkopf, M., Venkatakrishnan, S.B., et al.: Learning scheduling algorithms for data processing clusters[G]. In: Proceedings of the ACM Special Interest Group on Data Communication, pp. 270–288 (2019) Mao, H., Schwarzkopf, M., Venkatakrishnan, S.B., et al.: Learning scheduling algorithms for data processing clusters[G]. In: Proceedings of the ACM Special Interest Group on Data Communication, pp. 270–288 (2019)
10.
go back to reference Sung, T.T., Ha, J., Kim, J. et al.: Deepsocs: a neural scheduler for heterogeneous system-on-chip (soc) resource scheduling[J]. Electronics 9(6), 936 (2020) Sung, T.T., Ha, J., Kim, J. et al.: Deepsocs: a neural scheduler for heterogeneous system-on-chip (soc) resource scheduling[J]. Electronics 9(6), 936 (2020)
11.
go back to reference Ding, D., Fan, X., Zhao, Y., et al.: Q-learning based dynamic task scheduling for energy-efficient cloud computing[J]. Futur. Gener. Comput. Syst. 108, 361–371 (2020) Ding, D., Fan, X., Zhao, Y., et al.: Q-learning based dynamic task scheduling for energy-efficient cloud computing[J]. Futur. Gener. Comput. Syst. 108, 361–371 (2020)
12.
go back to reference Bibal Benifa, J., Dejey, D.: Rlpas: Reinforcement learning-based proactive auto-scaler for resource provisioning in cloud environment[J]. Mob. Netw. Appl. 24(4), 1348–1363 (2019)CrossRef Bibal Benifa, J., Dejey, D.: Rlpas: Reinforcement learning-based proactive auto-scaler for resource provisioning in cloud environment[J]. Mob. Netw. Appl. 24(4), 1348–1363 (2019)CrossRef
13.
go back to reference Balasubramanian, V., Aloqaily, M., Tunde-onadele, O., et al.: Reinforcing cloud environments via index policy for bursty workloads[C]. In: NOMS 2020–2020 IEEE/IFIP Network Operations and Management Symposium, pp. 1–7 (2020) Balasubramanian, V., Aloqaily, M., Tunde-onadele, O., et al.: Reinforcing cloud environments via index policy for bursty workloads[C]. In: NOMS 2020–2020 IEEE/IFIP Network Operations and Management Symposium, pp. 1–7 (2020)
14.
go back to reference Balla, H.A., Sheng, C.G., Jing, W.: Reliability-aware: task scheduling in cloud computing using multi-agent reinforcement learning algorithm and neural fitted Q.[J]. Int. Arab J. Inf. Technol. 18(1), 36–47 (2021) Balla, H.A., Sheng, C.G., Jing, W.: Reliability-aware: task scheduling in cloud computing using multi-agent reinforcement learning algorithm and neural fitted Q.[J]. Int. Arab J. Inf. Technol. 18(1), 36–47 (2021)
15.
go back to reference Swarup, S., Shakshuki, E.M., Yasar, A.: Task scheduling in cloud using deep reinforcement learning[J]. Procedia Comput. Sci. 184, 42–51 (2021) Swarup, S., Shakshuki, E.M., Yasar, A.: Task scheduling in cloud using deep reinforcement learning[J]. Procedia Comput. Sci. 184, 42–51 (2021)
16.
17.
go back to reference Sheng, J., Hu, Y., Zhow, W. et al.: Learning to schedule multi-NUMA virtual machines via reinforcement learning[J]. Pattern Recognit. 121, 108254 (2022) Sheng, J., Hu, Y., Zhow, W. et al.: Learning to schedule multi-NUMA virtual machines via reinforcement learning[J]. Pattern Recognit. 121, 108254 (2022)
18.
go back to reference Rjoub, G., Bentahar, J., Abdel Wahab, O., et al.: Deep and reinforcement learning for automated task scheduling in large-scale cloud computing systems[J]. Concurr. Comput.: Pract. Exp. 33(23), e5919 (2021) Rjoub, G., Bentahar, J., Abdel Wahab, O., et al.: Deep and reinforcement learning for automated task scheduling in large-scale cloud computing systems[J]. Concurr. Comput.: Pract. Exp. 33(23), e5919 (2021)
19.
go back to reference Yu, Z., Machado, P., Zahid, A., et al.: Energy and performance trade-off optimization in heterogeneous computing via reinforcement learning[J]. Electronics 9(11), 1812 (2020) Yu, Z., Machado, P., Zahid, A., et al.: Energy and performance trade-off optimization in heterogeneous computing via reinforcement learning[J]. Electronics 9(11), 1812 (2020)
20.
go back to reference Chou, Q., Fan, W., Zhang, J.: A reinforcement learning model for virtual machines consolidation in cloud data center [C]. In: 2021 6th international conference on automation, control and robotics engineering (CACRE), pp. 16–21 (2021) Chou, Q., Fan, W., Zhang, J.: A reinforcement learning model for virtual machines consolidation in cloud data center [C]. In: 2021 6th international conference on automation, control and robotics engineering (CACRE), pp. 16–21 (2021)
Metadata
Title
Resource Management in Cloud Computing Using Deep Reinforcement Learning: A Survey
Authors
Yuxin Feng
Feiyang Liu
Copyright Year
2023
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-19-7652-0_56

Premium Partner