Skip to main content
Erschienen in: Automated Software Engineering 2/2022

01.11.2022

Actor-critic architecture based probabilistic meta-reinforcement learning for load balancing of controllers in software defined networks

verfasst von: Ashish Sharma, Sanjiv Tokekar, Sunita Varma

Erschienen in: Automated Software Engineering | Ausgabe 2/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The exponential growth in the complexity of network architecture to accommodate the enormous amount of data has motivated the emergence of software-defined networks (SDN). However, the scaling of network size and related services seriously affects the controller resource utilization in SDN. An optimal load balancing strategy is required to accommodate the dynamic network traffic and load distribution among different controllers. Although reinforcement learning has been used for load balancing in SDN by modelling it as the linear optimization problem, it could not perform well for the real-time load in SDN due to the uncertain and dynamic correlation between the resources. This paper presents a deep meta-reinforcement learning (meta-RL) technique to derive an intelligent optimization framework for load balancing in SDN using actor-critic architecture. Meta-RL is the modified version of conventional reinforcement learning which utilizes a smaller amount of training data comparatively to enable the agent to learn the policies. The proposed technique separates the task's inference and control to deal with the uncertainty while adapting to the new tasks in the sparse reward problem. It utilizes the probabilistic interpretation of task variables to solve the new task through sparse experience space. The simulation analysis supports the theoretical analysis of SDN's optimal load balancing technique. A real-time database has been utilized in this work to evaluate the efficiency and effectiveness of the proposed work. It has been found that the proposed technique showed an average of 10.4%, 19.1%, and 6.04% improvement in load balancing rate as compared to DC-LB, MMO-LB, and CRL-LB techniques. It also showed an improvement of 15.7%, 12.18%, and 5.18% in processing delay compared to the same methods, respectively. The standard deviation is also improved by 23.72% compared to the scenario of no-load balancing.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Belgaum, M.R., Musa, S., Alam, M.M., Su’ud, M.M.: A systematic review of load balancing techniques in software-defined networking. IEEE Access 8, 98612–98636 (2020)CrossRef Belgaum, M.R., Musa, S., Alam, M.M., Su’ud, M.M.: A systematic review of load balancing techniques in software-defined networking. IEEE Access 8, 98612–98636 (2020)CrossRef
Zurück zum Zitat Benzekki, K., El Fergougui, A., Elalaoui, A.E.: Software-defined networking (SDN): a survey. Secur. Commun. Netw. 9(18), 5803–5833 (2016)CrossRef Benzekki, K., El Fergougui, A., Elalaoui, A.E.: Software-defined networking (SDN): a survey. Secur. Commun. Netw. 9(18), 5803–5833 (2016)CrossRef
Zurück zum Zitat Cimorelli, F., Priscoli, F.D., Pietrabissa, A., Celsi, L.R., Suraci, V., Zuccaro, L. (2016): A distributed load balancing algorithm for the control plane in software defined networking. In: Proc. 24th Medit. Conf. Control Autom. (MED), pp. 1033–1040 (2016) Cimorelli, F., Priscoli, F.D., Pietrabissa, A., Celsi, L.R., Suraci, V., Zuccaro, L. (2016): A distributed load balancing algorithm for the control plane in software defined networking. In: Proc. 24th Medit. Conf. Control Autom. (MED), pp. 1033–1040 (2016)
Zurück zum Zitat Hu, T., Zhang, J., Kong, W., Yang, S., Cao, L.: Switch competing migration algorithm based on process optimization in SDN. J. Commun. 8, 213–222 (2017) Hu, T., Zhang, J., Kong, W., Yang, S., Cao, L.: Switch competing migration algorithm based on process optimization in SDN. J. Commun. 8, 213–222 (2017)
Zurück zum Zitat Kang, B., Choo, H.: An SDN-enhanced load-balancing technique in the cloud system. J. Supercomput. 3, 1–24 (2016) Kang, B., Choo, H.: An SDN-enhanced load-balancing technique in the cloud system. J. Supercomput. 3, 1–24 (2016)
Zurück zum Zitat Lemeshko, O., Yeremenko, O.: Enhanced method of fast re-routing with load balancing in software-defined networks. J. Electr. Eng. 68(6), 444–454 (2017) Lemeshko, O., Yeremenko, O.: Enhanced method of fast re-routing with load balancing in software-defined networks. J. Electr. Eng. 68(6), 444–454 (2017)
Zurück zum Zitat Li, Z., Zhou, Z., Gao, J., Qin, Y.: SDN controller load balancing based on reinforcement learning. In: Proceedings of the 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 23–25 November 2018 Li, Z., Zhou, Z., Gao, J., Qin, Y.: SDN controller load balancing based on reinforcement learning. In: Proceedings of the 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 23–25 November 2018
Zurück zum Zitat McKeown, N., Anderson, T., Balakrishnan, H., Parulkar, G., Peterson, L., Rexford, J., Shenker, S., Turner, J.: Openflow: enabling innovation in campus networks. SIGCOMM Comput. Commun. Rev. 38(2), 69–74 (2008)CrossRef McKeown, N., Anderson, T., Balakrishnan, H., Parulkar, G., Peterson, L., Rexford, J., Shenker, S., Turner, J.: Openflow: enabling innovation in campus networks. SIGCOMM Comput. Commun. Rev. 38(2), 69–74 (2008)CrossRef
Zurück zum Zitat Mnih, V., Badia, A.P., Mirza, M., Graves, A., Lillicrap, T., Harley, T., Silver, D., Kavukcuoglu, K.: Asynchronous methods for deep reinforcement learning. In: Proceedings of the 33rd International Conference on Machine Learning, vol. 48, pp. 1928–1937 (2016) Mnih, V., Badia, A.P., Mirza, M., Graves, A., Lillicrap, T., Harley, T., Silver, D., Kavukcuoglu, K.: Asynchronous methods for deep reinforcement learning. In: Proceedings of the 33rd International Conference on Machine Learning, vol. 48, pp. 1928–1937 (2016)
Zurück zum Zitat Mu, T., Al-Fuqaha, A., Shuaib, K., Sallabi, F.M., Qadir, J.: SDN flow entry management using reinforcement learning. ACM Trans. Auton. Adapt. Syst. 13, 1–23 (2018)CrossRef Mu, T., Al-Fuqaha, A., Shuaib, K., Sallabi, F.M., Qadir, J.: SDN flow entry management using reinforcement learning. ACM Trans. Auton. Adapt. Syst. 13, 1–23 (2018)CrossRef
Zurück zum Zitat Nagabandi, A., Clavera, I., Liu, S., Fearing, R.S., Abbeel, P., Levine, S., Finn, C.: Learning to adapt in dynamic, real-world environments through meta-reinforcement learning. In: International Conference on Learning Representations (ICLR) (2019) Nagabandi, A., Clavera, I., Liu, S., Fearing, R.S., Abbeel, P., Levine, S., Finn, C.: Learning to adapt in dynamic, real-world environments through meta-reinforcement learning. In: International Conference on Learning Representations (ICLR) (2019)
Zurück zum Zitat Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Desmaison, A.: Pytorch: an imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 32, 8026–8037 (2019) Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Desmaison, A.: Pytorch: an imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 32, 8026–8037 (2019)
Zurück zum Zitat Rakelly, K., Zhou, A., Quillen, D., Finn, C., Levine, S.: Efficient off-policy meta-reinforcement learning via probabilistic context variables (2019). arXiv:1903.08254v1 Rakelly, K., Zhou, A., Quillen, D., Finn, C., Levine, S.: Efficient off-policy meta-reinforcement learning via probabilistic context variables (2019). arXiv:​1903.​08254v1
Zurück zum Zitat Rusu, A.A., Rao, D., Sygnowski, J., Vinyals, O., Pascanu, R., Osindero, S., Hadsell, R.: Meta-learning with latent embedding optimization. In: International Conference on Learning Representations (2019) Rusu, A.A., Rao, D., Sygnowski, J., Vinyals, O., Pascanu, R., Osindero, S., Hadsell, R.: Meta-learning with latent embedding optimization. In: International Conference on Learning Representations (2019)
Zurück zum Zitat Sahoo, K.S., Puthal, D., Tiwary, M., Usman, M., Sahoo, B., Wen, Z., Sahoo, B.P.S., Ranjan, R.: ESMLB: efficient switch migration-based load balancing for multi-controller SDN in IoT. IEEE Internet Things J. 7, 5852–5860 (2019)CrossRef Sahoo, K.S., Puthal, D., Tiwary, M., Usman, M., Sahoo, B., Wen, Z., Sahoo, B.P.S., Ranjan, R.: ESMLB: efficient switch migration-based load balancing for multi-controller SDN in IoT. IEEE Internet Things J. 7, 5852–5860 (2019)CrossRef
Zurück zum Zitat Schweighofera, N., Doya, K.: Meta-learning in reinforcement learning. Neural Netw. 16(1), 5–9 (2003)CrossRef Schweighofera, N., Doya, K.: Meta-learning in reinforcement learning. Neural Netw. 16(1), 5–9 (2003)CrossRef
Zurück zum Zitat Stadie, B.C., Yang, G., Houthooft, R., Chen, X., Duan, Y., Wu, Y., Abbeel, P., Sutskever, I.: Some considerations on learning to explore via meta-reinforcement learning (2018). arXiv preprint arXiv:1803.01118 Stadie, B.C., Yang, G., Houthooft, R., Chen, X., Duan, Y., Wu, Y., Abbeel, P., Sutskever, I.: Some considerations on learning to explore via meta-reinforcement learning (2018). arXiv preprint arXiv:​1803.​01118
Zurück zum Zitat Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, vol. 1, 2nd edn. MIT Press, Cambridge (2017)MATH Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, vol. 1, 2nd edn. MIT Press, Cambridge (2017)MATH
Zurück zum Zitat Symondson, S., Hofmann, K., Deisenroth, M.: Meta reinforcement learning with latent variable gaussian processes. In: Conference on Uncertainty in Artificial Intelligence (UAI) (2018) Symondson, S., Hofmann, K., Deisenroth, M.: Meta reinforcement learning with latent variable gaussian processes. In: Conference on Uncertainty in Artificial Intelligence (UAI) (2018)
Zurück zum Zitat Tosounidis, V., Pavlidis, G., Sakellaiou, I.: Deep Q-learning for load balancing traffic in SDN networks. In: Proceedings of the SETN: Hellenic Conference on Artificial Intelligence, Athens, Greece, 2–4 September 2020 Tosounidis, V., Pavlidis, G., Sakellaiou, I.: Deep Q-learning for load balancing traffic in SDN networks. In: Proceedings of the SETN: Hellenic Conference on Artificial Intelligence, Athens, Greece, 2–4 September 2020
Zurück zum Zitat Wang, Y., Zhang, Y., Chen, J.: SDNPS: a load-balanced topic-based Publish/Subscribe system in software-defined networking. Appl. Sci. 6(4), 91 (2016)CrossRef Wang, Y., Zhang, Y., Chen, J.: SDNPS: a load-balanced topic-based Publish/Subscribe system in software-defined networking. Appl. Sci. 6(4), 91 (2016)CrossRef
Zurück zum Zitat Wu, Y., Zhou, S., Wei, Y., Leng, S.: Deep reinforcement learning for controller placement in software defined network. In: Proceedings of the IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Toronto, ON, Canada, 6–9 July 2020 Wu, Y., Zhou, S., Wei, Y., Leng, S.: Deep reinforcement learning for controller placement in software defined network. In: Proceedings of the IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Toronto, ON, Canada, 6–9 July 2020
Zurück zum Zitat Ye, X., Cheng, G., Luo, X.: Maximizing SDN control resource utilization via switch migration. Comput. Netw. 126(24), 69–80 (2017)CrossRef Ye, X., Cheng, G., Luo, X.: Maximizing SDN control resource utilization via switch migration. Comput. Netw. 126(24), 69–80 (2017)CrossRef
Zurück zum Zitat Zhang, Y., Cui, L., Wang, W., Zhang, Y.: A survey on software defined networking with multiple controllers. J. Netw. Comput. Appl. 103, 101–118 (2018)CrossRef Zhang, Y., Cui, L., Wang, W., Zhang, Y.: A survey on software defined networking with multiple controllers. J. Netw. Comput. Appl. 103, 101–118 (2018)CrossRef
Zurück zum Zitat Zhang, S., Lan, J., Sun, P., Jiang, Y.: Online load balancing for distributed control plane in software-defined data center network. IEEE Access 6, 18184–18191 (2018)CrossRef Zhang, S., Lan, J., Sun, P., Jiang, Y.: Online load balancing for distributed control plane in software-defined data center network. IEEE Access 6, 18184–18191 (2018)CrossRef
Metadaten
Titel
Actor-critic architecture based probabilistic meta-reinforcement learning for load balancing of controllers in software defined networks
verfasst von
Ashish Sharma
Sanjiv Tokekar
Sunita Varma
Publikationsdatum
01.11.2022
Verlag
Springer US
Erschienen in
Automated Software Engineering / Ausgabe 2/2022
Print ISSN: 0928-8910
Elektronische ISSN: 1573-7535
DOI
https://doi.org/10.1007/s10515-022-00362-w

Weitere Artikel der Ausgabe 2/2022

Automated Software Engineering 2/2022 Zur Ausgabe

Premium Partner