Skip to main content
Erschienen in: Cluster Computing 1/2021

01.02.2021

OMBM-ML: efficient memory bandwidth management for ensuring QoS and improving server utilization

verfasst von: Hanul Sung, Jeesoo Min, Donghun Koo, Hyeonsang Eom

Erschienen in: Cluster Computing | Ausgabe 1/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

As cloud data centers are dramatically growing, various applications are moved to cloud data centers owing to cost benefits for maintenance and hardware resources. However, latency-critical workloads among them suffer from some problems to fully achieve the cost-effectiveness. The latency-critical workloads should show latencies in a stable manner, to be predicted, for strictly meeting QoSs. However, if they are executed with other workloads to save the cost, they experience QoS violation due to the contention for the hardware resources shared with co-location workloads. In order to guarantee QoSs and to improve the hardware resource utilization, we proposed a memory bandwidth management method with an effective prediction model using machine learning. The prediction model estimates the amount of memory bandwidth that will be allocated to the latency-critical workload based on a REP decision tree. To construct this model, we first collect data and train the model with the data. The generated model can estimate the amount of memory bandwidth for meeting the SLO of the latency-critical workload no matter what batch processing workloads are collocated. The use of our approach achieves up to 99% SLO assurance and improves the server utilization up to 6.8\(\times\) on average.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
3.
Zurück zum Zitat Amy Ousterhout, J.B., Joshua Fried, A.B., Hari Balakrishnan, M.C.: Shenango: achieving high CPU efficiency for latency-sensitive datacenter workloads. In: Proceedings of the 16th USENIX Conference on Networked Systems Design and Implementation (2019) Amy Ousterhout, J.B., Joshua Fried, A.B., Hari Balakrishnan, M.C.: Shenango: achieving high CPU efficiency for latency-sensitive datacenter workloads. In: Proceedings of the 16th USENIX Conference on Networked Systems Design and Implementation (2019)
4.
Zurück zum Zitat Azimi, R., Kwon, Y., Elnikety, S., Syamala, M., Narasayya, V., Herodotou, H., Microsoft, P.T., Alex, B., Microsoft, C., Jack, B., Microsoft, Z., Wang, B.J., Bing, M.: PerfIso: Performance Isolation for Commercial Latency-Sensitive Services C: alin Iorgulescu* EPFL. Technical report (2018) Azimi, R., Kwon, Y., Elnikety, S., Syamala, M., Narasayya, V., Herodotou, H., Microsoft, P.T., Alex, B., Microsoft, C., Jack, B.,  Microsoft, Z., Wang, B.J.,  Bing, M.: PerfIso: Performance Isolation for Commercial Latency-Sensitive Services C: alin Iorgulescu* EPFL. Technical report (2018)
5.
Zurück zum Zitat Barroso, L.A., Clidaras, J., Hoelzle, U.: The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines. Morgan & Claypool Publishers, San Rafael (2013) Barroso, L.A., Clidaras, J., Hoelzle, U.: The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines. Morgan & Claypool Publishers, San Rafael (2013)
6.
Zurück zum Zitat Chen, Q., Wang, Z., Leng, J., Li, C., Zheng, W., Guo Avalon, M.: Towards QoS awareness and improved utilization through multi-resource management in datacenters. In: Proceedings of the International Conference on Supercomputing, pp. 272–283, New York, NY, USA, Jun 2019. Association for Computing Machinery Chen, Q., Wang, Z., Leng, J., Li, C., Zheng, W., Guo Avalon, M.: Towards QoS awareness and improved utilization through multi-resource management in datacenters. In: Proceedings of the International Conference on Supercomputing, pp. 272–283, New York, NY, USA, Jun 2019. Association for Computing Machinery
7.
Zurück zum Zitat Dauwe, D., Jonardi, E., Friese, R., Pasricha, S., Maciejewski, A.A., Bader, D.A., Siegel, H.J.: A methodology for co-location aware application performance modeling in multicore computing. In: 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, pp. 434–443, May 2015 Dauwe, D., Jonardi, E.,  Friese, R.,  Pasricha, S., Maciejewski, A.A., Bader, D.A., Siegel, H.J.: A methodology for co-location aware application performance modeling in multicore computing. In: 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, pp. 434–443, May 2015
8.
Zurück zum Zitat Delimitrou, C., Kozyrakis, C.: Quasar: Resource-efficient and qos-aware cluster management. In: Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS ’14, pp. 127–144, New York, NY, USA (2014). ACM Delimitrou, C.,  Kozyrakis, C.: Quasar: Resource-efficient and qos-aware cluster management. In: Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS ’14, pp. 127–144, New York, NY, USA (2014). ACM
9.
Zurück zum Zitat Desai, N., Cirne, W.: Job Scheduling Strategies for Parallel Processing, pp. 274–278. Springer, Cham (2017)CrossRef Desai, N., Cirne, W.: Job Scheduling Strategies for Parallel Processing, pp. 274–278. Springer, Cham (2017)CrossRef
10.
Zurück zum Zitat Di, S., Kondo, D., Cirne, W.: Characterization and comparison of cloud versus grid workloads. In: 2012 IEEE International Conference on Cluster Computing, pp. 230–238, Sept 2012 Di, S., Kondo, D., Cirne, W.: Characterization and comparison of cloud versus grid workloads. In: 2012 IEEE International Conference on Cluster Computing, pp. 230–238, Sept 2012
11.
Zurück zum Zitat Dwyer, T. , Fedorova, A., Blagodurov, S., Roth, M., Gaud, F., Pei, J.: A practical method for estimating performance degradation on multicore processors, and its application to hpc workloads. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC ’12, pp. 83:1–83:11, Los Alamitos, CA, USA, 2012. IEEE Computer Society Press Dwyer, T. , Fedorova, A.,  Blagodurov, S., Roth, M., Gaud, F.,  Pei, J.: A practical method for estimating performance degradation on multicore processors, and its application to hpc workloads. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC ’12, pp. 83:1–83:11, Los Alamitos, CA, USA, 2012. IEEE Computer Society Press
12.
Zurück zum Zitat Huang, S., Huang, J., Dai, J., Xie, T., Huang, B.: The hibench benchmark suite: characterization of the mapreduce-based data analysis. In: 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010), pp. 41–51, March 2010 Huang, S.,  Huang, J., Dai, J.,  Xie, T.,  Huang, B.: The hibench benchmark suite: characterization of the mapreduce-based data analysis. In: 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010), pp. 41–51, March 2010
13.
Zurück zum Zitat Hurt, K., John, E.: Analysis of memory sensitive spec cpu2006 integer benchmarks for big data benchmarking. In: Proceedings of the 1st Workshop on Performance Analysis of Big Data Systems, PABS ’15, pp. 11–16, New York, NY, USA (2015). ACM Hurt, K., John, E.: Analysis of memory sensitive spec cpu2006 integer benchmarks for big data benchmarking. In: Proceedings of the 1st Workshop on Performance Analysis of Big Data Systems, PABS ’15, pp. 11–16, New York, NY, USA (2015). ACM
14.
Zurück zum Zitat Kalmegh, S.: Analysis of weka data mining algorithm reptree, simple cart and randomtree for classification of indian news. Int. J. Innov. Sci. Eng. Technol 2(2), 438–446 (2015) Kalmegh, S.: Analysis of weka data mining algorithm reptree, simple cart and randomtree for classification of indian news. Int. J. Innov. Sci. Eng. Technol 2(2), 438–446 (2015)
15.
Zurück zum Zitat Kasture, H., Sanchez, D.: Ubik: efficient cache sharing with strict qos for latency-critical workloads. In: Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS ’14, pp. 729–742, New York, NY, USA (2014). ACM Kasture, H., Sanchez, D.: Ubik: efficient cache sharing with strict qos for latency-critical workloads. In: Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS ’14, pp. 729–742, New York, NY, USA (2014). ACM
16.
Zurück zum Zitat Kasture, H., Sanchez, D.: Tailbench: a benchmark suite and evaluation methodology for latency-critical applications. In: 2016 IEEE International Symposium on Workload Characterization (IISWC), pp. 1–10, Sept 2016 Kasture, H., Sanchez, D.: Tailbench: a benchmark suite and evaluation methodology for latency-critical applications. In: 2016 IEEE International Symposium on Workload Characterization (IISWC), pp. 1–10, Sept 2016
17.
Zurück zum Zitat Kyungyoung, C., Park, R.C.: Cloud based u-healthcare network with QoS guarantee for mobile health service. In: Cluster Computing (2017) Kyungyoung, C., Park, R.C.: Cloud based u-healthcare network with QoS guarantee for mobile health service. In: Cluster Computing (2017)
18.
Zurück zum Zitat Lakshmi Devasena, C.: Comparative analysis of random forest, rep tree and j48 classifiers for credit risk prediction. In: International Journal of Computer Applications (0975-8887), International Conference on Communication, Computing and Information Technology (ICCCMIT-2014) (2014) Lakshmi Devasena, C.: Comparative analysis of random forest, rep tree and j48 classifiers for credit risk prediction. In: International Journal of Computer Applications (0975-8887), International Conference on Communication, Computing and Information Technology (ICCCMIT-2014) (2014)
19.
Zurück zum Zitat Li Chunlin, T.J., Youlong, L.: Distributed QoS-aware scheduling optimization for resource-intensive mobile application in hybrid cloud. In: Cluster Computing (2017) Li Chunlin, T.J.,  Youlong, L.: Distributed QoS-aware scheduling optimization for resource-intensive mobile application in hybrid cloud. In: Cluster Computing (2017)
20.
Zurück zum Zitat Lo, D., Cheng, L., Govindaraju, R., Ranganathan, P., Kozyrakis, C.: Heracles: Improving resource efficiency at scale. In: Proceedings of the 42nd Annual International Symposium on Computer Architecture, ISCA ’15, pp. 450–462, New York, NY, USA (2015). ACM Lo, D.,  Cheng, L., Govindaraju, R.,  Ranganathan, P., Kozyrakis, C.: Heracles: Improving resource efficiency at scale. In: Proceedings of the 42nd Annual International Symposium on Computer Architecture, ISCA ’15, pp. 450–462, New York, NY, USA (2015). ACM
21.
Zurück zum Zitat Mahmoud, Z.H.A., Badawy, M., Ali, H.A.: QoS provisioning framework for service-oriented internet of things (IoT). In: Cluster Computing (2019) Mahmoud, Z.H.A., Badawy, M., Ali, H.A.: QoS provisioning framework for service-oriented internet of things (IoT). In: Cluster Computing (2019)
22.
Zurück zum Zitat Mars, J., Tang, L., Hundt, R., Skadron, K., Soffa, M.L.: Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations. In: Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-44, pp. 248–259, New York, NY, USA (2011). ACM Mars, J.,  Tang, L.,  Hundt, R., Skadron, K., Soffa, M.L.: Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations. In: Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-44, pp. 248–259, New York, NY, USA (2011). ACM
23.
Zurück zum Zitat Anithadevi, N., Sundarambal, M.: A design of intelligent QoS aware web service recommendation system. In: Cluster Computing (2018) Anithadevi, N., Sundarambal, M.: A design of intelligent QoS aware web service recommendation system. In: Cluster Computing (2018)
24.
Zurück zum Zitat Nathuji, R., Kansal, A., Ghaffarkhah, A.: Q-clouds: managing performance interference effects for qos-aware clouds. In: Proceedings of the 5th European Conference on Computer Systems, EuroSys ’10, pp. 237–250, New York, NY, USA (2010). ACM Nathuji, R.,  Kansal, A.,  Ghaffarkhah, A.: Q-clouds: managing performance interference effects for qos-aware clouds. In: Proceedings of the 5th European Conference on Computer Systems, EuroSys ’10, pp. 237–250, New York, NY, USA (2010). ACM
25.
Zurück zum Zitat Patel, T., Tiwari, D.: CLITE: efficient and QoS-aware co-location of multiple latency-critical jobs for warehouse scale computers. In Proceedings—2020 IEEE International Symposium on High Performance Computer Architecture, HPCA 2020, pp. 193–206. Institute of Electrical and Electronics Engineers Inc., Feb 2020 Patel, T.,  Tiwari, D.: CLITE: efficient and QoS-aware co-location of multiple latency-critical jobs for warehouse scale computers. In Proceedings—2020 IEEE International Symposium on High Performance Computer Architecture, HPCA 2020, pp. 193–206. Institute of Electrical and Electronics Engineers Inc., Feb 2020
26.
Zurück zum Zitat Santiago Felici-Castell, J.S.G., Garcia-Pineda, M.: Adaptive QoE-based architecture on cloud mobile media for live streaming. In: Cluster Computing (2018) Santiago Felici-Castell, J.S.G., Garcia-Pineda, M.: Adaptive QoE-based architecture on cloud mobile media for live streaming. In: Cluster Computing (2018)
27.
Zurück zum Zitat Sukhpal Singh Gill, M.S., Charana, I., Buyya, R.: CHOPPER: an intelligent QoS-aware autonomic resource management approach for cloud computing. In: Cluster Computing (2017) Sukhpal Singh Gill, M.S., Charana, I., Buyya, R.: CHOPPER: an intelligent QoS-aware autonomic resource management approach for cloud computing. In: Cluster Computing (2017)
28.
Zurück zum Zitat Sung, H., Min, J., Ha, S., Eom, H.: OMBM: optimized memory bandwidth management for ensuring QoS and high server utilization. In: 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems (FAS*W), pp. 269–276. IEEE, Sep 2017 Sung, H., Min, J.,  Ha, S.,  Eom, H.: OMBM: optimized memory bandwidth management for ensuring QoS and high server utilization. In: 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems (FAS*W), pp. 269–276. IEEE, Sep 2017
29.
Zurück zum Zitat Witten, I., Frank, E., Hall, M. A., Pal, C. J.: Data Mining: Practical Machine Learning Tools and Techniques (2016) Witten, I.,  Frank, E., Hall, M. A., Pal, C. J.: Data Mining: Practical Machine Learning Tools and Techniques (2016)
30.
Zurück zum Zitat Xu, C., Felter, W., Rajamani, K., Rubio, J., Ferreira, A., Li, Y.: dCat: dynamic cache management for efficient, performance-sensitive infrastructure-as-a-service. In: Proceedings of the 13th EuroSys Conference, EuroSys 2018, volume 2018-January, pp. 1–13, New York, NY, USA, Apr 2018. Association for Computing Machinery, Inc. Xu, C.,  Felter, W.,  Rajamani, K., Rubio, J., Ferreira, A.,  Li, Y.: dCat: dynamic cache management for efficient, performance-sensitive infrastructure-as-a-service. In: Proceedings of the 13th EuroSys Conference, EuroSys 2018, volume 2018-January, pp. 1–13, New York, NY, USA, Apr 2018. Association for Computing Machinery, Inc.
31.
Zurück zum Zitat Yang, H., Breslow, A., Mars, J., Tang, L.: Bubble-flux: Precise online qos management for increased utilization in warehouse scale computers. In: Proceedings of the 40th Annual International Symposium on Computer Architecture, ISCA’13, pp. 607–618, New York, NY, USA (2013). ACM Yang, H.,  Breslow, A.,  Mars, J.,  Tang, L.: Bubble-flux: Precise online qos management for increased utilization in warehouse scale computers. In: Proceedings of the 40th Annual International Symposium on Computer Architecture, ISCA’13, pp. 607–618, New York, NY, USA (2013). ACM
32.
Zurück zum Zitat Yang, X., Blackburn, S. M., McKinley, K. S.: Elfen scheduling: Fine-grain principled borrowing from latency-critical workloads using simultaneous multithreading. In: 2016 USENIX Annual Technical Conference (USENIX ATC 16), pp. 309–322, Denver, CO, 2016. USENIX Association Yang, X., Blackburn, S. M., McKinley, K. S.: Elfen scheduling: Fine-grain principled borrowing from latency-critical workloads using simultaneous multithreading. In: 2016 USENIX Annual Technical Conference (USENIX ATC 16), pp. 309–322, Denver, CO, 2016. USENIX Association
33.
Zurück zum Zitat Yongfeng Cui, Y. M., Zhongyuan Zhao, Dong, S.: Resource allocation algorithm design of high quality of service based on chaotic neural network in wireless communication technology. In: Cluster Computing (2017) Yongfeng Cui, Y. M., Zhongyuan Zhao,  Dong, S.: Resource allocation algorithm design of high quality of service based on chaotic neural network in wireless communication technology. In: Cluster Computing (2017)
34.
Zurück zum Zitat Yun, H., Yao, G., Pellizzoni, R., Caccamo, M., Sha, L.: Memguard: memory bandwidth reservation system for efficient performance isolation in multi-core platforms. In: 2013 IEEE 19th Real-Time and Embedded Technology and Applications Symposium (RTAS), pp. 55–64, April, 2013 Yun, H., Yao, G., Pellizzoni, R., Caccamo, M.,  Sha, L.: Memguard: memory bandwidth reservation system for efficient performance isolation in multi-core platforms. In: 2013 IEEE 19th Real-Time and Embedded Technology and Applications Symposium (RTAS), pp. 55–64, April, 2013
35.
Zurück zum Zitat Zhang, W., Cui, W., Fu, K., Chen, Q., Mawhirter, D. E., Wu, B., Li, C., Guo, M.: Laius: towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters. In: Proceedings of the International Conference on Supercomputing, pages 58–68. Association for Computing Machinery, Jun, 2019 Zhang, W.,  Cui, W.,  Fu, K.,  Chen, Q., Mawhirter, D. E.,  Wu, B., Li, C., Guo, M.: Laius: towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters. In: Proceedings of the International Conference on Supercomputing, pages 58–68. Association for Computing Machinery, Jun, 2019
36.
Zurück zum Zitat Zhu, H., Erez, M.: Dirigent: enforcing qos for latency-critical tasks on shared multicore systems. In: Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS ’16, pp. 33–47, New York, NY, USA (2016). ACM Zhu, H., Erez, M.: Dirigent: enforcing qos for latency-critical tasks on shared multicore systems. In: Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS ’16, pp. 33–47, New York, NY, USA (2016). ACM
Metadaten
Titel
OMBM-ML: efficient memory bandwidth management for ensuring QoS and improving server utilization
verfasst von
Hanul Sung
Jeesoo Min
Donghun Koo
Hyeonsang Eom
Publikationsdatum
01.02.2021
Verlag
Springer US
Erschienen in
Cluster Computing / Ausgabe 1/2021
Print ISSN: 1386-7857
Elektronische ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-020-03191-2

Weitere Artikel der Ausgabe 1/2021

Cluster Computing 1/2021 Zur Ausgabe