Skip to main content
Erschienen in: Cluster Computing 3/2019

05.01.2019

Event driven power consumption optimization control model of GPU clusters

verfasst von: Haifeng Wang, Yunpeng Cao

Erschienen in: Cluster Computing | Ausgabe 3/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Reducing power consumption for GPU cluster in large-scale stream computing process can bring various benefits such as reducing operating costs and environmental effect. We formulate the problem of power consumption as a constrained optimization problem, minimizing power state of cluster nodes to reduce power consumption while guaranteeing system performance and reliability. The proposed control model based on Model Prediction Control is designed to make a comprehensive metric of GPU cluster achieve expected performance, energy efficiency and reliability. It is different from the previous models, which just consider power consumption as the sole control objective. The event-triggering mechanism is introduced to reduce control overhead. It successfully separates sampling cluster status signals from control model. So the controller needs not to periodically interrupt computing process to solve optimal solutions. Finally, we evaluate and compare this control model with the previous control model by using artificial and real-world workloads. The experimental results show that our proposed control model is able to outperform existing techniques.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Mike, S., Jeremy, E., Craig, S., et al.: ECOG: a power-efficient GPU cluster architecture for scientific computing. Comput. Sci. Eng. 13(2), 83–87 (2011)CrossRef Mike, S., Jeremy, E., Craig, S., et al.: ECOG: a power-efficient GPU cluster architecture for scientific computing. Comput. Sci. Eng. 13(2), 83–87 (2011)CrossRef
2.
Zurück zum Zitat Abbas, K., Nirwan, A.: Toward low-cost workload distribution for integrated green data centers. IEEE Commun. Lett. 19(1), 26–29 (2015)CrossRef Abbas, K., Nirwan, A.: Toward low-cost workload distribution for integrated green data centers. IEEE Commun. Lett. 19(1), 26–29 (2015)CrossRef
3.
Zurück zum Zitat Ashwin, M.A., Lokendra, S., et al.: MPI-ACC: accelerator-aware MPI for scientific applications. IEEE Trans. Parallel Distrib. Syst. 27(5), 1401–1414 (2016)CrossRef Ashwin, M.A., Lokendra, S., et al.: MPI-ACC: accelerator-aware MPI for scientific applications. IEEE Trans. Parallel Distrib. Syst. 27(5), 1401–1414 (2016)CrossRef
4.
Zurück zum Zitat Wang, H., Sreeram, P., Devendar, B., et al.: GPU-aware MPI on rdma-enabled cluster:design, implementation and evaluation. IEEE Trans. Parallel Distrib. Syst. 25(10), 2595–2605 (2014)CrossRef Wang, H., Sreeram, P., Devendar, B., et al.: GPU-aware MPI on rdma-enabled cluster:design, implementation and evaluation. IEEE Trans. Parallel Distrib. Syst. 25(10), 2595–2605 (2014)CrossRef
5.
Zurück zum Zitat Dario, B., Audric, L., et al.: Modeling and evaluation of energy policies in green clouds. IEEE Trans. Parallel Distrib. Syst. 26(11), 3052–3065 (2015)CrossRef Dario, B., Audric, L., et al.: Modeling and evaluation of energy policies in green clouds. IEEE Trans. Parallel Distrib. Syst. 26(11), 3052–3065 (2015)CrossRef
6.
Zurück zum Zitat Zhang, Y., Mueller, F.: Autogeneration and autotuning of 3D stencil codes on homogeneous and heterogeneous GPU clusters. IEEE Trans. Parallel Distrib Syst. 24(3), 417–427 (2013)CrossRef Zhang, Y., Mueller, F.: Autogeneration and autotuning of 3D stencil codes on homogeneous and heterogeneous GPU clusters. IEEE Trans. Parallel Distrib Syst. 24(3), 417–427 (2013)CrossRef
7.
Zurück zum Zitat Tang, Y., Gedik, B.: Autopipelining for data stream processing. IEEE Trans. Parallel Distrib Syst. 24(12), 2344–2354 (2013)CrossRef Tang, Y., Gedik, B.: Autopipelining for data stream processing. IEEE Trans. Parallel Distrib Syst. 24(12), 2344–2354 (2013)CrossRef
8.
Zurück zum Zitat Deng, Z., X, W., Wang, L., et al.: Parallel processing of dynamic continuous queries over streaming data flows. IEEE Trans. Parallel Distrib. Syst. 26(3), 834–864 (2015)CrossRef Deng, Z., X, W., Wang, L., et al.: Parallel processing of dynamic continuous queries over streaming data flows. IEEE Trans. Parallel Distrib. Syst. 26(3), 834–864 (2015)CrossRef
9.
Zurück zum Zitat Yang, J., Zeng, K., et al.: Dynamic cluster reconfiguration for energy conservation in computation intensive service. IEEE Trans. Comput. 61(10), 1401–1416 (2012)MathSciNetCrossRefMATH Yang, J., Zeng, K., et al.: Dynamic cluster reconfiguration for energy conservation in computation intensive service. IEEE Trans. Comput. 61(10), 1401–1416 (2012)MathSciNetCrossRefMATH
10.
Zurück zum Zitat Wang, H., Cao, Y.: Predicting power consumption of GPUs with fuzzy wavelet neural networks. Parallel Comput. 44(5), 18–36 (2015)CrossRef Wang, H., Cao, Y.: Predicting power consumption of GPUs with fuzzy wavelet neural networks. Parallel Comput. 44(5), 18–36 (2015)CrossRef
11.
Zurück zum Zitat Gandhi, A., Harchol-Balter, M. et al.: Optimal power allocation in server farms. In: Proceeding of the 11th International Joint Conference Measurement and Modeling of Computer Systems, pp. 157–168 (2009) Gandhi, A., Harchol-Balter, M. et al.: Optimal power allocation in server farms. In: Proceeding of the 11th International Joint Conference Measurement and Modeling of Computer Systems, pp. 157–168 (2009)
12.
Zurück zum Zitat Ewa, N.S., Andrzej, S., et al.: Dynamic power management in energy-aware computer networks and data intensive computing systems. Future Gener. Comput. Syst. 37, 284–296 (2014)CrossRef Ewa, N.S., Andrzej, S., et al.: Dynamic power management in energy-aware computer networks and data intensive computing systems. Future Gener. Comput. Syst. 37, 284–296 (2014)CrossRef
13.
Zurück zum Zitat Liu, Y., Zhu, H., Lu, K., Liu, Y.: A power provision and capping architecture for large scale systems. In: Proceeding of the 26th IEEE International Parallel and Distributed Processing Symposium Workship& PHD Forum, pp. 954–963 (2012) Liu, Y., Zhu, H., Lu, K., Liu, Y.: A power provision and capping architecture for large scale systems. In: Proceeding of the 26th IEEE International Parallel and Distributed Processing Symposium Workship& PHD Forum, pp. 954–963 (2012)
14.
Zurück zum Zitat Bertini, L., J, C.B., Daniel, M.: Power and performance control of soft real-time web server clusters. Inf. Process. Lett. 110, 767–773 (2010)MathSciNetCrossRefMATH Bertini, L., J, C.B., Daniel, M.: Power and performance control of soft real-time web server clusters. Inf. Process. Lett. 110, 767–773 (2010)MathSciNetCrossRefMATH
15.
Zurück zum Zitat Lefurgy, C., Wang, X., Ware, M: Server-level power control. In: Proceeding of the Fourth International Conference on Autonomic Computing(ICAC’07), (2007) Lefurgy, C., Wang, X., Ware, M: Server-level power control. In: Proceeding of the Fourth International Conference on Autonomic Computing(ICAC’07), (2007)
16.
Zurück zum Zitat Wang, X., Chen, M., Xing, F.: MIMI power control for high-density servers in an enclosure. IEEE Trans. Parallel Distrib Syst. 21(10), 1412–1426 (2010)CrossRef Wang, X., Chen, M., Xing, F.: MIMI power control for high-density servers in an enclosure. IEEE Trans. Parallel Distrib Syst. 21(10), 1412–1426 (2010)CrossRef
17.
Zurück zum Zitat Wang, X., Wang, Y.: Coordinating power control and performance management for virtualized server clusters. IEEE Trans. Parallel Distrib. Syst. 22(2), 245–259 (2011)CrossRef Wang, X., Wang, Y.: Coordinating power control and performance management for virtualized server clusters. IEEE Trans. Parallel Distrib. Syst. 22(2), 245–259 (2011)CrossRef
18.
Zurück zum Zitat Wang, X., Chen, M., Lefurgy, C., Keller, T.W.: SHIP: a scalable hierarchical power control architecture for large-scale data centers. IEEE Trans. Parallel Distrib. Syst. 23(1), 168–176 (2012)CrossRef Wang, X., Chen, M., Lefurgy, C., Keller, T.W.: SHIP: a scalable hierarchical power control architecture for large-scale data centers. IEEE Trans. Parallel Distrib. Syst. 23(1), 168–176 (2012)CrossRef
19.
Zurück zum Zitat Gong, J., Xu, X.: A gray-box feedback control approach for system-level peak power management. In: Proceeding of the 39th International Conference on Parallel Processing, pp. 555–564 (2010) Gong, J., Xu, X.: A gray-box feedback control approach for system-level peak power management. In: Proceeding of the 39th International Conference on Parallel Processing, pp. 555–564 (2010)
20.
Zurück zum Zitat Lama, P., Zhou, X.: Coordinated power and performance guarantee with fuzzy MIMO control in virtualized server clusters. IEEE Trans. Comput. 64(1), 97–111 (2015)MathSciNetCrossRefMATH Lama, P., Zhou, X.: Coordinated power and performance guarantee with fuzzy MIMO control in virtualized server clusters. IEEE Trans. Comput. 64(1), 97–111 (2015)MathSciNetCrossRefMATH
21.
Zurück zum Zitat Enokido, T., Takizawa, M.: An extended power consumption model for distributed applications. In: Proceeding of the 26th IEEE International Conference on Advanced Information Networking and Applications, pp. 912–919 (2012) Enokido, T., Takizawa, M.: An extended power consumption model for distributed applications. In: Proceeding of the 26th IEEE International Conference on Advanced Information Networking and Applications, pp. 912–919 (2012)
22.
Zurück zum Zitat Sergio, N., Cristian, P., et al.: Controlling datacenter power consumption while maintaining temperature and QoS levels. In: IEEE 3rd International Conference on Cloud Networking, pp. 242–247 (2014) Sergio, N., Cristian, P., et al.: Controlling datacenter power consumption while maintaining temperature and QoS levels. In: IEEE 3rd International Conference on Cloud Networking, pp. 242–247 (2014)
23.
Zurück zum Zitat Saul, C.L., Marcelo, D.F.: On the control of power consumption in server farms via heavy traffic approximation. In: IEEE 53rd Conference on Decision and Control, pp. 3683–3688 (2014) Saul, C.L., Marcelo, D.F.: On the control of power consumption in server farms via heavy traffic approximation. In: IEEE 53rd Conference on Decision and Control, pp. 3683–3688 (2014)
24.
Zurück zum Zitat Dimitrov, M., Mantor, M., Zhou, H.: Understanding software approaches for GPGPU reliability. In: Proceedings of 2nd workshop on general purpose processing on graphics processing units. ACM, New York Dimitrov, M., Mantor, M., Zhou, H.: Understanding software approaches for GPGPU reliability. In: Proceedings of 2nd workshop on general purpose processing on graphics processing units. ACM, New York
25.
Zurück zum Zitat Dal, D., Mansouri, N.: Power optimization with power islands synthesis. IEEE Trans. Comput. Aided Design Integr. Circuits Syst. 28(7), 1025–1037 (2009)CrossRef Dal, D., Mansouri, N.: Power optimization with power islands synthesis. IEEE Trans. Comput. Aided Design Integr. Circuits Syst. 28(7), 1025–1037 (2009)CrossRef
26.
Zurück zum Zitat Padoin, E.L., Pilla, L.L., et al.: Evaluating application performance and energy consumption on hybrid CPU + GPU architecture. Clust. Comput. 16, 511–525 (2013)CrossRef Padoin, E.L., Pilla, L.L., et al.: Evaluating application performance and energy consumption on hybrid CPU + GPU architecture. Clust. Comput. 16, 511–525 (2013)CrossRef
27.
Zurück zum Zitat Degalahal, V., Li, L., Narayanan, V.: Soft errors issues in low-power caches. IEEE Trans. Very Large Scale Integr. Syst. 13(10), 1157–1166 (2005)CrossRef Degalahal, V., Li, L., Narayanan, V.: Soft errors issues in low-power caches. IEEE Trans. Very Large Scale Integr. Syst. 13(10), 1157–1166 (2005)CrossRef
28.
Zurück zum Zitat Firouzi, F., Azarpeyvand, A., et al.: Adaptive fault-tolerant DVFS with dynamic online AVF prediction. Microelectron. Reliab. 52, 1197–1208 (2012)CrossRef Firouzi, F., Azarpeyvand, A., et al.: Adaptive fault-tolerant DVFS with dynamic online AVF prediction. Microelectron. Reliab. 52, 1197–1208 (2012)CrossRef
29.
Zurück zum Zitat Zhu, D., Aydin, H.: Reliability-aware energy management for periodic real-time tasks. IEEE Trans. Comput. 58(10), 1382–1397 (2009)MathSciNetCrossRefMATH Zhu, D., Aydin, H.: Reliability-aware energy management for periodic real-time tasks. IEEE Trans. Comput. 58(10), 1382–1397 (2009)MathSciNetCrossRefMATH
30.
Zurück zum Zitat Dixit, A., Wood, A.: The impact of new technology on soft error rates. 2011 IEEE International Reliability Physics Symposium(IRPS), pp. 5B.4.1–5B.4.7 (2011) Dixit, A., Wood, A.: The impact of new technology on soft error rates. 2011 IEEE International Reliability Physics Symposium(IRPS), pp. 5B.4.1–5B.4.7 (2011)
31.
Zurück zum Zitat Zhao, B. Aydin, H., Zhu, D.: Energy management under general task-level reliability constraints. In: 2012 IEEE 18th Real Time and Embedded Technology and Applications Symposium, pp. 1080–1812 (2012) Zhao, B. Aydin, H., Zhu, D.: Energy management under general task-level reliability constraints. In: 2012 IEEE 18th Real Time and Embedded Technology and Applications Symposium, pp. 1080–1812 (2012)
32.
Zurück zum Zitat Hancao, L., Haddad, W.M.: Model predictive control for a multi-compartment respiratory system. IEEE Trans. Instrum. Meas. 21(5), 1988–1995 (2013) Hancao, L., Haddad, W.M.: Model predictive control for a multi-compartment respiratory system. IEEE Trans. Instrum. Meas. 21(5), 1988–1995 (2013)
33.
Zurück zum Zitat Chen, Y., Zhang, J., et al.: A service selection model using mixed intelligent optimization. Chin. J. Comput. 36(2), 384–391 (2013). (in Chinese) Chen, Y., Zhang, J., et al.: A service selection model using mixed intelligent optimization. Chin. J. Comput. 36(2), 384–391 (2013). (in Chinese)
34.
Zurück zum Zitat Li, X.: A novel effective solution for non-differentiable optimization problems. Sci. Sin. Math. 24(4), 371–377 (1994). (in Chinese) Li, X.: A novel effective solution for non-differentiable optimization problems. Sci. Sin. Math. 24(4), 371–377 (1994). (in Chinese)
35.
Zurück zum Zitat Li, S., Zheng, Y., Lin, Z.: Impacted-region optimization for distributed model predictive control systems with constraints. IEEE Trans. Autom. Sci. Eng. 99(5), 1–14 (2014) Li, S., Zheng, Y., Lin, Z.: Impacted-region optimization for distributed model predictive control systems with constraints. IEEE Trans. Autom. Sci. Eng. 99(5), 1–14 (2014)
36.
Zurück zum Zitat Hsueh, Y., Chen, H.: Map matching for low-sampling-rate GPS trajectories by exploring real-time moving directions. Inf. Sci. 433, 55–69 (2018)MathSciNetCrossRef Hsueh, Y., Chen, H.: Map matching for low-sampling-rate GPS trajectories by exploring real-time moving directions. Inf. Sci. 433, 55–69 (2018)MathSciNetCrossRef
37.
Zurück zum Zitat Yuan, J., Zheng, Y., Xie, X., Sun, G.: Driving with knowledge from the physical world. In: the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data mining, KDD’11, New York. ACM (2011) Yuan, J., Zheng, Y., Xie, X., Sun, G.: Driving with knowledge from the physical world. In: the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data mining, KDD’11, New York. ACM (2011)
38.
Zurück zum Zitat Deng, Z., Yangyang, H., et al.: A scalable and fast OPTICS for clustering trajectory big data. Clust. Comput. 18, 549–562 (2015)CrossRef Deng, Z., Yangyang, H., et al.: A scalable and fast OPTICS for clustering trajectory big data. Clust. Comput. 18, 549–562 (2015)CrossRef
Metadaten
Titel
Event driven power consumption optimization control model of GPU clusters
verfasst von
Haifeng Wang
Yunpeng Cao
Publikationsdatum
05.01.2019
Verlag
Springer US
Erschienen in
Cluster Computing / Ausgabe 3/2019
Print ISSN: 1386-7857
Elektronische ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-018-02886-x

Weitere Artikel der Ausgabe 3/2019

Cluster Computing 3/2019 Zur Ausgabe