Skip to main content
Top

2018 | OriginalPaper | Chapter

Elastic CPU Cap Mechanism for Timely Dataflow Applications

Authors : M. Reza Hoseinyfarahabady, Nazanin Farhangsadr, Albert Y. Zomaya, Zahir Tari, Samee U. Khan

Published in: Computational Science – ICCS 2018

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Sudden surges in the incoming workload can cause adverse consequences on the run-time performance of data-flow applications. Our work addresses the problem of limiting CPU associated with the elastic scaling of timely data-flow (TDF) applications running in a shared computing environment while each application can possess a different quality of service (QoS) requirement. The key argument here is that an unwise consolidation decision to dynamically scale up/out the computing resources for responding to unexpected workload changes can degrade the performance of some (if not all) collocated applications due to their fierce competition getting the shared resources (such as the last level cache). The proposed solution uses a queue-based model to predict the performance degradation of running data-flow applications together. The problem of CPU cap adjustment is addressed as an optimization problem, where the aim is to reduce the quality of service violation incidents among applications while raising the CPU utilization level of server nodes as well as preventing the formation of bottlenecks due to the fierce competition among collocated applications. The controller uses and efficient dynamic method to find a solution at each round of the controlling epoch. The performance evaluation is carried out by comparing the proposed controller against an enhanced QoS-aware version of round robin strategy which is deployed in many commercial packages. Experimental results confirmed that the proposed solution improves QoS satisfaction by near to 148% on average while it can reduce the latency of processing data records for applications in the highest QoS classes by near to 19% during workload surges.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Abdelwahed, S., et al.: On the application of MPC techniques for adaptive performance management of computing systems. IEEE Trans. Netw. Serv. Manag. 6(4), 212–225 (2009)CrossRef Abdelwahed, S., et al.: On the application of MPC techniques for adaptive performance management of computing systems. IEEE Trans. Netw. Serv. Manag. 6(4), 212–225 (2009)CrossRef
3.
go back to reference Akidau, T., Balikov, A., et al.: Millwheel: fault-tolerant stream processing at internet scale. Proc. VLDB Endow. 6(11), 1033–1044 (2013)CrossRef Akidau, T., Balikov, A., et al.: Millwheel: fault-tolerant stream processing at internet scale. Proc. VLDB Endow. 6(11), 1033–1044 (2013)CrossRef
4.
go back to reference Allen, A.O.: Probability, Statistics, and Queueing Theory. Academic Press, Cambridge (2014) Allen, A.O.: Probability, Statistics, and Queueing Theory. Academic Press, Cambridge (2014)
5.
go back to reference Bolch, G., Greiner, S., de Meer, H., Trivedi, K.S.: Queueing Networks and Markov Chains. Wiley, Hoboken (2006)CrossRef Bolch, G., Greiner, S., de Meer, H., Trivedi, K.S.: Queueing Networks and Markov Chains. Wiley, Hoboken (2006)CrossRef
7.
go back to reference Chen, L., Shen, H.: Considering resource demand misalignments to reduce resource over-provisioning in cloud. In: IEEE Conference on Computer Communications (2017) Chen, L., Shen, H.: Considering resource demand misalignments to reduce resource over-provisioning in cloud. In: IEEE Conference on Computer Communications (2017)
8.
go back to reference Chen, L., Shen, H., Platt, S.: Cache contention aware VM placement & migration in cloud. In: International Conference on Network Protocols, pp. 1–10. IEEE (2016) Chen, L., Shen, H., Platt, S.: Cache contention aware VM placement & migration in cloud. In: International Conference on Network Protocols, pp. 1–10. IEEE (2016)
10.
go back to reference Dagum, P., Karp, R., Luby, M., Ross, S.: An optimal algorithm for Monte Carlo estimation. SIAM J. Comput. 29(5), 1484–1496 (2000)MathSciNetCrossRef Dagum, P., Karp, R., Luby, M., Ross, S.: An optimal algorithm for Monte Carlo estimation. SIAM J. Comput. 29(5), 1484–1496 (2000)MathSciNetCrossRef
11.
go back to reference Dean, J., Ghemawat, S.: Mapreduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)CrossRef Dean, J., Ghemawat, S.: Mapreduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)CrossRef
13.
go back to reference Dudoladov, S., Xu, C., et al.: Optimistic recovery for iterative dataflows in action. In: ACM SIGMOD International Conference on Management of Data, pp. 1439–1443 (2015) Dudoladov, S., Xu, C., et al.: Optimistic recovery for iterative dataflows in action. In: ACM SIGMOD International Conference on Management of Data, pp. 1439–1443 (2015)
14.
go back to reference Hirzel, M., Soulé, R., Schneider, S., Gedik, B., Grimm, R.: A catalog of stream processing optimizations. ACM Comput. Surv. (CSUR) 46(4), 46 (2014)CrossRef Hirzel, M., Soulé, R., Schneider, S., Gedik, B., Grimm, R.: A catalog of stream processing optimizations. ACM Comput. Surv. (CSUR) 46(4), 46 (2014)CrossRef
15.
go back to reference Huang, X., Xue, G., Yu, R., Leng, S.: Joint scheduling and beamforming coordination in cloud radio access networks with qos guarantees. IEEE Trans. Veh. Technol. 65(7), 5449–5460 (2016)CrossRef Huang, X., Xue, G., Yu, R., Leng, S.: Joint scheduling and beamforming coordination in cloud radio access networks with qos guarantees. IEEE Trans. Veh. Technol. 65(7), 5449–5460 (2016)CrossRef
16.
go back to reference Li, B., Diao, Y., Shenoy, P.: Supporting scalable analytics with latency constraints. Proc. VLDB Endow. 8(11), 1166–1177 (2015)CrossRef Li, B., Diao, Y., Shenoy, P.: Supporting scalable analytics with latency constraints. Proc. VLDB Endow. 8(11), 1166–1177 (2015)CrossRef
17.
go back to reference Li, K., Liu, C., Li, K.: An approximation algorithm based on game theory for scheduling simple linear deteriorating jobs. Theor. Comput. Sci. 543, 46–51 (2014)MathSciNetCrossRef Li, K., Liu, C., Li, K.: An approximation algorithm based on game theory for scheduling simple linear deteriorating jobs. Theor. Comput. Sci. 543, 46–51 (2014)MathSciNetCrossRef
19.
go back to reference McSherry, F., Isard, M., et al.: Scalability! but at what cost? In: HotOS (2015) McSherry, F., Isard, M., et al.: Scalability! but at what cost? In: HotOS (2015)
20.
go back to reference Mencagli, G.: Adaptive model predictive control of autonomic distributed parallel computations with variable horizons and switching costs. Concurrency Comput.: Pract. Exp. 28(7), 2187–2212 (2016)CrossRef Mencagli, G.: Adaptive model predictive control of autonomic distributed parallel computations with variable horizons and switching costs. Concurrency Comput.: Pract. Exp. 28(7), 2187–2212 (2016)CrossRef
21.
go back to reference Mencagli, G., Vanneschi, M., Vespa, E.: A cooperative predictive control approach to improve the reconfiguration stability of adaptive distributed parallel applications. ACM Trans. Auton. Adapt. Syst. 9(1), 2 (2014)CrossRef Mencagli, G., Vanneschi, M., Vespa, E.: A cooperative predictive control approach to improve the reconfiguration stability of adaptive distributed parallel applications. ACM Trans. Auton. Adapt. Syst. 9(1), 2 (2014)CrossRef
22.
go back to reference Murray, D.G., McSherry, F., et al.: Naiad: a timely dataflow system. In: ACM Symposium on Operating Systems Principles, pp. 439–455 (2013) Murray, D.G., McSherry, F., et al.: Naiad: a timely dataflow system. In: ACM Symposium on Operating Systems Principles, pp. 439–455 (2013)
23.
go back to reference Padala, P., et al.: Automated control of multiple virtualized resources. In: European Conference on Computer Systems (EuroSys), pp. 13–26. ACM (2009) Padala, P., et al.: Automated control of multiple virtualized resources. In: European Conference on Computer Systems (EuroSys), pp. 13–26. ACM (2009)
24.
go back to reference Rao, J., Zhou, X.: Towards fair and efficient SMP VM scheduling. In: SIGPLAN Symposium on Principles & Practice of Parallel Programming, pp. 273–286. ACM (2014) Rao, J., Zhou, X.: Towards fair and efficient SMP VM scheduling. In: SIGPLAN Symposium on Principles & Practice of Parallel Programming, pp. 273–286. ACM (2014)
25.
go back to reference Rawlings, J.B., Mayne, D.Q.: Model Predictive Control: Theory and Design. Nob Hill Publishing, LLC, Madison (2009) Rawlings, J.B., Mayne, D.Q.: Model Predictive Control: Theory and Design. Nob Hill Publishing, LLC, Madison (2009)
26.
go back to reference Şahin, S.: C-stream: a coroutune-based elastic stream processing engine. Ph.D. thesis, Bilkent University (2015) Şahin, S.: C-stream: a coroutune-based elastic stream processing engine. Ph.D. thesis, Bilkent University (2015)
27.
go back to reference Subramanian, L., Seshadri, V., Ghosh, A., Khan, S., Mutlu, O.: The application slowdown model. In: Microarchitecture (MICRO), pp. 62–75. IEEE (2015) Subramanian, L., Seshadri, V., Ghosh, A., Khan, S., Mutlu, O.: The application slowdown model. In: Microarchitecture (MICRO), pp. 62–75. IEEE (2015)
28.
go back to reference Tanner, M.: Practical Queueing Analysis. McGraw-Hill, New York City (1995) Tanner, M.: Practical Queueing Analysis. McGraw-Hill, New York City (1995)
29.
go back to reference Tembey, P., Gavrilovska, A., et al.: Application & platform-aware RA in consolidated systems. In: Symposium on Cloud Computing, pp. 1–14. ACM (2014) Tembey, P., Gavrilovska, A., et al.: Application & platform-aware RA in consolidated systems. In: Symposium on Cloud Computing, pp. 1–14. ACM (2014)
30.
go back to reference Wang, H., Isci, C., Subramanian, L., Choi, J., Qian, D., Mutlu, O.: A-DRM: architecture-aware distributed resource management of virtualized clusters. ACM SIGPLAN Not. 50(7), 93–106 (2015)CrossRef Wang, H., Isci, C., Subramanian, L., Choi, J., Qian, D., Mutlu, O.: A-DRM: architecture-aware distributed resource management of virtualized clusters. ACM SIGPLAN Not. 50(7), 93–106 (2015)CrossRef
31.
go back to reference Yang, F., Qian, Z., Chen, X., Beschastnikh, I., Zhuang, L., Zhou, L., Shen, J.: Sonora: a platform for continuous mobile-cloud computing. Technical report, Microsoft Research Asia (2012) Yang, F., Qian, Z., Chen, X., Beschastnikh, I., Zhuang, L., Zhou, L., Shen, J.: Sonora: a platform for continuous mobile-cloud computing. Technical report, Microsoft Research Asia (2012)
32.
go back to reference Ye, K., et al.: Profiling-based workload consolidation & migration in VDCs. IEEE Trans. Parallel Distrib. Syst. 26(3), 878–890 (2015)CrossRef Ye, K., et al.: Profiling-based workload consolidation & migration in VDCs. IEEE Trans. Parallel Distrib. Syst. 26(3), 878–890 (2015)CrossRef
33.
go back to reference Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. HotCloud 10(10–10), 95 (2010) Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. HotCloud 10(10–10), 95 (2010)
Metadata
Title
Elastic CPU Cap Mechanism for Timely Dataflow Applications
Authors
M. Reza Hoseinyfarahabady
Nazanin Farhangsadr
Albert Y. Zomaya
Zahir Tari
Samee U. Khan
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-93698-7_42

Premium Partner