Skip to main content
Top
Published in: Queueing Systems 1-2/2022

28-07-2022

WCFS: a new framework for analyzing multiserver systems

Authors: Isaac Grosof, Mor Harchol-Balter, Alan Scheller-Wolf

Published in: Queueing Systems | Issue 1-2/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Multiserver queueing systems are found at the core of a wide variety of practical systems. Many important multiserver models have a previously-unexplained similarity: identical mean response time behavior is empirically observed in the heavy traffic limit. We explain this similarity for the first time. We do so by introducing the work-conserving finite-skip (WCFS) framework, which encompasses a broad class of important models. This class includes the heterogeneous M/G/k, the Limited Processor Sharing policy for the M/G/1, the Threshold Parallelism model and the Multiserver-Job model under a novel scheduling algorithm. We prove that for all WCFS models, scaled mean response time \(E[T](1-\rho )\) converges to the same value, \(E[S^2]/(2E[S])\), in the heavy-traffic limit, which is also the heavy traffic limit for the M/G/1/FCFS. Moreover, we prove additively tight bounds on mean response time for the WCFS class, which hold for all load \(\rho \). For each of the four models mentioned above, our bounds are the first known bounds on mean response time.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Footnotes
1
This assumption is defined in Sect. 2.3.
 
2
The data was published in a scaled form [15]. We rescale the data so the smallest job in the trace uses one normalized CPU.
 
Literature
1.
go back to reference Nathuji, R., Isci, C., Gorbatov, E.: Exploiting platform heterogeneity for power efficient data centers. In: Fourth International Conference on Autonomic Computing (ICAC’07), p. 5 (2007) Nathuji, R., Isci, C., Gorbatov, E.: Exploiting platform heterogeneity for power efficient data centers. In: Fourth International Conference on Autonomic Computing (ICAC’07), p. 5 (2007)
2.
go back to reference Mars, J., Tang, L., Hundt, R.: Heterogeneity in “homogeneous’’ warehouse-scale computers: a performance opportunity. IEEE Comput. Arch. Lett. 10(2), 29–32 (2011)CrossRef Mars, J., Tang, L., Hundt, R.: Heterogeneity in “homogeneous’’ warehouse-scale computers: a performance opportunity. IEEE Comput. Arch. Lett. 10(2), 29–32 (2011)CrossRef
3.
go back to reference Cho, H.-D., Engineer, P.D.P., Chung, K., Kim, T.: Benefits of the big. LITTLE architecture. EETimes, (2012) Cho, H.-D., Engineer, P.D.P., Chung, K., Kim, T.: Benefits of the big. LITTLE architecture. EETimes, (2012)
4.
go back to reference Yashkov, S., Yashkova, A.: Processor sharing: a survey of the mathematical theory. Autom. Remote Control 68(9), 1662–1731 (2007)CrossRef Yashkov, S., Yashkova, A.: Processor sharing: a survey of the mathematical theory. Autom. Remote Control 68(9), 1662–1731 (2007)CrossRef
5.
go back to reference Nuyens, M., Van Der Weij, W.: Monotonicity in the limited processor sharing queue. Resource 4, 7 (2008) Nuyens, M., Van Der Weij, W.: Monotonicity in the limited processor sharing queue. Resource 4, 7 (2008)
6.
go back to reference Telek, M., Van Houdt, B.: Response time distribution of a class of limited processor sharing queues. SIGMETRICS Perform. Eval. Rev. 45(3), 143–155 (2018)CrossRef Telek, M., Van Houdt, B.: Response time distribution of a class of limited processor sharing queues. SIGMETRICS Perform. Eval. Rev. 45(3), 143–155 (2018)CrossRef
7.
go back to reference Zhang, J., Zwart, B.: Steady state approximations of limited processor sharing queues in heavy traffic. Queueing Syst. 60(3), 227–246 (2008)CrossRef Zhang, J., Zwart, B.: Steady state approximations of limited processor sharing queues in heavy traffic. Queueing Syst. 60(3), 227–246 (2008)CrossRef
8.
go back to reference Gupta, V., Harchol-Balter, M.: Self-adaptive admission control policies for resource-sharing systems. SIGMETRICS Perform. Eval. Rev. 37(1), 311–322 (2009)CrossRef Gupta, V., Harchol-Balter, M.: Self-adaptive admission control policies for resource-sharing systems. SIGMETRICS Perform. Eval. Rev. 37(1), 311–322 (2009)CrossRef
9.
go back to reference Delimitrou, C., Kozyrakis, C.: Quasar: Resource-efficient and QoS-aware cluster management. In: Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems. ASPLOS ’14, pp. 127–144 (2014) Delimitrou, C., Kozyrakis, C.: Quasar: Resource-efficient and QoS-aware cluster management. In: Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems. ASPLOS ’14, pp. 127–144 (2014)
10.
go back to reference Peng, Y., Bao, Y., Chen, Y., Wu, C., Guo, C.: Optimus: an efficient dynamic resource scheduler for deep learning clusters. In: Proceedings of the Thirteenth EuroSys Conference. EuroSys ’18 (2018) Peng, Y., Bao, Y., Chen, Y., Wu, C., Guo, C.: Optimus: an efficient dynamic resource scheduler for deep learning clusters. In: Proceedings of the Thirteenth EuroSys Conference. EuroSys ’18 (2018)
11.
go back to reference Maguluri, S.T., Srikant, R., Ying, L.: Stochastic models of load balancing and scheduling in cloud computing clusters. In: 2012 Proceedings IEEE Infocom, pp. 702–710. IEEE, Orlando (2012) Maguluri, S.T., Srikant, R., Ying, L.: Stochastic models of load balancing and scheduling in cloud computing clusters. In: 2012 Proceedings IEEE Infocom, pp. 702–710. IEEE, Orlando (2012)
12.
go back to reference Feitelson, D.G., Rudolph, L., Schwiegelshohn, U.: Parallel job scheduling—a status report. In: Workshop on Job Scheduling Strategies for Parallel Processing, pp. 1–16. Springer, New York (2004) Feitelson, D.G., Rudolph, L., Schwiegelshohn, U.: Parallel job scheduling—a status report. In: Workshop on Job Scheduling Strategies for Parallel Processing, pp. 1–16. Springer, New York (2004)
13.
go back to reference Srinivasan, S., Kettimuthu, R., Subramani, V., Sadayappan, P.: Characterization of backfilling strategies for parallel job scheduling. In: Proceedings of International Conference on Parallel Processing Workshop, pp. 514–519 (2002) Srinivasan, S., Kettimuthu, R., Subramani, V., Sadayappan, P.: Characterization of backfilling strategies for parallel job scheduling. In: Proceedings of International Conference on Parallel Processing Workshop, pp. 514–519 (2002)
14.
go back to reference Carastan-Santos, D., De Camargo, R.Y., Trystram, D., Zrigui, S.: One can only gain by replacing easy backfilling: a simple scheduling policies case study. In: 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), pp. 1–10 (2019) Carastan-Santos, D., De Camargo, R.Y., Trystram, D., Zrigui, S.: One can only gain by replacing easy backfilling: a simple scheduling policies case study. In: 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), pp. 1–10 (2019)
15.
go back to reference Tirmazi, M., Barker, A., Deng, N., Haque, M.E., Qin, Z.G., Hand, S., Harchol-Balter, M., Wilkes, J.: Borg: the next generation. In: Proceedings of the Fifteenth European Conference on Computer Systems. EuroSys ’20 (2020) Tirmazi, M., Barker, A., Deng, N., Haque, M.E., Qin, Z.G., Hand, S., Harchol-Balter, M., Wilkes, J.: Borg: the next generation. In: Proceedings of the Fifteenth European Conference on Computer Systems. EuroSys ’20 (2020)
16.
go back to reference Grosof, I., Harchol-Balter, M., Scheller-Wolf, A.: Stability for two-class multiserver-job systems. arXiv preprint arXiv:2010.00631 (2020) Grosof, I., Harchol-Balter, M., Scheller-Wolf, A.: Stability for two-class multiserver-job systems. arXiv preprint arXiv:​2010.​00631 (2020)
18.
go back to reference Loulou, R.: Multi-channel queues in heavy traffic. J. Appl. Probab. 10(4), 769–777 (1973)CrossRef Loulou, R.: Multi-channel queues in heavy traffic. J. Appl. Probab. 10(4), 769–777 (1973)CrossRef
19.
go back to reference Köllerström, J.: Heavy traffic theory for queues with several servers. I. J. Appl. Probab. 11(3), 544–552 (1974)CrossRef Köllerström, J.: Heavy traffic theory for queues with several servers. I. J. Appl. Probab. 11(3), 544–552 (1974)CrossRef
20.
go back to reference Köllerström, J.: Heavy traffic theory for queues with several servers. II. J. Appl. Probab. 16(2), 393–401 (1979)CrossRef Köllerström, J.: Heavy traffic theory for queues with several servers. II. J. Appl. Probab. 16(2), 393–401 (1979)CrossRef
21.
go back to reference Kingman, J.: Some inequalities for the queue GI/G/1. Biometrika 49(3/4), 315–324 (1962)CrossRef Kingman, J.: Some inequalities for the queue GI/G/1. Biometrika 49(3/4), 315–324 (1962)CrossRef
22.
go back to reference Gamarnik, D., Momčilović, P.: Steady-state analysis of a multiserver queue in the Halfin-Whitt regime. Adv. Appl. Probab. 40(2), 548–577 (2008)CrossRef Gamarnik, D., Momčilović, P.: Steady-state analysis of a multiserver queue in the Halfin-Whitt regime. Adv. Appl. Probab. 40(2), 548–577 (2008)CrossRef
23.
go back to reference Aghajani, R., Ramanan, K.: The limit of stationary distributions of many-server queues in the Halfin–Whitt regime. Math. Oper. Res. 45(3), 1016–1055 (2020)CrossRef Aghajani, R., Ramanan, K.: The limit of stationary distributions of many-server queues in the Halfin–Whitt regime. Math. Oper. Res. 45(3), 1016–1055 (2020)CrossRef
24.
go back to reference Dai, J., Dieker, A., Gao, X.: Validity of heavy-traffic steady-state approximations in many-server queues with abandonment. Queueing Syst. 78(1), 1–29 (2014)CrossRef Dai, J., Dieker, A., Gao, X.: Validity of heavy-traffic steady-state approximations in many-server queues with abandonment. Queueing Syst. 78(1), 1–29 (2014)CrossRef
25.
go back to reference Goldberg, D.A., Li, Y.: Simple and explicit bounds for multi-server queues with universal 1/(1-rho) scaling. arXiv preprint arXiv:1706.04628 (2017) Goldberg, D.A., Li, Y.: Simple and explicit bounds for multi-server queues with universal 1/(1-rho) scaling. arXiv preprint arXiv:​1706.​04628 (2017)
26.
go back to reference Efrosinin, D.V., Rykov, V.V.: On performance characteristics for queueing systems with heterogeneous servers. Autom. Remote Control 69(1), 61–75 (2008)CrossRef Efrosinin, D.V., Rykov, V.V.: On performance characteristics for queueing systems with heterogeneous servers. Autom. Remote Control 69(1), 61–75 (2008)CrossRef
27.
go back to reference Alves, F., Yehia, H., Pedrosa, L., Cruz, F., Kerbache, L.: Upper bounds on performance measures of heterogeneous M/M/c queues. Math. Probl. Eng. 2011 (2011) Alves, F., Yehia, H., Pedrosa, L., Cruz, F., Kerbache, L.: Upper bounds on performance measures of heterogeneous M/M/c queues. Math. Probl. Eng. 2011 (2011)
28.
go back to reference Efrosinin, D., Stepanova, N., Sztrik, J., Plank, A.: Approximations in performance analysis of a controllable queueing system with heterogeneous servers. Mathematics 8(10), 1803 (2020)CrossRef Efrosinin, D., Stepanova, N., Sztrik, J., Plank, A.: Approximations in performance analysis of a controllable queueing system with heterogeneous servers. Mathematics 8(10), 1803 (2020)CrossRef
29.
go back to reference Lin, W., Kumar, P.: Optimal control of a queueing system with two heterogeneous servers. IEEE Trans. Autom. Control 29(8), 696–703 (1984)CrossRef Lin, W., Kumar, P.: Optimal control of a queueing system with two heterogeneous servers. IEEE Trans. Autom. Control 29(8), 696–703 (1984)CrossRef
30.
go back to reference Van Harten, A., Sleptchenko, A.: On Markovian multi-class, multi-server queueing. Queueing Syst. 43(4), 307–328 (2003)CrossRef Van Harten, A., Sleptchenko, A.: On Markovian multi-class, multi-server queueing. Queueing Syst. 43(4), 307–328 (2003)CrossRef
31.
go back to reference Boxma, O.J., Deng, Q., Zwart, A.P.: Waiting-time asymptotics for the M/G/2 queue with heterogeneous servers. Queueing Syst. 40(1), 5–31 (2002)CrossRef Boxma, O.J., Deng, Q., Zwart, A.P.: Waiting-time asymptotics for the M/G/2 queue with heterogeneous servers. Queueing Syst. 40(1), 5–31 (2002)CrossRef
32.
go back to reference Keaogile, T., Fatai Adewole, A., Ramasamy, S.: Geo (\(\lambda \))/Geo (\(\mu \))+ G/2 queues with heterogeneous servers operating under FCFS queue discipline. Am. J. Appl. Math. Stat 3(2), 54–58 (2015) Keaogile, T., Fatai Adewole, A., Ramasamy, S.: Geo (\(\lambda \))/Geo (\(\mu \))+ G/2 queues with heterogeneous servers operating under FCFS queue discipline. Am. J. Appl. Math. Stat 3(2), 54–58 (2015)
33.
go back to reference Sani, S., Daman, O.A.: The M/G/2 queue with heterogeneous servers under a controlled service discipline: stationary performance analysis. IAENG Int. J. Appl. Math. 45(1) (2015) Sani, S., Daman, O.A.: The M/G/2 queue with heterogeneous servers under a controlled service discipline: stationary performance analysis. IAENG Int. J. Appl. Math. 45(1) (2015)
34.
go back to reference Ramasamy, S., Daman, O.A., Sani, S.: An M/G/2 queue where customers are served subject to a minimum violation of FCFS queue discipline. Eur. J. Oper. Res. 240(1), 140–146 (2015)CrossRef Ramasamy, S., Daman, O.A., Sani, S.: An M/G/2 queue where customers are served subject to a minimum violation of FCFS queue discipline. Eur. J. Oper. Res. 240(1), 140–146 (2015)CrossRef
35.
go back to reference Zhang, J., Dai, J.G., Zwart, B.: Law of large number limits of limited processor-sharing queues. Math. Oper. Res. 34(4), 937–970 (2009)CrossRef Zhang, J., Dai, J.G., Zwart, B.: Law of large number limits of limited processor-sharing queues. Math. Oper. Res. 34(4), 937–970 (2009)CrossRef
36.
go back to reference Zhang, J., Dai, J.G., Zwart, B.: Diffusion limits of limited processor sharing queues. Ann. Appl. Probab. 21(2), 745–799 (2011)CrossRef Zhang, J., Dai, J.G., Zwart, B.: Diffusion limits of limited processor sharing queues. Ann. Appl. Probab. 21(2), 745–799 (2011)CrossRef
37.
go back to reference Harchol-Balter, M.: Performance Modeling and Design of Computer Systems: Queueing Theory in Action. Cambridge University Press, Cambridge (2013)CrossRef Harchol-Balter, M.: Performance Modeling and Design of Computer Systems: Queueing Theory in Action. Cambridge University Press, Cambridge (2013)CrossRef
38.
go back to reference Berg, B., Dorsman, J.-P., Harchol-Balter, M.: Towards optimality in parallel scheduling. Proc. ACM Meas. Anal. Comput. Syst. 1(2) (2017) Berg, B., Dorsman, J.-P., Harchol-Balter, M.: Towards optimality in parallel scheduling. Proc. ACM Meas. Anal. Comput. Syst. 1(2) (2017)
39.
go back to reference Berg, B., Harchol-Balter, M.: Optimal scheduling of parallel jobs with unknown service requirements. In: Handbook of Research on Methodologies and Applications of Supercomputing, pp. 18–40. IGI Global, Hershey (2021) Berg, B., Harchol-Balter, M.: Optimal scheduling of parallel jobs with unknown service requirements. In: Handbook of Research on Methodologies and Applications of Supercomputing, pp. 18–40. IGI Global, Hershey (2021)
40.
go back to reference Berg, B., Harchol-Balter, M., Moseley, B., Wang, W., Whitehouse, J.: Optimal resource allocation for elastic and inelastic jobs. In: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures. SPAA ’20, pp. 75–87 (2020) Berg, B., Harchol-Balter, M., Moseley, B., Wang, W., Whitehouse, J.: Optimal resource allocation for elastic and inelastic jobs. In: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures. SPAA ’20, pp. 75–87 (2020)
41.
go back to reference Brill, P.H., Green, L.: Queues in which customers receive simultaneous service from a random number of servers: a system point approach. Manag. Sci. 30(1), 51–68 (1984)CrossRef Brill, P.H., Green, L.: Queues in which customers receive simultaneous service from a random number of servers: a system point approach. Manag. Sci. 30(1), 51–68 (1984)CrossRef
42.
go back to reference Rumyantsev, A., Morozov, E.: Stability criterion of a multiserver model with simultaneous service. Ann. Oper. Res. 252(1), 29–39 (2017)CrossRef Rumyantsev, A., Morozov, E.: Stability criterion of a multiserver model with simultaneous service. Ann. Oper. Res. 252(1), 29–39 (2017)CrossRef
43.
go back to reference Hong, Y., Wang, W.: Sharp zero-queueing bounds for multi-server jobs (2021) Hong, Y., Wang, W.: Sharp zero-queueing bounds for multi-server jobs (2021)
44.
go back to reference Ghaderi, J.: Randomized algorithms for scheduling VMs in the cloud. In: IEEE INFOCOM 2016—The 35th Annual IEEE International Conference on Computer Communications, pp. 1–9 (2016) Ghaderi, J.: Randomized algorithms for scheduling VMs in the cloud. In: IEEE INFOCOM 2016—The 35th Annual IEEE International Conference on Computer Communications, pp. 1–9 (2016)
45.
go back to reference Psychas, K., Ghaderi, J.: Randomized algorithms for scheduling multi-resource jobs in the cloud. IEEE/ACM Trans. Netw. 26(5), 2202–2215 (2018)CrossRef Psychas, K., Ghaderi, J.: Randomized algorithms for scheduling multi-resource jobs in the cloud. IEEE/ACM Trans. Netw. 26(5), 2202–2215 (2018)CrossRef
46.
go back to reference Psychas, K., Ghaderi, J.: On non-preemptive VM scheduling in the cloud. Proc. ACM Meas. Anal. Comput. Syst. 1(2), 35–13529 (2017)CrossRef Psychas, K., Ghaderi, J.: On non-preemptive VM scheduling in the cloud. Proc. ACM Meas. Anal. Comput. Syst. 1(2), 35–13529 (2017)CrossRef
47.
go back to reference Maguluri, S.T., Srikant, R.: Scheduling jobs with unknown duration in clouds. IEEE/ACM Trans. Netw. 22(6), 1938–1951 (2014)CrossRef Maguluri, S.T., Srikant, R.: Scheduling jobs with unknown duration in clouds. IEEE/ACM Trans. Netw. 22(6), 1938–1951 (2014)CrossRef
48.
go back to reference Baccelli, F., Foss, S.: On the saturation rule for the stability of queues. J. Appl. Probab. 32(2), 494–507 (1995)CrossRef Baccelli, F., Foss, S.: On the saturation rule for the stability of queues. J. Appl. Probab. 32(2), 494–507 (1995)CrossRef
49.
go back to reference Foss, S., Konstantopoulos, T.: An overview of some stochastic stability methods. J. Oper. Res. Soc. Jpn. 47(4), 275–303 (2004) Foss, S., Konstantopoulos, T.: An overview of some stochastic stability methods. J. Oper. Res. Soc. Jpn. 47(4), 275–303 (2004)
50.
go back to reference Miyazawa, M.: Rate conservation laws: a survey. Queueing Syst. 15(1), 1–58 (1994)CrossRef Miyazawa, M.: Rate conservation laws: a survey. Queueing Syst. 15(1), 1–58 (1994)CrossRef
51.
go back to reference Grosof, I., Scully, Z., Harchol-Balter, M.: SRPT for multiserver systems. Perform. Eval. 127–128, 154–175 (2018)CrossRef Grosof, I., Scully, Z., Harchol-Balter, M.: SRPT for multiserver systems. Perform. Eval. 127–128, 154–175 (2018)CrossRef
52.
go back to reference Sigman, K., Yao, D.D.: Finite moments for inventory processes. Ann. Appl. Probab. 4, 765–778 (1994)CrossRef Sigman, K., Yao, D.D.: Finite moments for inventory processes. Ann. Appl. Probab. 4, 765–778 (1994)CrossRef
53.
go back to reference Scheller-Wolf, A.: Finite moment conditions for stationary content processes with applications to fluid models and queues. PhD thesis, Columbia University (1996) Scheller-Wolf, A.: Finite moment conditions for stationary content processes with applications to fluid models and queues. PhD thesis, Columbia University (1996)
Metadata
Title
WCFS: a new framework for analyzing multiserver systems
Authors
Isaac Grosof
Mor Harchol-Balter
Alan Scheller-Wolf
Publication date
28-07-2022
Publisher
Springer US
Published in
Queueing Systems / Issue 1-2/2022
Print ISSN: 0257-0130
Electronic ISSN: 1572-9443
DOI
https://doi.org/10.1007/s11134-022-09848-6

Other articles of this Issue 1-2/2022

Queueing Systems 1-2/2022 Go to the issue

Premium Partner