Skip to main content
Erschienen in: Queueing Systems 3-4/2016

02.07.2016

Queueing with redundant requests: exact analysis

verfasst von: Kristen Gardner, Samuel Zbarsky, Sherwin Doroudi, Mor Harchol-Balter, Esa Hyytiä, Alan Scheller-Wolf

Erschienen in: Queueing Systems | Ausgabe 3-4/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Recent computer systems research has proposed using redundant requests to reduce latency. The idea is to run a request on multiple servers and wait for the first completion (discarding all remaining copies of the request). However, there is no exact analysis of systems with redundancy. This paper presents the first exact analysis of systems with redundancy. We allow for any number of classes of redundant requests, any number of classes of non-redundant requests, any degree of redundancy, and any number of heterogeneous servers. In all cases we derive the limiting distribution of the state of the system. In small (two or three server) systems, we derive simple forms for the distribution of response time of both the redundant classes and non-redundant classes, and we quantify the “gain” to redundant classes and “pain” to non-redundant classes caused by redundancy. We find some surprising results. First, the response time of a fully redundant class follows a simple exponential distribution and that of the non-redundant class follows a generalized hyperexponential. Second, fully redundant classes are “immune” to any pain caused by other classes becoming redundant. We also compare redundancy with other approaches for reducing latency, such as optimal probabilistic splitting of a class among servers (Opt-Split) and join-the-shortest-queue (JSQ) routing of a class. We find that, in many cases, redundancy outperforms JSQ and Opt-Split with respect to overall response time, making it an attractive solution.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
This is counterintuitive because, as we will see in Lemma 3, the distribution of response time for class R does not depend on whether class A is redundant or non-redundant.
 
2
A generalized hyperexponential \(H_2(\nu _1,\nu _2,\omega )\) is defined as the weighted mixture of two exponentials with rates \(\nu _1\) and \(\nu _2\), where the first exponential is given weight \(1+\omega \) and the second is given weight \(-\omega \). Note that \(\omega \) can be any real number; it need not be a probability [8].
 
Literatur
1.
Zurück zum Zitat Adan, I., Weiss, G.: A skill based parallel service system under FCFS-ALIS—steady state, overloads, and abandonments. Stoch. Syst. 4(1), 250–299 (2014)CrossRef Adan, I., Weiss, G.: A skill based parallel service system under FCFS-ALIS—steady state, overloads, and abandonments. Stoch. Syst. 4(1), 250–299 (2014)CrossRef
2.
Zurück zum Zitat Ananthanarayanan, G., Ghodsi, A., Shenker, S., Stoica, I.: Effective straggler mitigation: attack of the clones. In NSDI, pp. 185–198, April 2013 Ananthanarayanan, G., Ghodsi, A., Shenker, S., Stoica, I.: Effective straggler mitigation: attack of the clones. In NSDI, pp. 185–198, April 2013
3.
Zurück zum Zitat Ananthanarayanan, G., Hung, M.C.C., Ren, X., Stoica, I., Wierman, A., Yu, M.: Grass: trimming stragglers in approximation analytics. In Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation, pp. 289–302. USENIX Association, 2014 Ananthanarayanan, G., Hung, M.C.C., Ren, X., Stoica, I., Wierman, A., Yu, M.: Grass: trimming stragglers in approximation analytics. In Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation, pp. 289–302. USENIX Association, 2014
4.
Zurück zum Zitat Baccelli, F., Makowski, A.: Simple computable bounds for the fork-join queue. Technical Report RR-0394, Inria, 1985 Baccelli, F., Makowski, A.: Simple computable bounds for the fork-join queue. Technical Report RR-0394, Inria, 1985
5.
Zurück zum Zitat Baccelli, F., Makowski, A.M., Shwartz, A.: The fork-join queue and related systems with synchronization constraints: stochastic ordering and computable bounds. Adv. Appl. Prob. 21, 629–660 (1989)CrossRef Baccelli, F., Makowski, A.M., Shwartz, A.: The fork-join queue and related systems with synchronization constraints: stochastic ordering and computable bounds. Adv. Appl. Prob. 21, 629–660 (1989)CrossRef
6.
Zurück zum Zitat Bassamboo, A., Randhawa, R.S., Mieghem, J.A.V.: A little flexibility is all you need: on the value of flexible resources in queueing systems. Oper. Res. 60, 1423–1435 (2012)CrossRef Bassamboo, A., Randhawa, R.S., Mieghem, J.A.V.: A little flexibility is all you need: on the value of flexible resources in queueing systems. Oper. Res. 60, 1423–1435 (2012)CrossRef
7.
Zurück zum Zitat Borst, S., Boxma, O., Uitert, M.V.: The asymptotic workload behavior of two coupled queues. Queueing Syst. 43(1–2), 81–102 (2003)CrossRef Borst, S., Boxma, O., Uitert, M.V.: The asymptotic workload behavior of two coupled queues. Queueing Syst. 43(1–2), 81–102 (2003)CrossRef
8.
Zurück zum Zitat Botta, R.F., Harris, C.M., Marchal, W.G.: Characterizations of generalized hyperexponential distribution functions. Commun. Stat. Stoch. Models 3(1), 115–148 (1987)CrossRef Botta, R.F., Harris, C.M., Marchal, W.G.: Characterizations of generalized hyperexponential distribution functions. Commun. Stat. Stoch. Models 3(1), 115–148 (1987)CrossRef
9.
Zurück zum Zitat Boxma, O., Koole, G., Liu, Z.: Queueing-theoretic solution methods for models of parallel and distributed systems. In Performance Evaluation of Parallel and Distributed Systems Solution Methods. CWI Tract 105 & 106, pp. 1–24, 1994 Boxma, O., Koole, G., Liu, Z.: Queueing-theoretic solution methods for models of parallel and distributed systems. In Performance Evaluation of Parallel and Distributed Systems Solution Methods. CWI Tract 105 & 106, pp. 1–24, 1994
10.
Zurück zum Zitat Casanova, H.: Benefits and drawbacks of redundant batch requests. J. Grid Comput. 5(2), 235–250 (2007)CrossRef Casanova, H.: Benefits and drawbacks of redundant batch requests. J. Grid Comput. 5(2), 235–250 (2007)CrossRef
11.
Zurück zum Zitat Cohen, J.W., Boxma, O.J.: Boundary Value Problems in Queueing System Analysis. North-Holland Publishing Company, Amsterdam (1983) Cohen, J.W., Boxma, O.J.: Boundary Value Problems in Queueing System Analysis. North-Holland Publishing Company, Amsterdam (1983)
12.
Zurück zum Zitat Dean, J., Barroso, L.A.: The tail at scale. Commun. ACM 56(2), 74–80 (2013)CrossRef Dean, J., Barroso, L.A.: The tail at scale. Commun. ACM 56(2), 74–80 (2013)CrossRef
13.
Zurück zum Zitat Fayolle, G., Iasnogorodski, R.: Two coupled processors: the reduction to a Riemann-Hilbert problem. Zeitschrift fur Wahrscheinlichkeitstheorie und vervandte Gebiete 47(3), 325–351 (1979)CrossRef Fayolle, G., Iasnogorodski, R.: Two coupled processors: the reduction to a Riemann-Hilbert problem. Zeitschrift fur Wahrscheinlichkeitstheorie und vervandte Gebiete 47(3), 325–351 (1979)CrossRef
14.
Zurück zum Zitat Flatto, L.: Two parallel queues created by arrivals with two demands II. SIAM J. Appl. Math. 45(5), 1159–1166 (1985)CrossRef Flatto, L.: Two parallel queues created by arrivals with two demands II. SIAM J. Appl. Math. 45(5), 1159–1166 (1985)CrossRef
15.
Zurück zum Zitat Flatto, L., Hahn, S.: Two parallel queues created by arrivals with two demands I. SIAM J. Appl. Math. 44(5), 250–255 (1984)CrossRef Flatto, L., Hahn, S.: Two parallel queues created by arrivals with two demands I. SIAM J. Appl. Math. 44(5), 250–255 (1984)CrossRef
16.
Zurück zum Zitat Hall, P.: On representatives of subsets. J. Lond. Math. Soc. 10(1), 26–30 (1935) Hall, P.: On representatives of subsets. J. Lond. Math. Soc. 10(1), 26–30 (1935)
17.
Zurück zum Zitat Harchol-Balter, M., Li, C., Osogami, T., Scheller-Wolf, A., Squillante, M.: Cycle stealing under immediate dispatch task assignment. In Annual Symposium on Parallel Algorithms and Architectures, pp. 274–285, June 2003 Harchol-Balter, M., Li, C., Osogami, T., Scheller-Wolf, A., Squillante, M.: Cycle stealing under immediate dispatch task assignment. In Annual Symposium on Parallel Algorithms and Architectures, pp. 274–285, June 2003
18.
Zurück zum Zitat Hooghiemstra, G., Keane, M., de Ree, S.V.: Power series for stationary distributions of coupled processor models. SIAM J. Appl. Math. 48(5), 861–878 (1988)CrossRef Hooghiemstra, G., Keane, M., de Ree, S.V.: Power series for stationary distributions of coupled processor models. SIAM J. Appl. Math. 48(5), 861–878 (1988)CrossRef
19.
Zurück zum Zitat Huang, H., Hung, W., Shin, K.G.: FS2: dynamic data replication in free disk space for improving disk performance and energy consumption. In Proceedings of the SOSP’05, pp. 263–276, December 2005 Huang, H., Hung, W., Shin, K.G.: FS2: dynamic data replication in free disk space for improving disk performance and energy consumption. In Proceedings of the SOSP’05, pp. 263–276, December 2005
20.
Zurück zum Zitat Joshi, G., Liu, Y., Soljanin, E.: Coding for fast content download. In Allerton Conference’12, pp. 326–333, 2012 Joshi, G., Liu, Y., Soljanin, E.: Coding for fast content download. In Allerton Conference’12, pp. 326–333, 2012
21.
Zurück zum Zitat Joshi, G., Liu, Y., Soljanin, E.: On the delay-storage trade-off in content download from coded distributed storage systems. IEEE J. Sel. Areas Commun. 32(5), 989–997 (2014)CrossRef Joshi, G., Liu, Y., Soljanin, E.: On the delay-storage trade-off in content download from coded distributed storage systems. IEEE J. Sel. Areas Commun. 32(5), 989–997 (2014)CrossRef
22.
Zurück zum Zitat Keilson, J., Servi, L.: A distributional form of Little’s Law. Oper. Res. Lett. 7(5), 223–227 (1988)CrossRef Keilson, J., Servi, L.: A distributional form of Little’s Law. Oper. Res. Lett. 7(5), 223–227 (1988)CrossRef
23.
Zurück zum Zitat Kim, C., Agrawala, A.K.: Analysis of the fork-join queue. IEEE Trans. Comput. 38(2), 1041–1053 (1989)CrossRef Kim, C., Agrawala, A.K.: Analysis of the fork-join queue. IEEE Trans. Comput. 38(2), 1041–1053 (1989)CrossRef
24.
Zurück zum Zitat Konheim, A.G., Meilijson, I., Melkman, A.: Processor-sharing of two parallel lines. J. Appl. Prob. 18(4), 952–956 (1981)CrossRef Konheim, A.G., Meilijson, I., Melkman, A.: Processor-sharing of two parallel lines. J. Appl. Prob. 18(4), 952–956 (1981)CrossRef
25.
Zurück zum Zitat Koole, G., Righter, R.: Resource allocation in grid computing. J. Sched. 11, 163–173 (2009)CrossRef Koole, G., Righter, R.: Resource allocation in grid computing. J. Sched. 11, 163–173 (2009)CrossRef
26.
Zurück zum Zitat Nelson, R., Tantawi, A.N.: Approximate analysis of fork/join synchronization in parallel queues. IEEE Trans. Comput. 37(6), 739–743 (1988)CrossRef Nelson, R., Tantawi, A.N.: Approximate analysis of fork/join synchronization in parallel queues. IEEE Trans. Comput. 37(6), 739–743 (1988)CrossRef
27.
Zurück zum Zitat Osogami, T., Harchol-Balter, M., Scheller-Wolf, A.: Analysis of cycle stealing with switching times and thresholds. In SIGMETRICS, pp. 184–195, June 2003 Osogami, T., Harchol-Balter, M., Scheller-Wolf, A.: Analysis of cycle stealing with switching times and thresholds. In SIGMETRICS, pp. 184–195, June 2003
28.
Zurück zum Zitat Ren, X., Ananthanarayanan, G., Wierman, A., Yu, M.: Hopper: decentralized speculation-aware cluster scheduling at scale Ren, X., Ananthanarayanan, G., Wierman, A., Yu, M.: Hopper: decentralized speculation-aware cluster scheduling at scale
29.
Zurück zum Zitat Shah, N.B., Lee, K., Ramchandran, K.: The MDS queue: analysing latency performance of codes and redundant requests. Technical Report arXiv:1211.5405, November 2012 Shah, N.B., Lee, K., Ramchandran, K.: The MDS queue: analysing latency performance of codes and redundant requests. Technical Report arXiv:​1211.​5405, November 2012
30.
Zurück zum Zitat Shah, N.B., Lee, K., Ramchandran, K.: When do redundant requests reduce latency? Technical Report arXiv:1311.2851, June 2013 Shah, N.B., Lee, K., Ramchandran, K.: When do redundant requests reduce latency? Technical Report arXiv:​1311.​2851, June 2013
31.
Zurück zum Zitat Stolyar, A.L., Tezcan, T.: Control of systems with flexible multi-server pools: a shadow routing approach. Queueing Syst. 66, 1–51 (2010)CrossRef Stolyar, A.L., Tezcan, T.: Control of systems with flexible multi-server pools: a shadow routing approach. Queueing Syst. 66, 1–51 (2010)CrossRef
32.
Zurück zum Zitat Tsitsiklis, J., Xu, K.: On the power of (even a little) resource pooling. Stoch. Syst. 2, 1–66 (2012)CrossRef Tsitsiklis, J., Xu, K.: On the power of (even a little) resource pooling. Stoch. Syst. 2, 1–66 (2012)CrossRef
33.
Zurück zum Zitat Tsitsiklis, J., Xu, K.: Queueing system topologies with limited flexibility. In SIGMETRICS, 2013 Tsitsiklis, J., Xu, K.: Queueing system topologies with limited flexibility. In SIGMETRICS, 2013
34.
Zurück zum Zitat Visschers, J., Adan, I., Weiss, G.: A product form solution to a system with multi-type jobs and multi-type servers. Queueing Syst. 70, 269–298 (2012)CrossRef Visschers, J., Adan, I., Weiss, G.: A product form solution to a system with multi-type jobs and multi-type servers. Queueing Syst. 70, 269–298 (2012)CrossRef
35.
Zurück zum Zitat Vulimiri, A., Godfrey, P.B., Mittal, R., Sherry, J., Ratnasamy, S., Shenker, S.: Low latency via redundancy. In CoNEXT, pp. 283–294, December 2013 Vulimiri, A., Godfrey, P.B., Mittal, R., Sherry, J., Ratnasamy, S., Shenker, S.: Low latency via redundancy. In CoNEXT, pp. 283–294, December 2013
36.
Zurück zum Zitat Wang, D., Joshi, G., Wornell, G.: Efficient task replication for fast response times in parallel computation. Technical Report arXiv:1404.1328, April 2014 Wang, D., Joshi, G., Wornell, G.: Efficient task replication for fast response times in parallel computation. Technical Report arXiv:​1404.​1328, April 2014
37.
Zurück zum Zitat Xia, C., Liu, Z., Towsley, D., Lelarge, M.: Scalability of fork/join queueing networks with blocking. In SIGMETRICS, pp. 133–144, June 2007 Xia, C., Liu, Z., Towsley, D., Lelarge, M.: Scalability of fork/join queueing networks with blocking. In SIGMETRICS, pp. 133–144, June 2007
38.
Zurück zum Zitat Xu, Y., Bailey, M., Noble, B., Jahanian, F.: Small is better: avoiding latency traps in virtualized data centers. In Proceedings of the 4th annual Symposium on Cloud Computing, p. 7. ACM, 2013 Xu, Y., Bailey, M., Noble, B., Jahanian, F.: Small is better: avoiding latency traps in virtualized data centers. In Proceedings of the 4th annual Symposium on Cloud Computing, p. 7. ACM, 2013
Metadaten
Titel
Queueing with redundant requests: exact analysis
verfasst von
Kristen Gardner
Samuel Zbarsky
Sherwin Doroudi
Mor Harchol-Balter
Esa Hyytiä
Alan Scheller-Wolf
Publikationsdatum
02.07.2016
Verlag
Springer US
Erschienen in
Queueing Systems / Ausgabe 3-4/2016
Print ISSN: 0257-0130
Elektronische ISSN: 1572-9443
DOI
https://doi.org/10.1007/s11134-016-9485-y

Weitere Artikel der Ausgabe 3-4/2016

Queueing Systems 3-4/2016 Zur Ausgabe

EDITORIAL

Introduction

Premium Partner