Abstract
A common pattern in the architectures of modern interactive web-services is that of large request fan-outs, where even a single end-user request (task) arriving at an application server triggers tens to thousands of data accesses (sub-tasks) to different stateful backend servers. The overall response time of each task is bottlenecked by the completion time of the slowest sub-task, making such workloads highly sensitive to the tail of latency distribution of the backend tier. The large number of decentralized application servers and skewed workload patterns exacerbate the challenge in addressing this problem. We address these challenges through BetteR Batch (BRB). By carefully scheduling requests in a decentralized and task-aware manner, BRB enables low-latency distributed storage systems to deliver predictable performance in the presence of large request fan-outs. Our preliminary simulation results based on production workloads show that our proposed design is at the 99th percentile latency within 38% of an ideal system model while offering latency improvements over the state-of-the-art by a factor of 2.
- B. Atikoglu, Y. Xu, E. Frachtenberg, S. Jiang, and M. Paleczny. Workload Analysis of a Large-scale Key-value Store. In SIGMETRICS, 2012. Google ScholarDigital Library
- M. Chowdhury, Y. Zhong, and I. Stoica. Efficient Coflow Scheduling with Varys. In SIGCOMM, 2014. Google ScholarDigital Library
- J. Dean and L. A. Barroso. The Tail At Scale. Communications of the ACM, 56(2):74--80, 2013. Google ScholarDigital Library
- D. Shue, M. J. Freedman, and A. Shaikh. Performance Isolation and Fairness for Multi-tenant Cloud Storage. In OSDI, 2012. Google ScholarDigital Library
- L. Suresh, M. Canini, S. Schmid, and A. Feldmann. C3: Cutting Tail Latency in Cloud Data Stores via Adaptive Replica Selection. In NSDI, 2015. Google ScholarDigital Library
Index Terms
- BRB: BetteR Batch Scheduling to Reduce Tail Latencies in Cloud Data Stores
Recommendations
BRB: BetteR Batch Scheduling to Reduce Tail Latencies in Cloud Data Stores
SIGCOMM '15: Proceedings of the 2015 ACM Conference on Special Interest Group on Data CommunicationA common pattern in the architectures of modern interactive web-services is that of large request fan-outs, where even a single end-user request (task) arriving at an application server triggers tens to thousands of data accesses (sub-tasks) to ...
Virtual Batching: Request Batching for Server Energy Conservation in Virtualized Data Centers
Many power management strategies have been proposed for enterprise servers based on dynamic voltage and frequency scaling (DVFS), but those solutions cannot further reduce the energy consumption of a server when the server processor is already at the ...
Tail Latency in Datacenter Networks
Modelling, Analysis, and Simulation of Computer and Telecommunication SystemsAbstractOne of the major challenges in cloud service data centers is to satisfy service-level agreements without significant over-provisioning. Achieving predictable performance is critical for many interactive applications. While the focus, particularly ...
Comments