poster

BRB: BetteR Batch Scheduling to Reduce Tail Latencies in Cloud Data Stores

Authors:
Waleed Reda

Université catholique de Louvain, Louvain-la-Neuve, Belgium

Université catholique de Louvain, Louvain-la-Neuve, Belgium
View Profile

,
Lalith Suresh

TU Berlin, Berlin, Germany

TU Berlin, Berlin, Germany
View Profile

,
Marco Canini

Université catholique de Louvain, Louvain-la-Neuve, Belgium

Université catholique de Louvain, Louvain-la-Neuve, Belgium
View Profile

,
Sean Braithwaite

SoundCloud, Berlin, Germany

SoundCloud, Berlin, Germany
View Profile

ACM SIGCOMM Computer Communication Review Volume 45 Issue 4October 2015pp 607–608https://doi.org/10.1145/2829988.2790023

Published:17 August 2015Publication History

ACM SIGCOMM Computer Communication Review

Abstract

A common pattern in the architectures of modern interactive web-services is that of large request fan-outs, where even a single end-user request (task) arriving at an application server triggers tens to thousands of data accesses (sub-tasks) to different stateful backend servers. The overall response time of each task is bottlenecked by the completion time of the slowest sub-task, making such workloads highly sensitive to the tail of latency distribution of the backend tier. The large number of decentralized application servers and skewed workload patterns exacerbate the challenge in addressing this problem. We address these challenges through BetteR Batch (BRB). By carefully scheduling requests in a decentralized and task-aware manner, BRB enables low-latency distributed storage systems to deliver predictable performance in the presence of large request fan-outs. Our preliminary simulation results based on production workloads show that our proposed design is at the 99th percentile latency within 38% of an ideal system model while offering latency improvements over the state-of-the-art by a factor of 2.

References

B. Atikoglu, Y. Xu, E. Frachtenberg, S. Jiang, and M. Paleczny. Workload Analysis of a Large-scale Key-value Store. In SIGMETRICS, 2012. Google ScholarDigital Library
M. Chowdhury, Y. Zhong, and I. Stoica. Efficient Coflow Scheduling with Varys. In SIGCOMM, 2014. Google ScholarDigital Library
J. Dean and L. A. Barroso. The Tail At Scale. Communications of the ACM, 56(2):74--80, 2013. Google ScholarDigital Library
D. Shue, M. J. Freedman, and A. Shaikh. Performance Isolation and Fairness for Multi-tenant Cloud Storage. In OSDI, 2012. Google ScholarDigital Library
L. Suresh, M. Canini, S. Schmid, and A. Feldmann. C3: Cutting Tail Latency in Cloud Data Stores via Adaptive Replica Selection. In NSDI, 2015. Google ScholarDigital Library

Index Terms

BRB: BetteR Batch Scheduling to Reduce Tail Latencies in Cloud Data Stores
1. General and reference
  1. Cross-computing tools and techniques
    1. Performance
2. Information systems
  1. Information storage systems
    1. Storage architectures
      1. Distributed storage

Recommendations

BRB: BetteR Batch Scheduling to Reduce Tail Latencies in Cloud Data Stores
SIGCOMM '15: Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication

A common pattern in the architectures of modern interactive web-services is that of large request fan-outs, where even a single end-user request (task) arriving at an application server triggers tens to thousands of data accesses (sub-tasks) to ...
Read More
Virtual Batching: Request Batching for Server Energy Conservation in Virtualized Data Centers

Many power management strategies have been proposed for enterprise servers based on dynamic voltage and frequency scaling (DVFS), but those solutions cannot further reduce the energy consumption of a server when the server processor is already at the ...
Read More
Tail Latency in Datacenter Networks
Modelling, Analysis, and Simulation of Computer and Telecommunication Systems
Abstract
One of the major challenges in cloud service data centers is to satisfy service-level agreements without significant over-provisioning. Achieving predictable performance is critical for many interactive applications. While the focus, particularly ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM SIGCOMM Computer Communication Review Volume 45, Issue 4
SIGCOMM'15
October 2015
659 pages
ISSN:0146-4833
DOI:10.1145/2829988
Editors:
Konstantina Papagiannaki
Telefonica Research, Barcelona, Spain
,
Katerina Argyraki
EPFL, Switzerland
,
Hitesh Ballani
Microsoft Research Cambridge, UK
,
Fabián Bustamante
Northwestern University, USA
,
Joseph Camp
SMU, USA
,
Augustin Chaintreau
Columbia University, USA
,
Phillipa Gill
Stony Brook University, USA
,
Marco Mellia
Politecnico di Torino, Italy
,
Bhaskaran Raman
IIT Bombay, India
,
Joel Sommers
Colgate University, USA
,
Aline Carneiro Viana
INRIA, France
Issue’s Table of Contents
SIGCOMM '15: Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication
August 2015
684 pages
ISBN:9781450335423
DOI:10.1145/2785956
General Chairs:
Steve Uhlig
Queen Mary University of London, UK
,
Olaf Maennel
Tallinn U. of Technology in Estonia, Estonia
,
Program Chairs:
Brad Karp
University College London, UK
,
Jitendra Padhye
Microsoft, USA
Copyright © 2015 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 17 August 2015
Check for updates
Author Tags
batches
data centers
data storesle
load balancing
tail latency
Qualifiers
- poster
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 271
  Total Downloads
- Downloads (Last 12 months)24
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

BRB: BetteR Batch Scheduling to Reduce Tail Latencies in Cloud Data Stores

ACM SIGCOMM Computer Communication Review

Abstract

References

Cited By

Index Terms

Recommendations

BRB: BetteR Batch Scheduling to Reduce Tail Latencies in Cloud Data Stores

Virtual Batching: Request Batching for Server Energy Conservation in Virtualized Data Centers

Tail Latency in Datacenter Networks