ABSTRACT
An important class of datacenter applications, called Online Data-Intensive (OLDI) applications, includes Web search, online retail, and advertisement. To achieve good user experience, OLDI applications operate under soft-real-time constraints (e.g., 300 ms latency) which imply deadlines for network communication within the applications. Further, OLDI applications typically employ tree-based algorithms which, in the common case, result in bursts of children-to-parent traffic with tight deadlines. Recent work on datacenter network protocols is either deadline-agnostic (DCTCP) or is deadline-aware (D3) but suffers under bursts due to race conditions. Further, D3 has the practical drawbacks of requiring changes to the switch hardware and not being able to coexist with legacy TCP. We propose Deadline-Aware Datacenter TCP (D2TCP), a novel transport protocol, which handles bursts, is deadline-aware, and is readily deployable. In designing D2TCP, we make two contributions: (1) D2TCP uses a distributed and reactive approach for bandwidth allocation which fundamentally enables D2TCP's properties. (2) D2TCP employs a novel congestion avoidance algorithm, which uses ECN feedback and deadlines to modulate the congestion window via a gamma-correction function. Using a small-scale implementation and at-scale simulations, we show that D2TCP reduces the fraction of missed deadlines compared to DCTCP and D3 by 75% and 50%, respectively.
Supplemental Material
- M. Alizadeh, A. G. Greenberg, D. A. Maltz, J. Padhye, P. Patel, B. Prabhakar, S. Sengupta, and M. Sridharan. Data center TCP (DCTCP). In Proc. SIGCOMM, 2010. Google ScholarDigital Library
- Charles A. Poynton (2003). Digital Video and HDTV: Algorithms and Interfaces. Morgan Kaufmann. pp. 260, 630. ISBN 1558607927. Google ScholarDigital Library
- B. Briscoe et. al. Policing Congestion Response in an Internetwork using Re-feedback. In Proc. SIGCOMM 2005. Google ScholarDigital Library
- Datacenter TCP, http://www.stanford.edu/~alizade/Site/DCTCP.htmGoogle Scholar
- Nandita Dukkipati. RCP: Congestion Control to Make Flows Complete Quickly. PhD Thesis, Department of Electrical Engineering, Stanford University, October 2006.Google Scholar
- D. Ferrari, A. Banerjea, and H. Zhang. Network support for multimedia: A discussion of the tenet approach. In Proc. Computer Networks and ISDN Systems, 1994. Google ScholarDigital Library
- S. Floyd. RFC 3649: HighSpeed TCP for large congestion windows. Google ScholarDigital Library
- S. Floyd and V. Jacobson. Random early detection gateways for congestion avoidance. IEEE/ACM Transactions on Networking, 1(4):397--413, 1993. Google ScholarDigital Library
- S. Floyd and V. Jacobson. The synchronization of periodic routing messages. IEEE/ACM Transactions on Networking, 2(2):122--136, 1994. Google ScholarDigital Library
- R. Griffith, Y. Chen, J. Liu, A. Joseph, and R. Katz. Understanding TCP incast throughput collapse in datacenter networks. In WREN Workshop, 2009. Google ScholarDigital Library
- Y. Gu, D. Towsley, C. Hollot, and H. Zhang. Congestion control for small buffer high bandwidth networks. In Proc. INFOCOM, 2007.Google Scholar
- Urs Hoelzle, Jeffrey Dean, and Luiz André Barroso. Web Search for A Planet: The Architecture of the Google Cluster, In IEEE Micro Magazine, April 2003.Google Scholar
- T. Ho. Latency is Everywhere and it Costs You Sales - How to Crush it, July 2009. http://highscalability.com/blog/2009/7/25/latency-iseverywhere-and-it-costs-you-sales-how-to-crush-it.html.Google Scholar
- S. Iyer et. al. Analysis of a memory architecture for fast packet buffers. In IEEE HPSR Workshop, 2001.Google Scholar
- D. Katabi, M. Handley, and C. Rohrs. Congestion Control for High Bandwidth-Delay Product Networks. In Proc. SIGCOMM, 2002. Google ScholarDigital Library
- Sam Liang and David Cheriton. TCP-RTM: Using TCP for Real Time Applications. In Proc. ICNP, 2002.Google Scholar
- C. L. Liu and J. W. Layland. Scheduling Algorithms for Multiprogramming in a Hard-Real-Time Environment. Journal of the ACM, 20(1), 1973. Google ScholarDigital Library
- D. Meisner, C. M. Sadler, L. A. Barroso, W. Weber and T. F. Wenisch. Power Management of Online Data-Intensive Services. In Proc. ISCA, June 2011. Google ScholarDigital Library
- The ns-3 discrete-event network simulator. http://www.nsnam.org/Google Scholar
- R. Pan, B. Prabhakar, and A. Laxmikantha. QCN: Quantized congestion notification an overview. http://www.ieee802.org/1/les/public/docs2007/au_prabhakar_qcn_overview_geneva.pdfGoogle Scholar
- K. Ramakrishnan, S. Floyd, and D. Black. RFC 3168: The addition of explicit congestion notification (ECN) to IP. Google ScholarDigital Library
- I. R. Sangtae Ha and L. Xu. Cubic: A new TCP-friendly high-speed TCP variant. In Proc. SIGOPS-OSR, 2008. Google ScholarDigital Library
- V. Tsaoussidis and C. Zhang. 2002. TCP-Real: receiver-oriented congestion control. The International Journal of Computer and Telecommunications Networking. 40(4), 2002. Google ScholarDigital Library
- V. Vasudevan et al. Safe and effective fine-grained TCP retransmissions for datacenter communication. In Proc. SIGCOMM, 2009. Google ScholarDigital Library
- C. Wilson, H. Ballani, T. Karagiannis, A. Rowstron. Better Never Than Late: Meeting Deadlines in Datacenter Networks. In Proc. SIGCOMM, 2011. Google ScholarDigital Library
- C. Wilson, H. Ballani, T. Karagiannis, and A. Rowstron. Better never than late: Meeting deadlines in datacenter networks. Technical Report MSR-TR-2011--66, Microsoft Research, May 2011.Google ScholarDigital Library
- Y. Xia, L. Subramanian, I. Stoica, and S. Kalyanaraman. One more bit is enough. In Proc. SIGCOMM, 2005. Google ScholarDigital Library
Index Terms
- Deadline-aware datacenter tcp (D2TCP)
Recommendations
Data center TCP (DCTCP)
SIGCOMM '10: Proceedings of the ACM SIGCOMM 2010 conferenceCloud data centers host diverse applications, mixing workloads that require small predictable latency with others requiring large sustained throughput. In this environment, today's state-of-the-art TCP protocol falls short. We present measurements of a ...
Deadline-aware datacenter tcp (D2TCP)
Special october issue SIGCOMM '12An important class of datacenter applications, called Online Data-Intensive (OLDI) applications, includes Web search, online retail, and advertisement. To achieve good user experience, OLDI applications operate under soft-real-time constraints (e.g., ...
Performance sensitivity and fairness of ECN-aware 'modified TCP'
The paper discusses how explicit congestion notification (ECN) can be used to devise an Internet congestion control mechanism that is more rapidly reactive and allows best-effort flows to rapidly adjust to fluctuations in available capacity. Our ECN-mod ...
Comments