DOI: 10.1145/2486001.2486005
research-article

zUpdate: updating data center networks with zero loss

Published: 27 August 2013

ABSTRACT

Datacenter networks (DCNs) are constantly evolving due to various updates such as switch upgrades and VM migrations. Each update must be carefully planned and executed to avoid disrupting the many mission-critical, interactive applications hosted in DCNs. The key challenge arises from the inherent difficulty of synchronizing changes across many devices, which may result in unforeseen transient link load spikes or even congestion. We present one primitive, zUpdate, to perform congestion-free network updates under asynchronous switch and traffic matrix changes. We formulate the update problem using a network model and apply our model to a variety of representative update scenarios in DCNs. We develop novel techniques to handle several practical challenges in realizing zUpdate, implement a zUpdate prototype on OpenFlow switches, and deploy it on a testbed that resembles a real DCN topology. Our results, from both real-world experiments and large-scale trace-driven simulations, show that zUpdate can effectively perform congestion-free updates in production DCNs.
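
The transient load spikes mentioned in the abstract can be illustrated with a small sketch. It is not the paper's formulation or algorithm: it assumes, for illustration only, that each flow switches atomically from its old path to its new path and that flows switch in an arbitrary order, then enumerates every mix of old and new states to find the worst load each link could see. The names `worst_transient_load` and `congestion_free`, the flow records, and the link labels are all hypothetical.

```python
from itertools import product


def link_loads(flows, states):
    """Per-link load when flow i follows flows[i][states[i]] ('old' or 'new')."""
    loads = {}
    for flow, state in zip(flows, states):
        for link in flow[state]:
            loads[link] = loads.get(link, 0) + flow["rate"]
    return loads


def worst_transient_load(flows):
    """Highest load each link can see over all mixes of old/new flow states.

    Assumption: flows flip independently and atomically, so any combination
    of old and new paths may be live at some instant during the update.
    """
    worst = {}
    for states in product(("old", "new"), repeat=len(flows)):
        for link, load in link_loads(flows, states).items():
            worst[link] = max(worst.get(link, 0), load)
    return worst


def congestion_free(flows, capacity):
    """True if no link exceeds its capacity in any transient state."""
    worst = worst_transient_load(flows)
    return all(load <= capacity[link] for link, load in worst.items())


# Two flows of rate 6 swap links A and B; each link has capacity 10.
swap = [
    {"rate": 6, "old": ["A"], "new": ["B"]},
    {"rate": 6, "old": ["B"], "new": ["A"]},
]
capacity = {"A": 10, "B": 10}

print(worst_transient_load(swap))       # {'A': 12, 'B': 12}
print(congestion_free(swap, capacity))  # False
```

In this example both the initial and the final states load each link with 6 units, yet if one flow moves before the other, a link briefly carries 12 units and exceeds its capacity of 10. The brute-force enumeration is exponential in the number of flows and is only meant to make the problem concrete.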


      Published in

        SIGCOMM '13: Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
        August 2013
        580 pages
        ISBN: 9781450320566
        DOI: 10.1145/2486001

        ACM SIGCOMM Computer Communication Review, Volume 43, Issue 4
        October 2013
        595 pages
        ISSN: 0146-4833
        DOI: 10.1145/2534169

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

        Publisher

        Association for Computing Machinery

        New York, NY, United States


        Acceptance Rates

        SIGCOMM '13 paper acceptance rate: 38 of 246 submissions, 15%
        Overall acceptance rate: 554 of 3,547 submissions, 16%
