skip to main content
10.1145/1592568.1592575acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article
Free Access

PortLand: a scalable fault-tolerant layer 2 data center network fabric

Published:16 August 2009Publication History

ABSTRACT

This paper considers the requirements for a scalable, easily manageable, fault-tolerant, and efficient data center network fabric. Trends in multi-core processors, end-host virtualization, and commodities of scale are pointing to future single-site data centers with millions of virtual end points. Existing layer 2 and layer 3 network protocols face some combination of limitations in such a setting: lack of scalability, difficult management, inflexible communication, or limited support for virtual machine migration. To some extent, these limitations may be inherent for Ethernet/IP style protocols when trying to support arbitrary topologies. We observe that data center networks are often managed as a single logical network fabric with a known baseline topology and growth model. We leverage this observation in the design and implementation of PortLand, a scalable, fault tolerant layer 2 routing and forwarding protocol for data center environments. Through our implementation and evaluation, we show that PortLand holds promise for supporting a ``plug-and-play" large-scale, data center network.

References

  1. Cisco Data Center Infrastructure 2.5 Design Guide. www.cisco.com/application/pdf/en/us/guest/netsol/ns107/c649/ccmigration_09186a008073377d.pdf.Google ScholarGoogle Scholar
  2. Configuring IP Unicast Layer 3 Switching on Supervisor Engine 2. www.cisco.com/en/US/docs/routers/7600/ios/12.1E/configuration/guide/cef.html.Google ScholarGoogle Scholar
  3. Inside Microsoft's $550 Million Mega Data Centers. www.informationweek.com/news/hardware/data_centers/showArticle.jhtml?articleID=208403723.Google ScholarGoogle Scholar
  4. OpenFlow. www.openflowswitch.org/ .Google ScholarGoogle Scholar
  5. OSPF Design Guide. www.ciscosystems.com/en/US/tech/tk365/technologies_white_paper09186a0080094e9e.shtml.Google ScholarGoogle Scholar
  6. M. Al-Fares, A. Loukissas, and A. Vahdat. A Scalable, Commodity Data Center Network Architecture. In SIGCOMM '08: Proceedings of the ACM SIGCOMM 2008 conference on Data communication, pages 63--74, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Caesar, D. Caldwell, N. Feamster, J. Rexford, A. Shaikh, and J. van der Merwe. Design and Implementation of a Routing Control Platform. In USENIX Symposium on Networked Systems Design & Implementation, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Caesar, M. Castro, E. B. Nightingale, G. O, and A. Rowstron. Virtual Ring Routing: Network Routing Inspired by DHTs. In Proceedings of ACM SIGCOMM,2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Caesar, T. Condie, J. Kannan, K. Lakshminarayanan, I. Stoica, and S. Shenker. ROFL: Routing on Flat Labels. In Proceedings of ACM SIGCOMM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. C. Changhoon Kim and J. Rexford. Floodless in SEATTLE: A Scalable Ethernet Architecture for Large Enterprises. In SIGCOMM '08: Proceedings of the ACM SIGCOMM 2008 conference on Data communication, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. Clark, K. Fraser, S. Hand, J. G. H. E. J. C. Limpach, I. Pratt, and A. Warfield. Live Migration of Virtual Machines. In USENIX Symposium on Networked Systems Design & Implementation, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI'04: Proceedings of the 6th conference on Symposium on Operating Systems Design & Implementation, pages 10--10, Berkeley, CA, USA, 2004. USENIX Association. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. S. Ghemawat, H. Gobioff, and S.-T. Leung. The Google File System. ACM SIGOPS Operating Systems Review, 37(5), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Greenberg, P. Lahiri, D. A. Maltz, P. Patel, and S. Sengupta. Towards a Next Generation Data Center Architecture: Scalability and Commoditization. In PRESTO '08: Proceedings of the ACM Workshop on Programmable Routers for Extensible Services of Tomorrow, pages 57--62, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Guo, H. Wu, K. Tan, L. Shi, Y. Zhang, and S. Lu. DCell: A Scalable and Fault-tolerant Network Structure for Data Centers. In Proceedings of the ACM SIGCOMM 2008 conference on Data communication, pages 75--86, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. Hopps. Analysis of an Equal-Cost Multi-Path Algorithm. RFC 2992, Internet Engineering Task Force, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. V. Kashyap. Dynamic Host Configuration Protocol (DHCP) over InfiniBand. RFC 4390 (Proposed Standard), 2006.Google ScholarGoogle Scholar
  18. K. Lakshminarayanan, M. Caesar, M. Rangan, T. Anderson, S. Shenker, I. Stoica, and H. Luo. Achieving Convergence-Free Routing Using Failure--Carrying Packets. In Proceedings of ACM SIGCOMM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. C. E. Leiserson. Fat-Trees: Universal Networks for Hardware-Efficient Supercomputing. IEEE Transactions on Computers, 34(10):892--901, 1985. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. W. Lockwood, N. McKeown, G. Watson, G. Gibb, P. Hartke, J. Naous, R. Raghuraman, and J. Luo. NetFPGA-An Open Platform for Gigabit-Rate Network Switching and Routing. In Proceedings of the 2007 IEEE International Conference on Microelectronic Systems Education, pages 160--161, Washington, DC, USA, 2007. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. Moskowitz and P. Nikander. Host Identity Protocol (HIP) Architecture. RFC 4423 (Proposed Standard), 2006.Google ScholarGoogle Scholar
  22. J. Moy. OSPF Version 2. RFC 2328, Internet Engineering Task Force, 1998.Google ScholarGoogle Scholar
  23. A. Myers, T. S. E. Ng, and H. Zhang. Rethinking the Service Model: Scaling Ethernet to a Million Nodes. In ACM HotNets-III, 2004.Google ScholarGoogle Scholar
  24. L. S. C. of the IEEE Computer Society. IEEE Standard for Local and Metropolitan Area Networks, Common Specifications Part 3: Media Access Control (MAC), Bridges Ammendment 2: Rapid Reconfiguration, June 2001.Google ScholarGoogle Scholar
  25. R. Perlman, D. Eastlake, D. G. Dutt, S. Gai, and A. Ghanwani. Rbridges: Base Protocol Specification. Technical report, Internet Engineering Task Force, 2009.Google ScholarGoogle Scholar
  26. T. L. Rodeheffer, C. A. Thekkath, and D. C. Anderson. SmartBridge: A Scalable Bridge Architecture. In Proceedings of ACM SIGCOMM, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. D. Schroeder, A. D. Birrell, M. Burrows, H. Murray, R. M. Needham, T. L. Rodeheffer, E. H. Satterthwaite, and C. P. Thacker. Autonet: A High-Speed, Self-Configuring Local Area Network Using Point-to-Point Links. In IEEE Journal On Selected Areas in Communications, 1991.Google ScholarGoogle Scholar
  28. M. Scott and J. Crowcroft. MOOSE: Addressing the Scalability of Ethernet. In EuroSys Poster session, 2008.Google ScholarGoogle Scholar

Index Terms

  1. PortLand: a scalable fault-tolerant layer 2 data center network fabric

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGCOMM '09: Proceedings of the ACM SIGCOMM 2009 conference on Data communication
      August 2009
      340 pages
      ISBN:9781605585949
      DOI:10.1145/1592568
      • cover image ACM SIGCOMM Computer Communication Review
        ACM SIGCOMM Computer Communication Review  Volume 39, Issue 4
        SIGCOMM '09
        October 2009
        325 pages
        ISSN:0146-4833
        DOI:10.1145/1594977
        Issue’s Table of Contents

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 16 August 2009

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate554of3,547submissions,16%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader