DOI: 10.1145/1713072.1713082
research-article

pNFS, POSIX, and MPI-IO: a tale of three semantics

Published: 14 November 2009

ABSTRACT

MPI-IO is emerging as the standard mechanism for file I/O within HPC applications. While pNFS demonstrates high-performance I/O for bulk data transfers, its performance and scalability with MPI-IO are unproven. To attain success, the consistency semantics and interfaces of pNFS, POSIX, and MPI-IO must all be reconciled and efficiently translated. This paper investigates and discusses the challenges of using pNFS to support the consistency semantics of HPC applications.

        • Published in

          PDSW '09: Proceedings of the 4th Annual Workshop on Petascale Data Storage
          November 2009
          58 pages
          ISBN: 9781605588834
          DOI: 10.1145/1713072
          • Conference Chair: Garth A. Gibson

          Copyright © 2009 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Acceptance Rates

          Overall Acceptance Rate: 17 of 41 submissions, 41%
