Published in: The Journal of Supercomputing 6/2015

01-06-2015

Performance model-directed data sieving for high-performance I/O

Authors: Yong Chen, Yin Lu, Prathamesh Amritkar, Rajeev Thakur, Yu Zhuang

Abstract

Many scientific computing applications and engineering simulations exhibit noncontiguous I/O access patterns. Data sieving is an important technique for improving the performance of noncontiguous I/O accesses: it combines small, noncontiguous requests into one large, contiguous request. It has proven effective even though it may access more data than requested. In this study, we propose a new data sieving approach, performance model-directed data sieving, or PMD data sieving for short. It improves on the existing data sieving approach in two respects: (1) it dynamically determines when data sieving is beneficial; and (2) it dynamically determines how to perform data sieving when it is beneficial. Both theoretical analysis and experimental results verify that it considerably improves on the performance of the existing approach and reduces memory consumption. Given the importance of supporting noncontiguous accesses effectively and of reducing memory pressure in large-scale systems, the proposed PMD data sieving approach holds great promise and will have an impact on high-performance I/O systems.
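The idea of data sieving, and of deciding when to apply it, can be illustrated with a minimal Python sketch. It is not the authors' implementation or their performance model, which the abstract does not specify. The function names (`direct_read`, `sieving_read`, `estimated_cost`, `should_sieve`) and the linear cost model (a fixed per-operation latency plus bytes divided by bandwidth) are assumptions made only for illustration.

```python
import io
from typing import BinaryIO, List, Tuple

Request = Tuple[int, int]  # (offset, length) in bytes


def direct_read(f: BinaryIO, requests: List[Request]) -> List[bytes]:
    """Serve each noncontiguous request with its own seek and read."""
    pieces = []
    for offset, length in requests:
        f.seek(offset)
        pieces.append(f.read(length))
    return pieces


def sieving_read(f: BinaryIO, requests: List[Request]) -> List[bytes]:
    """Read one contiguous extent covering all requests, then extract
    the requested pieces from the in-memory buffer."""
    if not requests:
        return []
    start = min(offset for offset, _ in requests)
    end = max(offset + length for offset, length in requests)
    f.seek(start)
    buffer = f.read(end - start)  # includes the unwanted "holes"
    return [buffer[o - start:o - start + n] for o, n in requests]


def estimated_cost(num_ops: int, num_bytes: int,
                   latency_s: float, bandwidth_bps: float) -> float:
    """Hypothetical linear cost model (an assumption, not the paper's)."""
    return num_ops * latency_s + num_bytes / bandwidth_bps


def should_sieve(requests: List[Request],
                 latency_s: float, bandwidth_bps: float) -> bool:
    """Sieve only if one large read is estimated to be cheaper than
    many small reads under the assumed cost model."""
    if not requests:
        return False
    wanted = sum(length for _, length in requests)
    start = min(offset for offset, _ in requests)
    end = max(offset + length for offset, length in requests)
    direct = estimated_cost(len(requests), wanted, latency_s, bandwidth_bps)
    sieved = estimated_cost(1, end - start, latency_s, bandwidth_bps)
    return sieved < direct


if __name__ == "__main__":
    f = io.BytesIO(bytes(range(256)))
    reqs = [(10, 4), (100, 4), (200, 4)]
    assert sieving_read(f, reqs) == direct_read(f, reqs)
    print(should_sieve(reqs, latency_s=5e-3, bandwidth_bps=100e6))
```

Under this assumed model, closely spaced requests favor sieving because the saved per-operation latency outweighs the extra bytes, while widely separated requests do not. The sieving buffer also grows with the covered extent, which is the memory cost the abstract refers to. The sketch covers only the "when" decision; how to split or group requests is left out.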

Metadata
Title
Performance model-directed data sieving for high-performance I/O
Authors
Yong Chen
Yin Lu
Prathamesh Amritkar
Rajeev Thakur
Yu Zhuang
Publication date
01-06-2015
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 6/2015
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-014-1277-8
