Skip to main content
Top
Published in: The Journal of Supercomputing 8/2018

05-05-2018

Fair bandwidth allocating and strip-aware prefetching for concurrent read streams and striped RAIDs in distributed file systems

Authors: Sangmin Lee, Soon J. Hyun, Hong-Yeon Kim, Young-Kyun Kim

Published in: The Journal of Supercomputing | Issue 8/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

With a striped RAID (Redundant Array of Independent Disks) which consists of multiple disks and spreads data across them in parallel, distributed file systems (DFSs) easily enhance the performance of a single read stream (i.e., a series of sequential reads by a process). However, most existing DFSs suffer from performance degradation in concurrent read streams (i.e., multiple series of sequential reads by concurrent processes). Furthermore, research on the performance of concurrent ones for a striped RAID in DFSs has been rarely reported so far. In this paper, we define the problems that degrade it at different configurations of striped RAIDs, and resolve them by proposing the following two methods: (1) a fair allocating of network bandwidth for concurrent read streams and (2) a strip-aware prefetching for each individual read stream. We show that our proposal outperforms all the existing DFSs by at least two times for all kinds and configurations of striped RAIDs. Furthermore, the performance gap between our proposal and the existing DFSs becomes wider according to the increasing number of striped disks.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Palankar MR et al (2008) Amazon S3 for science grids: a viable solution? In: Proceedings of the 2008 International Workshop on Data-Aware Distributed Computing. ACM Palankar MR et al (2008) Amazon S3 for science grids: a viable solution? In: Proceedings of the 2008 International Workshop on Data-Aware Distributed Computing. ACM
3.
go back to reference Weil SA et al (2006) Ceph: a scalable, high-performance distributed file system. In: Proceedings of the 7th Symposium on Operating Systems Design and Implementation. USENIX Association Weil SA et al (2006) Ceph: a scalable, high-performance distributed file system. In: Proceedings of the 7th Symposium on Operating Systems Design and Implementation. USENIX Association
4.
go back to reference Calder B et al (2011) Windows Azure storage: a highly available cloud storage service with strong consistency. In: Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles. ACM Calder B et al (2011) Windows Azure storage: a highly available cloud storage service with strong consistency. In: Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles. ACM
5.
go back to reference Ghemawat S, Gobioff H, Leung S-T (2003) The Google file system. In: ACM SIGOPS Operating Systems Review, vol. 37, no 5. ACM, pp 29–43 Ghemawat S, Gobioff H, Leung S-T (2003) The Google file system. In: ACM SIGOPS Operating Systems Review, vol. 37, no 5. ACM, pp 29–43
6.
go back to reference Dean J, Ghemawat S (2008) MapReduce: simplified data processing on large clusters. Commun ACM 51(1):107–113CrossRef Dean J, Ghemawat S (2008) MapReduce: simplified data processing on large clusters. Commun ACM 51(1):107–113CrossRef
7.
go back to reference Shvachko K et al (2010) The hadoop distributed file system. In: 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST). IEEE Shvachko K et al (2010) The hadoop distributed file system. In: 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST). IEEE
8.
go back to reference Wang F et al (2009) Understanding lustre filesystem internals. Oak Ridge National Laboratory, National Center for Computational Sciences, Technical Report Wang F et al (2009) Understanding lustre filesystem internals. Oak Ridge National Laboratory, National Center for Computational Sciences, Technical Report
9.
go back to reference Welch B et al. (2008) Scalable performance of the Panasas parallel file system. In: FAST, vol 8, pp 1–17 Welch B et al. (2008) Scalable performance of the Panasas parallel file system. In: FAST, vol 8, pp 1–17
10.
go back to reference Chen Y (2011) Towards scalable I/O architecture for exascale systems. In: Proceedings of the 2011 ACM International Workshop on Many Task Computing on Grids and Supercomputers. ACM Chen Y (2011) Towards scalable I/O architecture for exascale systems. In: Proceedings of the 2011 ACM International Workshop on Many Task Computing on Grids and Supercomputers. ACM
11.
go back to reference Xu Q et al (2014) Efficient and scalable metadata management in EB-scale file systems. IEEE Trans Parallel Distrib Syst 25.11:2840–2850CrossRef Xu Q et al (2014) Efficient and scalable metadata management in EB-scale file systems. IEEE Trans Parallel Distrib Syst 25.11:2840–2850CrossRef
12.
go back to reference Xiong J et al (2011) Metadata distribution and consistency techniques for large-scale cluster file systems. IEEE Trans Parallel Distrib Syst 22.5:803–816CrossRef Xiong J et al (2011) Metadata distribution and consistency techniques for large-scale cluster file systems. IEEE Trans Parallel Distrib Syst 22.5:803–816CrossRef
13.
go back to reference Kim Y, Gunasekaran R (2015) Understanding I/O workload characteristics of a peta-scale storage system. J Supercomput 71(3):761–780CrossRef Kim Y, Gunasekaran R (2015) Understanding I/O workload characteristics of a peta-scale storage system. J Supercomput 71(3):761–780CrossRef
14.
go back to reference Lai WK et al (2014) Towards a framework for large-scale multimedia data storage and processing on Hadoop platform. J Supercomput 68.1:488–507CrossRef Lai WK et al (2014) Towards a framework for large-scale multimedia data storage and processing on Hadoop platform. J Supercomput 68.1:488–507CrossRef
15.
go back to reference Mao B, Wu S, Duan L (2018) Improving the SSD performance by exploiting request characteristics and internal parallelism. IEEE Trans Comput Aided Des Integr Circuits Syst 37(2):472–484CrossRef Mao B, Wu S, Duan L (2018) Improving the SSD performance by exploiting request characteristics and internal parallelism. IEEE Trans Comput Aided Des Integr Circuits Syst 37(2):472–484CrossRef
16.
go back to reference Sur S et al (2010) Can high-performance interconnects benefit hadoop distributed file system. In: Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds (MASVDC). Held in Conjunction with MICRO Sur S et al (2010) Can high-performance interconnects benefit hadoop distributed file system. In: Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds (MASVDC). Held in Conjunction with MICRO
17.
go back to reference Kolli A et al (2016) High-performance transactions for persistent memories. ACM SIGPLAN Not 51.4:399–411CrossRef Kolli A et al (2016) High-performance transactions for persistent memories. ACM SIGPLAN Not 51.4:399–411CrossRef
18.
go back to reference Matsui C, Sun C, Takeuchi K (2017) Design of hybrid SSDs with storage class memory and NAND flash memory. In: Proceedings of the IEEE Matsui C, Sun C, Takeuchi K (2017) Design of hybrid SSDs with storage class memory and NAND flash memory. In: Proceedings of the IEEE
19.
go back to reference Qiu S, Reddy ALN (2013) NVMFS: a hybrid file system for improving random write in nand-flash SSD. In: 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies (MSST). IEEE Qiu S, Reddy ALN (2013) NVMFS: a hybrid file system for improving random write in nand-flash SSD. In: 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies (MSST). IEEE
20.
go back to reference Huang TC, Chang DW (2016) TridentFS: a hybrid file system for nonvolatile RAM, flash memory and magnetic disk. Softw Pract Exp 46.3:291–318CrossRef Huang TC, Chang DW (2016) TridentFS: a hybrid file system for nonvolatile RAM, flash memory and magnetic disk. Softw Pract Exp 46.3:291–318CrossRef
21.
go back to reference Fan Z et al (2017) Hibachi: a cooperative hybrid cache with NVRAM and DRAM for storage arrays. In: Proceedings of IEEE Conference on Mass Storage Systems and Technologies (MSST) Fan Z et al (2017) Hibachi: a cooperative hybrid cache with NVRAM and DRAM for storage arrays. In: Proceedings of IEEE Conference on Mass Storage Systems and Technologies (MSST)
22.
go back to reference Chandy JA (2008) RAID0. 5: design and implementation of a low cost disk array data protection method. J Supercomput 46(2):108–123CrossRef Chandy JA (2008) RAID0. 5: design and implementation of a low cost disk array data protection method. J Supercomput 46(2):108–123CrossRef
23.
go back to reference Shriver EAM, Small C, Smith KA (1999) Why does file system prefetching work? In: USENIX Annual Technical Conference, General Track Shriver EAM, Small C, Smith KA (1999) Why does file system prefetching work? In: USENIX Annual Technical Conference, General Track
24.
go back to reference Fengguang WU, Hongsheng XI, Chenfeng XU (2008) On the design of a new linux readahead framework. ACM SIGOPS Oper Syst Rev 42(5):75–84CrossRef Fengguang WU, Hongsheng XI, Chenfeng XU (2008) On the design of a new linux readahead framework. ACM SIGOPS Oper Syst Rev 42(5):75–84CrossRef
25.
go back to reference Pai R, Pulavarty B, Cao M (2004) Linux 2.6 performance improvement through readahead optimization. In: Proceedings of the Linux Symposium, vol 2 Pai R, Pulavarty B, Cao M (2004) Linux 2.6 performance improvement through readahead optimization. In: Proceedings of the Linux Symposium, vol 2
26.
go back to reference Wu F et al (2007) Linux readahead: less tricks for more. In: Proceedings of the Linux Symposium, vol 2 Wu F et al (2007) Linux readahead: less tricks for more. In: Proceedings of the Linux Symposium, vol 2
27.
go back to reference Li C, Shen K, Papathanasiou AE (2007) Competitive prefetching for concurrent sequential I/O. In: ACM SIGOPS Operating Systems Review, vol 41, no 3. ACM Li C, Shen K, Papathanasiou AE (2007) Competitive prefetching for concurrent sequential I/O. In: ACM SIGOPS Operating Systems Review, vol 41, no 3. ACM
28.
go back to reference Ding X et al (2007) DiskSeen: exploiting disk layout and access history to enhance I/O prefetch. In: USENIX Annual Technical Conference, vol 7 Ding X et al (2007) DiskSeen: exploiting disk layout and access history to enhance I/O prefetch. In: USENIX Annual Technical Conference, vol 7
29.
go back to reference Jiang S et al (2013) A prefetching scheme exploiting both data layout and access history on disk. ACM Trans Storage (TOS) 9.3:10 Jiang S et al (2013) A prefetching scheme exploiting both data layout and access history on disk. ACM Trans Storage (TOS) 9.3:10
30.
go back to reference Gill BS, Bathen LAD (2007) Optimal multistream sequential prefetching in a shared cache. ACM Trans Storage (TOS) 3.3:10CrossRef Gill BS, Bathen LAD (2007) Optimal multistream sequential prefetching in a shared cache. ACM Trans Storage (TOS) 3.3:10CrossRef
31.
go back to reference Baek SH, Park KH (2009) Striping-aware sequential prefetching for independency and parallelism in disk arrays with concurrent accesses. IEEE Trans Comput 58(8):1146–1152MathSciNetCrossRefMATH Baek SH, Park KH (2009) Striping-aware sequential prefetching for independency and parallelism in disk arrays with concurrent accesses. IEEE Trans Comput 58(8):1146–1152MathSciNetCrossRefMATH
32.
go back to reference Shi X, Feng D (2012) LSP: a locality-aware strip prefetching scheme for striped disk array systems with concurrent accesses. J Comput 7(6):1303–1311MathSciNetCrossRef Shi X, Feng D (2012) LSP: a locality-aware strip prefetching scheme for striped disk array systems with concurrent accesses. J Comput 7(6):1303–1311MathSciNetCrossRef
33.
go back to reference Pratt S, Heger DA (2004) Workload dependent performance evaluation of the linux 2.6 i/o schedulers. In: 2004 Linux Symposium Pratt S, Heger DA (2004) Workload dependent performance evaluation of the linux 2.6 i/o schedulers. In: 2004 Linux Symposium
34.
go back to reference Lee Y-J et al (2009) Fast-path I/O architecture for high performance streaming server. J Supercomput 50.2:99CrossRef Lee Y-J et al (2009) Fast-path I/O architecture for high performance streaming server. J Supercomput 50.2:99CrossRef
35.
go back to reference Roselli DS, Lorch JR, Anderson TE (2000) A comparison of file system workloads. In: USENIX Annual Technical Conference, General Track Roselli DS, Lorch JR, Anderson TE (2000) A comparison of file system workloads. In: USENIX Annual Technical Conference, General Track
36.
go back to reference Cooper BF et al (2010) Benchmarking cloud serving systems with YCSB. In: Proceedings of the 1st ACM Symposium on Cloud Computing. ACM Cooper BF et al (2010) Benchmarking cloud serving systems with YCSB. In: Proceedings of the 1st ACM Symposium on Cloud Computing. ACM
37.
go back to reference Shafer J, Rixner S, Cox AL (2010) The hadoop distributed filesystem: balancing portability and performance. In: 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS). IEEE Shafer J, Rixner S, Cox AL (2010) The hadoop distributed filesystem: balancing portability and performance. In: 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS). IEEE
38.
go back to reference Saini S et al (2012) I/O performance characterization of Lustre and NASA applications on Pleiades. In: 2012 19th International Conference on High Performance Computing (HiPC). IEEE Saini S et al (2012) I/O performance characterization of Lustre and NASA applications on Pleiades. In: 2012 19th International Conference on High Performance Computing (HiPC). IEEE
39.
go back to reference Chen PM et al (1994) RAID: high-performance, reliable secondary storage. ACM Comput Surv (CSUR) 26.2:145–185CrossRef Chen PM et al (1994) RAID: high-performance, reliable secondary storage. ACM Comput Surv (CSUR) 26.2:145–185CrossRef
40.
go back to reference Moon S et al (2015) Optimizing the Hadoop MapReduce Framework with high-performance storage devices. J Supercomput 71.9:3525–3548CrossRef Moon S et al (2015) Optimizing the Hadoop MapReduce Framework with high-performance storage devices. J Supercomput 71.9:3525–3548CrossRef
41.
go back to reference Liang S, Jiang S, Zhang X (2007) STEP: sequentiality and thrashing detection based prefetching to improve performance of networked storage servers. In: 27th International Conference on Distributed Computing Systems (ICDCS’07). IEEE Liang S, Jiang S, Zhang X (2007) STEP: sequentiality and thrashing detection based prefetching to improve performance of networked storage servers. In: 27th International Conference on Distributed Computing Systems (ICDCS’07). IEEE
42.
go back to reference Zhang Z et al (2008) Pfc: transparent optimization of existing prefetching strategies for multi-level storage systems. In: The 28th International Conference on Distributed Computing Systems, 2008. ICDCS’08. IEEE Zhang Z et al (2008) Pfc: transparent optimization of existing prefetching strategies for multi-level storage systems. In: The 28th International Conference on Distributed Computing Systems, 2008. ICDCS’08. IEEE
43.
go back to reference Soundararajan G, Mihailescu M, Amza C (2008) Context-aware prefetching at the storage server. In: USENIX Annual Technical Conference Soundararajan G, Mihailescu M, Amza C (2008) Context-aware prefetching at the storage server. In: USENIX Annual Technical Conference
44.
go back to reference Lee HK, An BS, Kim EJ (2009) Adaptive prefetching scheme using web log mining in Cluster-based web systems. In: IEEE International Conference on Web Services, 2009. ICWS 2009. IEEE Lee HK, An BS, Kim EJ (2009) Adaptive prefetching scheme using web log mining in Cluster-based web systems. In: IEEE International Conference on Web Services, 2009. ICWS 2009. IEEE
45.
go back to reference Gala Y et al (2011) Management of multilevel, multiclient cache hierarchies with application hints. ACM Trans Comput Syst (TOCS) 29(2):5 Gala Y et al (2011) Management of multilevel, multiclient cache hierarchies with application hints. ACM Trans Comput Syst (TOCS) 29(2):5
46.
go back to reference Yadgar G et al (2008) Mc2: multiple clients on a multilevel cache. In: The 28th International Conference on Distributed Computing Systems, 2008. ICDCS’08. IEEE Yadgar G et al (2008) Mc2: multiple clients on a multilevel cache. In: The 28th International Conference on Distributed Computing Systems, 2008. ICDCS’08. IEEE
47.
go back to reference Dong B et al (2010) Correlation based file prefetching approach for hadoop. In: 2010 IEEE Second International Conference on Cloud Computing Technology and Science (CloudCom). IEEE Dong B et al (2010) Correlation based file prefetching approach for hadoop. In: 2010 IEEE Second International Conference on Cloud Computing Technology and Science (CloudCom). IEEE
Metadata
Title
Fair bandwidth allocating and strip-aware prefetching for concurrent read streams and striped RAIDs in distributed file systems
Authors
Sangmin Lee
Soon J. Hyun
Hong-Yeon Kim
Young-Kyun Kim
Publication date
05-05-2018
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 8/2018
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-018-2396-4

Other articles of this Issue 8/2018

The Journal of Supercomputing 8/2018 Go to the issue

Premium Partner