research-article

RFPL: A Recovery Friendly Parity Logging Scheme for Reducing Small Write Penalty of SSD RAID

Authors:
Gaoxiang Xu

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China
View Profile

,
Dan Feng

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China
View Profile

,
Zhipeng Tan

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China
View Profile

,
Xinyan Zhang

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China
View Profile

,
Jie Xu

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China
View Profile

,
Xi Shu

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China
View Profile

,
Yifeng Zhu

Department of Electrical and Computer Engineering, University of Maine, USA

Department of Electrical and Computer Engineering, University of Maine, USA
View Profile

ICPP '19: Proceedings of the 48th International Conference on Parallel ProcessingAugust 2019Article No.: 23Pages 1–10https://doi.org/10.1145/3337821.3337887

Published:05 August 2019Publication History

ICPP '19: Proceedings of the 48th International Conference on Parallel Processing

Pages 1–10

ABSTRACT

Parity based RAID suffers from poor small write performance due to heavy parity update overhead. The recently proposed method EPLOG constructs a new stripe with updated data chunks without updating old parity chunks. However, due to skewness of data accesses, old versions of updated data chunks often need to be kept to protect other data chunks of the same stripe. This seriously hurts the efficiency of recovering system from device failures due to the need of reconstructing the preserved old data chunks on failed devices.

In this paper, we propose a Recovery Friendly Parity Logging scheme, called RFPL, which minimizes small write penalty and provides high recovery performance for SSD RAID. The key idea of RFPL is to reduce the mixture of old and new data chunks in a stripe by exploiting skewness of data accesses. RFPL constructs a new stripe with updated data chunks of the same old stripe. Since cold data chunks of the old stripe are rarely updated, it is likely that all of data chunks written to the new stripe are hot data and become old together within a short time span. This co-old of data chunks in a stripe effectively mitigates the total number of old data chunks which need to be preserved. We have implemented RFPL on a RAID-5 SSD array in Linux 4.3. Experimental results show that, compared with the Linux software RAID, RFPL reduces user I/O response time by 83.1% for normal state and 81.6% for reconstruction state. Compared with the state-of-the-art scheme EPLOG, RFPL reduces user I/O response time by 46.8% for normal state and 40.9% for reconstruction state. Our reliability analysis shows RFPL improves the mean time to data loss (MTTDL) by 9.36X and 1.44X compared with the Linux software RAID and EPLOG.

References

2006. blktrace User Guide. https://linux.die.net/man/8/blktrace. (2006).Google Scholar
2017. Intel Optane Memory. https://www.intel.cn/content/www/cn/zh/products/memory-storage/optane-memory/optane-32gb-m-2-80mm.html. (2017).Google Scholar
2018. SanDisk Solid State Driver. https://www.sandisk.com/. (2018).Google Scholar
Ching-Che Chung and Hao-Hsiang Hsu. 2014. Partial parity cache and data cache management method to improve the performance of an SSD-based RAID. VLSI 22, 7 (2014), 1470--1480.Google Scholar
Garth Gibson. 2007. Reflections on failure in post-terascale parallel computing. In International Conference on Parallel Processing. IEEE.Google Scholar
Y Hu. 2013. Exploring and exploiting the multilevel parallelism inside SSDs for improved performance and endurance. IEEE TOC 62, 6 (2013), 1141--1151. Google ScholarDigital Library
Soojun Im and Dongkun Shin. 2011. Flash-aware RAID techniques for dependable and high-performance flash memory SSD. TOC 60, 1 (2011), 80--92. Google ScholarDigital Library
J Kim, D Lee, and Noh S H. 2015. Towards SLO Complying SSDs Through OPS Isolation. In FAST. 183--189. Google ScholarDigital Library
Jaeho Kim and Jongmin Lee. 2013. Improving SSD reliability with RAID via elastic striping and anywhere. In DSN. IEEE, 1--12. Google ScholarDigital Library
Yongkun Li, Helen HW Chan, Patrick PC Lee, and Yinlong Xu. 2016. Elastic Parity Logging for SSD RAID Arrays. In DSN. IEEE, 49--60.Google Scholar
Marc Liberatore. 2007. Storage Performance Council. http://traces.cs.umass.edu/index.php/Storage/Storage. (2007).Google Scholar
Bo Mao, Hong Jiang, Suzhen Wu, et al. 2012. HPDA: A hybrid parity-based disk array for enhanced performance and reliability. TOS 8, 1 (2012), 4. Google ScholarDigital Library
Sangwhan Moon and A. L. Reddy. 2016. Does RAID improve lifetime of SSD arrays? Transactions on Storage (TOS) 12, 3 (2016), 11--29. Google ScholarDigital Library
Dushyanth Narayanan and Austin Donnelly. 2008. Write off-loading: Practical power management for enterprise storage. TOS 4, 3 (2008), 10. Google ScholarDigital Library
J. Ostergaard and E. Bueso. 2010. The Software-RAID HOWTO. http://www.tldp.org/HOWTO/Software-RAID-HOWTO.html. (2010).Google Scholar
Yubiao Pan and Yongkun Li. 2015. Grouping-Based Elastic Striping with Hotness Awareness for Improving SSD RAID Performance. In DSN. IEEE, 160--171. Google ScholarDigital Library
Amer A Paris J F. 2009. Using storage class memories to increase the reliability of two-dimensional RAID arrays. In International Symposium on Modeling, Analysis Simulation of Computer and Telecommunication Systems (MASCOTS). IEEE, 1--8.Google Scholar
R Pawula. 1967. Generalizations and extensions of Fokker-Planck-Kolmogorov equations. IEEE Transactions on Information Theory 13, 1 (1967), 33--41. Google ScholarDigital Library
Mendel Rosenblum. 1992. The design and implementation of a log-structured file system. ACM Transactions on Computer Systems 10, 1 (1992), 26--52. Google ScholarDigital Library
Gibson G A Schroeder B. 2007. Disk failures in the real world: What does an mttf of 1, 000, 000 hours mean to you?. In FAST. 1--16. Google ScholarDigital Library
Gibson G Stodolsky D. 1993. Parity logging overcoming the small write problem in redundant disk arrays. In SIGARCH Computer Architecture News. ACM, 64--75. Google ScholarDigital Library
Jiang H Tian L, Feng D. 2007. PRO: A Popularity-based Multi-threaded Reconstruction Optimization for RAID-Structured Storage Systems. In FAST. 301--314. Google ScholarDigital Library
Jiguang Wan and Wei Wu. 2017. DEFT-Cache: A Cost-Effective and Highly Reliable SSD Cache for RAID Storage. In IPDPS. IEEE, 102--111.Google Scholar
Yang Q Wan J, Wang J. 2010. S2-RAID: A new RAID architecture for fast data recovery. In Mass Storage Systems and Technologies (MSST). IEEE, 1--9. Google ScholarDigital Library
Suzhen Wu and Bo Mao. 2016. LDM: Log Disk Mirroring with Improved Performance and Reliability for SSD-Based Disk Arrays. TOS 12, 4 (2016), 22. Google ScholarDigital Library
Feng D Wu S, Jiang H. 2009. WorkOut: I/O Workload Outsourcing for Boosting RAID Reconstruction Performance. In FAST. 239--252. Google ScholarDigital Library
Jiang H et al Wu S, Feng D. 2009. JOR: A journal-guided reconstruction optimization for RAID-structured storage systems. In International Conference on Parallel and Distributed Systems (ICPADS). IEEE, 609--616. Google ScholarDigital Library
Schwarz T et al Xin Q, Miller E L. 2003. Reliability mechanisms for very large storage systems. In Mass Storage Systems and Technologies. IEEE, 146--156. Google ScholarDigital Library
Jie Yao, Hong Jiang, et al. 2016. Elastic-RAID: A New Architecture for Improved Availability of Parity-Based RAIDs by Elastic Mirroring. IPDPS 27, 4 (2016), 1044--1056. Google ScholarDigital Library

Recommendations

Modeling SSD RAID reliability under general settings
CF '18: Proceedings of the 15th ACM International Conference on Computing Frontiers

Solid-state drives (SSDs) are susceptible to the limited number of program/erase (P/E) cycles and uncorrectable flash errors, and hence achieving high reliability of SSD storage systems is a critical issue. RAID provides a viable option for enhancing ...
Read More
Reconstruct versus read-modify writes in RAID

RAID5 (Redundant Arrays of Independent Disk level 5) is a popular paradigm, which uses parity to protect against single disk failures. A major shortcoming of RAID5 is the small write penalty, i.e., the cost of updating parity when a data block is ...
Read More
Grouping-Based Elastic Striping with Hotness Awareness for Improving SSD RAID Performance
DSN '15: Proceedings of the 2015 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks

RAID provides a good option to provide device-level fault tolerance. Conventional RAID usually updates parities with read-modify-write or read-reconstruct-write, which may introduce a lot of extra I/Os and thus significantly degrade SSD RAID ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ICPP '19: Proceedings of the 48th International Conference on Parallel Processing
August 2019
1107 pages
ISBN:9781450362955
DOI:10.1145/3337821

Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 5 August 2019
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
I/O Performance
Recovery Performance
Reliability
SSD RAID
Small Write Penalty
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate91of313submissions,29%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 209
  Total Downloads
- Downloads (Last 12 months)15
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

RFPL: A Recovery Friendly Parity Logging Scheme for Reducing Small Write Penalty of SSD RAID

ICPP '19: Proceedings of the 48th International Conference on Parallel Processing

ABSTRACT

References

Cited By

Recommendations

Modeling SSD RAID reliability under general settings

Reconstruct versus read-modify writes in RAID

Grouping-Based Elastic Striping with Hotness Awareness for Improving SSD RAID Performance

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

RFPL: A Recovery Friendly Parity Logging Scheme for Reducing Small Write Penalty of SSD RAID

ICPP '19: Proceedings of the 48th International Conference on Parallel Processing

ABSTRACT

References

Cited By

Recommendations

Modeling SSD RAID reliability under general settings

Reconstruct versus read-modify writes in RAID

Grouping-Based Elastic Striping with Hotness Awareness for Improving SSD RAID Performance

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media