Mixing Hadoop and HPC workloads on parallel filesystems
MapReduce-tailored distributed filesystems---such as HDFS for Hadoop MapReduce---and parallel high-performance computing filesystems are designed for considerably different workloads. The purpose of our work is to examine the performance of each ...
DiskReduce: RAID for data-intensive scalable computing
Data-intensive file systems, developed for Internet services and popular in cloud computing, provide high reliability and availability by replicating data, typically keeping three copies of everything. Alternatively, high-performance computing, which has ...
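The trade-off this abstract alludes to can be put in rough numbers. A minimal sketch (the block counts below are illustrative, not taken from the paper) comparing the capacity overhead of triple replication with a RAID-style parity group:

```python
def storage_overhead(data_blocks: int, redundancy_blocks: int) -> float:
    """Extra capacity consumed by redundancy, as a fraction of user data."""
    return redundancy_blocks / data_blocks

# Triple replication: every block is stored 3 times -> 2 extra copies per block.
replication = storage_overhead(1, 2)   # 200% overhead

# A hypothetical RAID-6-style group: 8 data blocks protected by 2 parity blocks.
parity_group = storage_overhead(8, 2)  # 25% overhead

print(f"3x replication overhead: {replication:.0%}")
print(f"8+2 parity overhead:     {parity_group:.0%}")
```

The eightfold difference in overhead is the kind of gap that motivates bringing RAID-style encoding to replicated data-intensive file systems.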
Data layout optimization for petascale file systems
In this study, the authors propose a simple performance model to promote a better integration between the parallel I/O middleware layer and parallel file systems. They show that application-specific data layout optimization can improve overall data ...
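Layout optimization of this kind hinges on how a parallel file system maps file offsets onto storage servers. A minimal sketch, assuming simple round-robin striping (the parameters are illustrative, not the paper's model):

```python
def stripe_location(offset: int, stripe_size: int, num_servers: int):
    """Map a file byte offset to (server index, local stripe index)
    under round-robin striping, as in Lustre- or PVFS-style layouts."""
    stripe = offset // stripe_size
    return stripe % num_servers, stripe // num_servers

# 1 MiB stripes across 4 servers: byte offset 5 MiB falls in global
# stripe 5, which lands on server 1 as that server's second stripe.
print(stripe_location(5 * 2**20, 2**20, 4))  # (1, 1)
```

An application whose access pattern is aligned with this mapping touches fewer servers per request, which is the sort of effect an application-specific layout optimization exploits.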
Case studies in storage access by loosely coupled petascale applications
A large number of real-world scientific applications can be characterized as loosely coupled: the communication among tasks is infrequent and can be performed by using file operations. While these applications may be ported to large scale machines ...
...and eat it too: high read performance in write-optimized HPC I/O middleware file formats
- Milo Polte,
- Jay Lofstead,
- John Bent,
- Garth Gibson,
- Scott A. Klasky,
- Qing Liu,
- Manish Parashar,
- Norbert Podhorszki,
- Karsten Schwan,
- Meghan Wingate,
- Matthew Wolf
As HPC applications run on increasingly high process counts on larger and larger machines, both the frequency of checkpoints needed for fault tolerance [14] and the resolution and size of Data Analysis Dumps are expected to increase proportionally. In ...
Scalable I/O tracing and analysis
As supercomputer performance has approached and then surpassed the petaflop level, I/O performance has become a major bottleneck for many scientific applications. Several tools exist to collect I/O traces to assist in the analysis of I/O ...
pNFS, POSIX, and MPI-IO: a tale of three semantics
MPI-IO is emerging as the standard mechanism for file I/O within HPC applications. While pNFS demonstrates high-performance I/O for bulk data transfers, its performance and scalability with MPI-IO is unproven. To attain success, the consistency ...
Uncovering errors: the cost of detecting silent data corruption
Data integrity is pivotal to the usefulness of any storage system. It ensures that the data stored is free from any modification throughout its existence on the storage medium. Hash functions such as cyclic redundancy checks (CRCs) or checksums are frequently ...
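The detection mechanism the abstract refers to can be sketched in a few lines: a checksum computed at write time exposes a later silent bit flip that the storage stack reports no error for. A minimal illustration using Python's `zlib.crc32` (the data and flipped bit are invented for the example):

```python
import zlib

def checksum(block: bytes) -> int:
    """CRC32 over a data block; stored alongside the block on write."""
    return zlib.crc32(block)

data = bytearray(b"important scientific data")
stored_crc = checksum(bytes(data))

# Silent corruption: a single bit flips on the medium, no I/O error is raised.
data[3] ^= 0x01

detected = checksum(bytes(data)) != stored_crc
print("corruption detected:", detected)  # True
```

The cost the paper examines is exactly this extra hashing work on every read and write, traded against the ability to catch such otherwise invisible corruption.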
Fusing data management services with file systems
File systems are the backbone of large-scale data processing for scientific applications. Motivated by the need to provide an extensible and flexible framework beyond the abstractions provided by API libraries for files to manage and analyze large-scale ...
Using the Active Storage Fabrics model to address petascale storage challenges
We present the Active Storage Fabrics (ASF) model for storage embedded parallel processing as a way to address petascale data intensive challenges. ASF is aimed at emerging scalable system-on-a-chip, storage class memory architectures, but may be ...