WORKS'11 was the sixth edition of the WORKS workshop series. The call for papers attracted 23 submissions from Europe, North America, and South America. The program committee accepted 15 papers covering a variety of topics, ranging from large-scale workflow execution management (scalability, fault tolerance, performance, optimization, etc.) to workflow exploitation (reuse, portability, interoperability, traceability, etc.). The workshop also featured an inspiring keynote by former WORKS chair Ewa Deelman, who surveyed past editions and future trends. Attendance exceeded expectations, with more than 80 participants.
To foster discussion and encourage exchanges between researchers working on closely related topics, we organized mini-panel discussions at the end of each thematic session this year. The five sessions confirmed the keen interest in distributed-computing workflows across several communities: experts in distributed computing systems as well as end users in need of high-end, accessible computing infrastructures. Traditional research topics from distributed computing were well represented, with papers on the scalability, fault tolerance, performance, and optimization of workflow management systems. The now well-established theme of scientific data provenance in workflows was also well covered. In addition, we observed growing interest in workflow reuse and workflow system interoperability, a sign of the maturity of the scientific workflows community. With extended usage of existing solutions, users are increasingly looking for online workflow resources. Beyond mere computational capability, workflows are also used for knowledge and know-how transfer, giving rise to new needs such as community distribution of workflows, high-level representation, and automated transformation.
Proceeding Downloads
Scientific workflow reuse through conceptual workflows on the virtual imaging platform
An increasing number of scientific experiments are "in-silico": carried out at least partially using computers. Scientific Workflows have become a key tool to model and implement such experiments, but they tangle domain knowledge, technical know-how and ...
Workflow overhead analysis and optimizations
The execution of scientific workflows often suffers from a variety of overheads in distributed environments. It is essential to identify the different overheads and to evaluate how optimization methods help reduce overheads and improve runtime ...
Provenance for MapReduce-based data-intensive workflows
MapReduce has been widely adopted by many business and scientific applications for data-intensive processing of large datasets. There are increasing efforts for workflows and systems to work with the MapReduce programming model and the Hadoop ...
Supporting dynamic parameter sweep in adaptive and user-steered workflow
Large-scale experiments in computational science are complex to manage. Due to their exploratory nature, several iterations are needed to evaluate a large space of parameter combinations. Scientists analyze partial results and dynamically intervene in the next steps of ...
Optimizing bioinformatics workflows for data analysis using cloud management techniques
With the rapid development in recent years of high-throughput technologies in the life sciences, huge amounts of data are being generated and stored in databases. Despite significant advances in computing capacity and performance, an analysis of these ...
A new approach for publishing workflows: abstractions, standards, and linked data
In recent years, a variety of systems have been developed that export the workflows used to analyze data and make them part of published articles. We argue that the workflows that are published in current approaches are dependent on the specific codes ...
Provenance opportunities for WS-VLAM: an exploration of an e-science and an e-business approach
Scientific applications are frequently modeled as a workflow that is executed under the control of a workflow management system. One crucial requirement during the execution is the validation of the generated results and the traceability of the ...
Object reuse and exchange for publishing and sharing workflows
The workflow paradigm can provide the means to describe the complete functional pipeline for a scientific experiment and therefore expose the underlying scientific processes for enabling the reproducibility of results. However, current means for ...
Making data analysis expertise broadly accessible through workflows
The demand for advanced skills in data analysis spans many areas of science, computing, and business analytics. This paper discusses how non-expert users reuse workflows created by experts and representing complex data mining processes for text ...
Exploring workflow interoperability tools for neuroimaging data analysis
- Vladimir Korkhov,
- Dagmar Krefting,
- Tamas Kukla,
- Gabor Z. Terstyanszky,
- Matthan Caan,
- Silvia D. Olabarriaga
Neuroimaging is a field that benefits from distributed computing infrastructures (DCIs) to perform data processing and analysis, which is often achieved using grid workflow systems. Collaborative research in neuroimaging requires ways to facilitate ...
IWIR: a language enabling portability across grid workflow systems
Today there are many different scientific Grid workflow management systems using a wide array of custom workflow languages. Some of them are geared towards a data-based view, some are geared towards a control-flow based view and others try to be as ...
Failure prediction and localization in large scientific workflows
Scientific workflows provide a portable representation for scientific applications' coordinated input, output, and execution management for highly parallel executions of interdependent computations, as well as support for sharing and validating the ...
Characterizing quality of resilience in scientific workflows
The enactment of scientific workflows involves the distribution of tasks to distributed resources that exist in different administrative domains. Such resources can range in granularity from a single machine to one or more clusters and file systems. The ...
Achieving reproducibility by combining provenance with service and workflow versioning
Capturing and exploiting provenance information is considered to be important across a range of scientific, medical, commercial and Web applications, including recent trends towards publishing provenance-rich, executable papers. This article shows how ...
AME: an anyscale many-task computing engine
Many-Task Computing (MTC) is a new application category that encompasses increasingly popular applications in biology, economics, and statistics. The high inter-task parallelism and data-intensive processing capabilities of these applications pose new ...
Proceedings of the 6th Workshop on Workflows in Support of Large-Scale Science