
Fault-tolerant execution of large parameter sweep applications across multiple VOs with storage constraints

Authors: Shahaan Ayyub; David Abramson; Colin Enticott; Slavisa Garic; Jefferson Tan


Publisher: John Wiley and Sons
Year: 2009
Language: English
File size: 379 KB
Volume: 21
Category: Article
ISSN: 1532-0626


Abstract

Applications that span multiple virtual organizations (VOs) are of great interest to the e‐science community. However, our recent attempts to execute large‐scale parameter sweep applications (PSAs) for real‐world climate studies with the Nimrod/G tool have exposed problems in the areas of fault tolerance, data storage and trust management. In response, we have implemented a task‐splitting approach that facilitates breaking up large PSAs into a sequence of dependent subtasks, improving fault tolerance; provides a garbage collection technique that deletes unnecessary data; and employs a trust delegation technique that facilitates flexible third party data transfers across different VOs. Copyright © 2008 John Wiley & Sons, Ltd.
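
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea it names: a long job is split into a sequence of dependent subtasks, a failed subtask is retried from the last completed one rather than from the beginning, and intermediate data is deleted once nothing later depends on it. It is not Nimrod/G code. The names `run_chain`, `store`, and `max_retries` are hypothetical, the in-memory dictionary stands in for real grid storage, and the assumption that each subtask needs only its immediate predecessor's output is ours.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical subtask type: takes the previous subtask's output
# (None for the first subtask) and returns its own output.
Subtask = Callable[[Optional[int]], int]


def run_chain(subtasks: List[Subtask],
              store: Dict[int, int],
              max_retries: int = 2) -> int:
    """Run dependent subtasks in order, resuming after the latest checkpoint.

    `store` maps a subtask index to its saved output. Passing in a
    non-empty store resumes after the highest completed index.
    """
    if not subtasks:
        raise ValueError("at least one subtask is required")

    # Resume after the most recent checkpoint, if any.
    start = max(store) + 1 if store else 0

    for i in range(start, len(subtasks)):
        prev = store.get(i - 1)
        for attempt in range(max_retries + 1):
            try:
                store[i] = subtasks[i](prev)
                break
            except RuntimeError:
                # Retry only this subtask; earlier work is not repeated.
                if attempt == max_retries:
                    raise
        # Garbage collection: under the stated assumption, the predecessor's
        # output is no longer needed once subtask i has succeeded.
        store.pop(i - 1, None)

    return store[len(subtasks) - 1]
```

In this sketch a transient failure in one subtask costs only a rerun of that subtask, and at most two outputs are held in `store` at any time, which is the storage saving the garbage collection step is meant to illustrate.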