𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Reparallelization techniques for migrating OpenMP codes in computational grids

✍ Scribed by Michael Klemm; Matthias Bezold; Stefan Gabriel; Ronald Veldema; Michael Philippsen


Book ID
102809677
Publisher
John Wiley and Sons
Year
2009
Tongue
English
Weight
754 KB
Volume
21
Category
Article
ISSN
1532-0626

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

Typical computational grid users target only a single cluster and have to estimate the runtime of their jobs. Job schedulers prefer short‐running jobs to maintain a high system utilization. If the user underestimates the runtime, premature termination causes computation loss; overestimation is penalized by long queue times. As a solution, we present an automatic reparallelization and migration of OpenMP applications. A reparallelization is dynamically computed for an OpenMP work distribution when the number of CPUs changes. The application can be migrated between clusters when an allocated time slice is exceeded. Migration is based on a coordinated, heterogeneous checkpointing algorithm. Both reparallelization and migration enable the user to freely use computing time at more than a single point of the grid. Our demo applications successfully adapt to the changed CPU setting and smoothly migrate between, for example, clusters in Erlangen, Germany, and Amsterdam, the Netherlands, that use different kinds and numbers of processors. Benchmarks show that reparallelization and migration impose average overheads of about 4 and 2%, respectively. Copyright Β© 2008 John Wiley & Sons, Ltd.


πŸ“œ SIMILAR VOLUMES


Parallelization of the Scale-Changing Te
✍ F. Khalil; C. J. Barrios-Hernandez; A. Rashid; H. Aubert; Y. Denneulin; F. Cocce πŸ“‚ Article πŸ“… 2010 πŸ› John Wiley and Sons 🌐 English βš– 564 KB

A parallel computing approach to run fast and full-wave electromagnetic simulation of complex structures in Grid Computing environment is presented. In this study, we show how Grid Computing improves speed and/or reliability over that provided by a single computer, while typically being much more co