๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

EXPLORER: Supporting run-time parallelization of DOACROSS loops on general networks of workstations

โœ Scribed by Yung-Lin Liu; Chung-Ta King


Publisher
Elsevier Science
Year
2000
Tongue
English
Weight
485 KB
Volume
26
Category
Article
ISSN
0167-8191

No coin nor oath required. For personal study only.

โœฆ Synopsis


Performing run-time parallelization on general networks of workstations (NOWs) without special hardware or system software supports is very dicult, especially for DOACROSS loops. Parallelizing DOACROSS loops at run time implies complex operations for dependence detection, data accesses, and computation scheduling, which is hard to do systematically and could induce a large amount of messages. With the high communication overhead on NOWs, there is hardly any performance gain for run-time parallelization of DOACROSS loops. This paper proposes a general methodology for run-time parallelization of DOACROSS loops on NOWs. The methodology is realized by a portable run-time environment called EXPLORER, which can be implemented entirely at the user level. EXPLORER hides the communication overhead as much as possible through multithreading ยฑ a facility supported in almost all workstations. Considerations involved in the design of EXPLORER and its general organization will be introduced. Preliminary performance of EXPLORER will be presented and discussed. Performance measured on an implementation of EXPLORER on eight DEC Alpha workstations connected through a 10-Mbps Ethernet shows that the computation to communication ratio plays an important role and the extent to which multithreading can hide the communication overhead is limited. Thus EXPLORER is more suitable for coarse-grain computations.


๐Ÿ“œ SIMILAR VOLUMES


Efficient parallel timing simulation of
โœ Edwin Naroska; Feipei Lai; Rung-Ji Shang; Uwe Schwiegelshohn ๐Ÿ“‚ Article ๐Ÿ“… 2001 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 353 KB

In this paper we address the parallel timing simulation of synchronous VLSI designs on a network of workstations (NOWs). We suggest combining cycle based and conventional timing simulation techniques to achieve fast timing simulation even on NOWs which are typically characterized by low bandwidth an

Analysis of Load Average and its Relatio
โœ Trevor E. Meyer; James A. Davis; Jennifer L. Davidson ๐Ÿ“‚ Article ๐Ÿ“… 1997 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 151 KB

Parallel processing systems using networks of workstations are being used to provide an alternative to expensive parallel processors. Scheduling of tasks on these networks is an important and practical problem that must be addressed. Although CPU load is an important parameter to many of the propose