This paper addresses optimal mapping of parallel programs composed of a chain of data parallel tasks onto the processors of a parallel system. The input to the programs is a stream of data sets, each of which is processed in order by the chain of tasks. This computation structure, also referred to a
Analysis and Optimization of Software Pipeline Performance on MIMD Parallel Computers
Authors: Rob F. Van der Wijngaart; Sekhar R. Sarukkai; Pankaj Mehra
- Publisher: Elsevier Science
- Year: 1996
- Language: English
- File size: 668 KB
- Volume: 38
- Category: Article
- ISSN: 0743-7315
SIMILAR VOLUMES
We propose a technique for constructing fault-tolerant parallel software for general commercial massively parallel computers which are not provided with special fault-tolerant functions. This technique is a hybrid of the primary/backup approach and state machine approach, and can implement paralle
An analysis is presented of the primary factors influencing the performance of a parallel implementation of the UCLA atmospheric general circulation model (AGCM) on distributed-memory, massively parallel computer systems. Several modifications to the original parallel AGCM code aimed at improving its
Computational protein design will continue to improve as new implementations and parameterizations are explored. An automated protein design procedure is implemented and applied to the full redesign of 16 globular proteins. We combine established but simple ingredients: a molecular mech
A two-dimensional \((h, p)\) finite element scheme for distributed parallel computation is developed. The approach is based on an element-by-element domain decomposition and is implemented on the nCUBE2 system. Example problems are used to demonstrate performance of the algorithm for a range of \((h
We show how a reconfigurable network of transputers can be used to serve as a fast neural computing machine. Software implementation and hardware configuration are presented. The individual updating of the neurons in the neural network is performed following the parallel synchronous or maximum field