Analysis for performance and reliability of fault-tolerant parallel software
โ Scribed by Eiji Sugino; Haruo Yokota
- Publisher
- John Wiley and Sons
- Year
- 2000
- Tongue
- English
- Weight
- 193 KB
- Volume
- 31
- Category
- Article
- ISSN
- 0882-1666
No coin nor oath required. For personal study only.
โฆ Synopsis
We propose a technique for constructing a fault-tolerant parallel software for general commercial massively parallel computers which are not provided with special fault-tolerant functions. This technique is a hybrid of the primary/backup approach and state machine approach, and can implement parallel programs in fault tolerance by automatically converting user programs. In general, when a parallel system is to be used as a fault-tolerant computer, since parallel entities are used as redundant elements for obtaining fault tolerance, the maximum performance will decrease concurrently with the improvement of reliability. Moreover, it is necessary to consider the performance drop for processing which is supplementary to the original program in fault-tolerant implementation by software. Therefore, a gain by fault-tolerant implementation cannot be shown if it is merely demonstrated that an improvement of the reliability is obtained. In this paper, we define an evaluation index which takes into account reliability improvement and performance drop; based on this index, we study the execution environment which can tolerate practical use for fault-tolerant parallel software.
๐ SIMILAR VOLUMES
There is a continuing need for increased throughput in the evaluation of new drug entities in terms of their pharmacokinetic (PK) parameters. This report describes an alternative procedure for increasing the throughput of plasma samples assayed in one overnight analysis: the use of parallel high per