๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Cooperative Diagnosis and Routing in Fault-Tolerant Multiprocessor Systems

โœ Scribed by D.M. Blough; H.Y. Wang


Publisher
Elsevier Science
Year
1995
Tongue
English
Weight
671 KB
Volume
27
Category
Article
ISSN
0743-7315

No coin nor oath required. For personal study only.

โœฆ Synopsis


In this note, we consider the problem of fault-tolerant routing in multiprocessor systems when incomplete, or partial, diagnostic information is available. We first define a new type of partial diagnosis, known as (k)-reachability diagnosis. The overhead for (k)-reachability diagnosis increases with (k), which specifies the radius of diagnostic information maintained by each node. We then present a routing algorithm, known as Algorithm Partial Route, that makes use of (k)-reachability diagnostic information and allows a trade-off between the amount of diagnostic information and the quality of routing. Partial Route is the first algorithm capable of handling systems of arbitrary topology containing an arbitrary number of faults. The worst-case performance of the algorithm in an (n)-node system, is shown to be optimal when (k=n-1) and within a factor of 2 of optimal when (k=1). Simulation results on meshes and hypercubes are also presented that show, in the average case, Algorithm Partial Route is nearly optimal for relatively Small values of (k). 1995 Academic Press, inc.


๐Ÿ“œ SIMILAR VOLUMES


Optimal adaptive fault diagnosis for sim
โœ Kranakis, Evangelos; Pelc, Andrzej; Spatharis, Anthony ๐Ÿ“‚ Article ๐Ÿ“… 1999 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 122 KB

We studied adaptive system-level fault diagnosis for multiprocessor systems. Processors can test each other and future tests can be selected on the basis of previous test results. Fault-free testers give always correct test results, while faulty testers are completely unreliable. The aim of diagnosi

Cluster fault-tolerant routing in star g
โœ Gu, Qian-Ping; Peng, Shietung ๐Ÿ“‚ Article ๐Ÿ“… 2000 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 150 KB ๐Ÿ‘ 2 views

Fault-tolerant routing is a key issue in computer/ communication networks. We say a network (graph) can tolerate l faulty nodes for a routing problem if after removing at most l arbitrary faulty nodes from the graph the routing paths exist for the routing problem. However, the bound l is usually a w