may be performed at an intermediate node as soon as the header of the message arrives with the destination information. Thus, with cut-through, only an on-line flit buffer is required to examine the message header at each intermediate node. If cut-throughs are established through all intermediate no
A Communication Architecture for a Massively Parallel Message-Passing Multicomputer
Scribed by C. Germain; J.L. Bechennec; D. Etiemble; J.P. Sansonnet
- Publisher: Elsevier Science
- Year: 1993
- Language: English
- File size: 916 KB
- Volume: 19
- Category: Article
- ISSN: 0743-7315
Synopsis
Communication networks stress the distinction between classical and massively parallel architectures. The network is the key for the whole architecture efficiency, while severe technological constraints limit the possible choices. This paper presents forced routing, which is the routing strategy of the MEGA massively parallel architecture. Forced routing is a trade-off between deterministic and randomized routing. In the absence of contention, messages follow one shortest path. As the contention increases, messages are randomly spread in the network. Moreover, the algorithm is quite simple and it only needs minor buffering capacity at each node; thus it is well adapted to hardwired implementation. In this paper, we test the performance of the forced and greedy strategies on a variety of communication patterns representing synchronous and asynchronous algorithms. Finally, we sketch a description of the circuit implementing the forced routing, in order to prove the technical feasibility of this appealing algorithm upon the constraint of a mono-chip node. © 1993 Academic Press, Inc.
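The trade-off described in the synopsis (one shortest path when links are free, random spreading under contention) can be illustrated with a toy per-node decision. This is only a sketch under stated assumptions, not the MEGA algorithm or its circuit: the 2D mesh, the X-then-Y preferred path, the `busy_links` contention model, and the names `preferred_direction` and `choose_output` are all hypothetical, and mesh boundaries are ignored.

```python
import random

# Hypothetical 2D-mesh directions; the synopsis does not specify the MEGA topology.
DIRECTIONS = {"E": (1, 0), "W": (-1, 0), "N": (0, 1), "S": (0, -1)}


def preferred_direction(current, dest):
    """Return the X-then-Y direction toward dest, or None if already there."""
    (cx, cy), (dx, dy) = current, dest
    if cx != dx:
        return "E" if dx > cx else "W"
    if cy != dy:
        return "N" if dy > cy else "S"
    return None


def choose_output(current, dest, busy_links, rng=random):
    """Pick an output link for one message at one node (assumed model).

    busy_links is the set of direction names already taken this cycle.
    Returns a direction name, or None if the message has arrived or
    no output link is free.
    """
    want = preferred_direction(current, dest)
    if want is None:
        return None  # already at the destination
    if want not in busy_links:
        return want  # no contention: stay on the single preferred shortest path
    free = sorted(d for d in DIRECTIONS if d not in busy_links)
    if not free:
        return None  # every link is busy; the caller has to hold the message
    return rng.choice(free)  # contention: send the message out on a random free link
```

With no busy links, `choose_output((0, 0), (2, 0), set())` returns the preferred direction `"E"`; once `"E"` is marked busy, the message leaves on one of the remaining free links chosen at random, so it needs no queue at the node in this toy model.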
SIMILAR VOLUMES
When scaling message-passing applications to thousands of processors, their performance is often affected by wait states that occur when processes fail to reach synchronization points simultaneously. As a first step in reducing the performance impact, we have shown in our earlier work that wait stat
FLASH is a publicly available high performance application code which has evolved into a modular, extensible software system from a collection of unconnected legacy codes. FLASH has been successful because its capabilities have been driven by the needs of scientific applications, without compromisin