Architectures and message-passing algorithms for cluster computing: Design and performance
- Authors
- Edward K. Blum; Xin Wang; Patrick Leung
- Publisher
- Elsevier Science
- Year
- 2000
- Language
- English
- File size
- 239 KB
- Volume
- 26
- Category
- Article
- ISSN
- 0167-8191
Synopsis
This paper considers the architecture of clusters and related message-passing (MP) software algorithms and their effect on performance (speedup and efficiency) of cluster computing (CC). We present new architectures for multi-segment Ethernet clusters and new MP algorithms which fit these architectures. The multiple segments (e.g. commodity hubs) connect commodity processor nodes so as to allow MP to be highly parallelized by avoiding network contention and collisions in many applications where the all-gather and other collective operations are central. We analyze all-gather in some detail, and present new network topologies and new MP algorithms to minimize latency. The new topologies are based on a design, called two-by-four nets (2 × 4 nets), by Compbionics. An integrated MP software system, called Reduced Overhead Cluster Communication (ROCC), which embodies the MP algorithms is also described. In brief, 2 × 4 nets are networks of "supernodes", called 2 × 4's, each having 4 processors on 2 segments, with segments usually being Ethernet hubs. The supernodes are typically connected to form rings or tori of supernodes. We present actual test results and supporting analyses to demonstrate that 2 × 4 nets with the ROCC MP software are faster than many existing clusters and generally less costly.
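The synopsis centers on the all-gather collective over ring-connected nodes. As a minimal illustration (not taken from the paper, and ignoring segment contention entirely), the classic ring all-gather can be simulated as below; `ring_all_gather` is a hypothetical helper name, and each of the P processes forwards one block per step to its ring neighbour until all hold every block after P − 1 steps:

```python
def ring_all_gather(blocks):
    """Simulate all-gather on a ring of len(blocks) processes.

    blocks[i] is the data block initially held by process i.
    Returns a list where result[i] is process i's final buffer.
    """
    p = len(blocks)
    # Each process starts with only its own block in the matching slot.
    buffers = [[None] * p for _ in range(p)]
    for i in range(p):
        buffers[i][i] = blocks[i]

    # In step s, process i forwards the block it obtained s steps ago
    # (block index (i - s) mod p) to its ring neighbour (i + 1) mod p.
    for s in range(p - 1):
        for i in range(p):
            idx = (i - s) % p
            buffers[(i + 1) % p][idx] = blocks[idx]

    return buffers

result = ring_all_gather(["a", "b", "c", "d"])
# After p - 1 = 3 steps, every process holds all four blocks.
```

Each process sends and receives exactly one block per step, so the P − 1 steps pipeline cleanly on a ring; the paper's contribution, per the synopsis, is arranging segments so these transfers proceed without Ethernet collisions.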
SIMILAR VOLUMES
This paper discusses a multithreaded software architecture for message-passing interface (MPI) software specification. The architecture is thread-safe, allows for concurrent communication over several communications media (multifabric communication), efficiently utilizes available hardware concurrency…
The finite difference time domain method (FDTD) solves Maxwell's equations by employing numerically and storage intensive computation to map the electric and magnetic fields within a finite volume as an explicit function of time. Distributed computation, using heterogeneous networks of computers, is a c…
Today's massively parallel machines are typically message-passing systems consisting of hundreds or thousands of processors. Implementing parallel applications efficiently in this environment is a challenging task, and poor parallel design decisions can be expensive to correct. Tools and techniques…
This paper provides a survey of both architectural and algorithmic aspects of solving problems using parallel processors with ring, torus and hypercube interconnection.