✦ LIBER ✦

Parallel, multigrain iterative solvers for hiding network latencies on MPPs and networks of clusters

✍ Scribed by James R. McCombs; Andreas Stathopoulos

Book ID: 104304270
Publisher: Elsevier Science
Year: 2003
Tongue: English
Weight: 338 KB
Volume: 29
Category: Article
ISSN: 0167-8191
DOI: 10.1016/s0167-8191(03)00101-7

No coin nor oath required. For personal study only.

✦ Synopsis

Parallel iterative solvers are often the only means of solving large linear systems and eigenproblems. However, these solvers are usually implemented in a fine-grain manner and can incur significant performance penalties due to synchronization overheads on large MPPs. This problem is exacerbated in clusters of workstations (COWs) and SMPs that are interconnected via a hierarchy of networks. In this paper, we describe a novel scheme for hiding the synchronization overheads, and thus improving scalability, of block iterative solvers that employ a correction equation through an inner iterative method.

Block methods are not only robust in the presence of eigenvalue multiplicities and multiple right-hand sides, but provide better latency tolerance by performing more floating-point operations between synchronizations. We take a different approach to inducing latency tolerance by increasing the granularity at which the correction equation is solved for each block vector. This is accomplished by splitting the processors into smaller subgroups which are then used to solve the correction for each block vector concurrently. The rest of the algorithm is still performed in fine grain. We call this combination of fine and coarse-grain parallelism multigrain parallelism.

We implemented a multigrain, block Jacobi-Davidson algorithm for computing the extreme eigenvalues of a symmetric matrix. We obtained improvements of 45-50% over both the

📜 SIMILAR VOLUMES

[IEEE Comput. Soc. Press 1994 Internatio

[IEEE Comput. Soc. Press 1994 International Conference on Parallel and Distributed Systems - Hsinchu, Taiwan (19-21 Dec. 1994)] Proceedings of 1994 International Conference on Parallel and Distributed Systems - Exploiting communication latency hiding for parallel network computing: model and analysis

✍ Strumpen, V.; Casavant, T.L. 📂 Article 📅 1994 🏛 IEEE Comput. Soc. Press ⚖ 486 KB

[IEEE 2009 Sixth IFIP International Conf

[IEEE 2009 Sixth IFIP International Conference on Network and Parallel Computing (NPC) - Gold Coast, Australia (2009.10.19-2009.10.21)] 2009 Sixth IFIP International Conference on Network and Parallel Computing - Optimizing Live Migration of Virtual Machines in SMP Clusters for HPC Applications

✍ Atif, Muhammad; Strazdins, Peter 📂 Article 📅 2009 🏛 IEEE ⚖ 389 KB

[IEEE 2013 21st Euromicro International

[IEEE 2013 21st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP 2013) - Belfast (2013.2.27-2013.3.1)] 2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing - Distributed Iterative Solution of Numerical Simulation Problems on Infiniband and Ethernet Clusters via the P2PSAP Self-Adaptive Protocol

✍ Tembo, S. R.; The Tung Nguyen, ; El Baz, D. 📂 Article 📅 2013 🏛 IEEE ⚖ 504 KB