๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Parallel Implementation of BLAS: General Techniques for Level 3 BLAS

โœ Scribed by Chtchelkanova A., Gunnels J., Morrow G.


Book ID
127400899
Year
1995
Tongue
English
Weight
80 KB
Category
Library

No coin nor oath required. For personal study only.

โœฆ Synopsis


In this paper, we present straight forward techniques for a highly efficient, scalable implementation of common matrix-matrix operations generally known as the Level S Basic Linear Algebra Subprograms (BLAS). This work builds on our recent discovery of a parallel matrix-matrix multiplication implementation, which has yielded superior performance, and requires little work space. We show that the techniques used for the matrix-matrix multiplication naturally extend to all important level 3 BLAS and thus this approach becomes an enabling technology for efficient parallel implementation of these routines and libraries that use BLAS. Representative performance results on the Intel Paragon system, are given.


๐Ÿ“œ SIMILAR VOLUMES


Parallel implementation of BLAS: general
โœ Chtchelkanova, Almadena; Gunnels, John; Morrow, Greg; Overfelt, James; van de Ge ๐Ÿ“‚ Article ๐Ÿ“… 1997 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 211 KB

In this paper, we present straightforward techniques for a highly efficient, scalable implementation of common matrix-matrix operations generally known as the Level 3 Basic Linear Algebra Subprograms (BLAS). This work builds on our recent discovery of a parallel matrix-matrix multiplication implemen

Parallelizing a Level 3 BLAS Library for
โœ Kuo-Chan Huang; Feng-Jian Wang; Pei-Chi Wu ๐Ÿ“‚ Article ๐Ÿ“… 1996 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 222 KB

LAN-connected workstations are a heterogeneous environment, where each workstation provides time-varying computing power, and thus dynamic load balancing mechanisms are necessary for parallel applications to run efficiently. Parallel basic linear algebra subprograms (BLAS) have recently shown promis