𝔖 Bobbio Scriptorium
✦   LIBER   ✦

The performance of the BLAS and LAPACK on a shared memory scalar multiprocessor

✍ Scribed by Chris Phillips


Book ID: 104304350
Publisher: Elsevier Science
Year: 1991
Tongue: English
Weight: 653 KB
Volume: 17
Category: Article
ISSN: 0167-8191


✦ Synopsis


LAPACK is a set of Fortran subroutines covering a wide area of linear algebra algorithms. It was developed with the intention of being portable across a range of parallel processing environments. We consider here the performance of the Basic Linear Algebra Subprograms (BLAS) on which this software is based, on a shared memory multiprocessor, with each processing element possessing scalar characteristics only. We then investigate the effect of the block size on the performance of certain of the LAPACK routines when executed in the same environment.


📜 SIMILAR VOLUMES


The Performance Implications of Locality
✍ Frank Bellosa; Martin Steckermeier 📂 Article 📅 1996 🏛 Elsevier Science 🌐 English ⚖ 326 KB

The latest processor generations (e.g., HPPA 8000, MIPS R10000 or Ultra SPARC) include a monitoring unit. A processor monitor can count events such as read/write cache misses and processor stall cycles due to load and store operations. This information is usually used only for offline profiling. However,