The performance of the BLAS and LAPACK on a shared memory scalar multiprocessor
β Scribed by Chris Phillips
- Book ID
- 104304350
- Publisher
- Elsevier Science
- Year
- 1991
- Tongue
- English
- Weight
- 653 KB
- Volume
- 17
- Category
- Article
- ISSN
- 0167-8191
No coin nor oath required. For personal study only.
β¦ Synopsis
LAPACK is a set of Fortran subroutines covering a wide area of linear algebra algorithms. It was developed with the intention of being portable across a range of parallel processing environments. We consider here the performance of the Basic Linear Algebra Subprograms (BLAS) on which this software is based, on a shared memory multiprocessor, with each processing element possessing scalar characteristics only. We then investigate the effect of the block size on the performance of certain of the LAPACK routines when executed in the same environment.
π SIMILAR VOLUMES
The latest processor generations-e.g., HPPA 8000, MIPS R10000 or Ultra SPARC-include a monitoring unit. A processor monitor can count events like read/write cache misses and processor stall cycles due to load and store operations. This information is usually only used for offline profiling. However,