In this paper a set of techniques for improving the performance of the fast Fourier transform (FFT) algorithm on modern vector-oriented supercomputers is presented. Single-processor FFT implementations based on these techniques are developed for the CRAY-2 and the CRAY Y-MP, and it is shown that the
Implementation of the Level 2 and 3 BLAS on the CRAY Y-MP and the CRAY-2
โ Scribed by Qasim Sheikh; Phuong Vu; Chao Yang; Michael Merchant
- Publisher
- Springer US
- Year
- 1992
- Tongue
- English
- Weight
- 777 KB
- Volume
- 5
- Category
- Article
- ISSN
- 0920-8542
No coin nor oath required. For personal study only.
๐ SIMILAR VOLUMES
The serial and parallel performance of one of the world's fastest general purpose computers, the CRAY-2, is analyzed using the standard Los Alamos Benchmark Set plus codes adapted for parallel processing. For comparison, architectural and performance data are also given for the CRAY X-MP/416. Factor
The CRAY-2 is considered to be one of the most powerful supercomputers. Its state-of-the-art technology features a faster clock and more memory than any other supercomputer available today. In this report the single processor performance of the CRAY-2 is compared with the older, more mature CRAY X-M
Various scientific codes were benchmarked on three vector computers: the CRAY X-MP/48 and CRAY-2 supercomputers and the SCS-40/XM minisupercomputer. On the X-MP, two Fortran compilers were also compared. The benchmarks, which were initially all in Fortran, consisted of six research codes from Caltec