๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Vector performance estimation for CRAY X-MP/Y-MP supercomputers

โœ Scribed by Allen R. Hainline; Steven R. Thompson; Lawrence L. Halcomb


Publisher
Springer US
Year
1992
Tongue
English
Weight
931 KB
Volume
6
Category
Article
ISSN
0920-8542

No coin nor oath required. For personal study only.

โœฆ Synopsis


Optimization of vector-intensive applications for the CRAY X-MP/Y-MP often requires arranging the operations to take full advantage of such architectural features as the memory system, independent memory ports, chaining, and independent functional units. Estimation of performance is not straightforward since many operations can occur concurrently. As a tool for making trades between vector algorithms, a method has been developed and used successfully at E-Systems Inc. to predict the execution time of a sequence of vector operations without resorting to actual code development. This method reduced our software development time, produced significantly more efficient code, and provided for a systematic approach to optimization. The performance estimation is generally accurate to within 10% and accounts for memory conflicts that result from fixed stride references.


๐Ÿ“œ SIMILAR VOLUMES


CRAY X-MP and Y-MP memory performance
โœ Ulrich Detert; Gerd Hofemann ๐Ÿ“‚ Article ๐Ÿ“… 1991 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 597 KB

This paper describes investigations on the memory performance of the shared memory systems Cray X-MP and Cray Y-MP. Single and multiple CPU performance will be considered and special emphasis will be put on performance differences that result from differences in the interconnect network between memo

Ultrahigh-performance FFTs for the CRAY-
โœ David A. Carlson ๐Ÿ“‚ Article ๐Ÿ“… 1992 ๐Ÿ› Springer US ๐ŸŒ English โš– 568 KB

In this paper a set of techniques for improving the performance of the fast Fourier transform (FFT) algorithm on modern vector-oriented supercomputers is presented. Single-processor FFT implementations based on these techniques are developed for the CRAY-2 and the CRAY Y-MP, and it is shown that the

Performance comparison of the CRAY-2 and
โœ Margaret L. Simmons; Harvey J. Wasserman ๐Ÿ“‚ Article ๐Ÿ“… 1990 ๐Ÿ› Springer US ๐ŸŒ English โš– 702 KB

The serial and parallel performance of one of the world's fastest general purpose computers, the CRAY-2, is analyzed using the standard Los Alamos Benchmark Set plus codes adapted for parallel processing. For comparison, architectural and performance data are also given for the CRAY X-MP/416. Factor