✦ LIBER ✦

Compiling High Performance Fortran for distributed-memory architectures

✍ Scribed by Siegfried Benkner; Hans Zima

Publisher: Elsevier Science
Year: 1999
Tongue: English
Weight: 509 KB
Volume: 25
Category: Article
ISSN: 0167-8191
DOI: 10.1016/s0167-8191(99)00074-5

No coin nor oath required. For personal study only.

✦ Synopsis

High Performance Fortran (HPF) is a data-parallel language that provides a high-level interface for programming scienti®c applications, while delegating to the compiler the task of generating explicitly parallel message-passing programs. This paper provides an overview of HPF compilation and runtime technology for distributed-memory architectures, and deals with a number of topics in some detail. In particular, we discuss distribution and alignment processing, the basic compilation scheme and methods for the optimization of regular computations. A separate section is devoted to the transformation and optimization of independent loops with irregular data accesses. The paper concludes with a discussion of research issues and outlines potential future development paths of the language.

📜 SIMILAR VOLUMES

Compiling Fortran 90D/HPF for Distribute

Compiling Fortran 90D/HPF for Distributed Memory MIMD Computers

✍ Z. Bozkus; A. Choudhary; G. Fox; T. Haupt; S. Ranka; M.Y. Wu 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 1013 KB

Algorithm 1 (Compiling Align directives) Input: Fortran 90D/HPF syntax tree with some alignment functions to template Output: Fortran 90D/HPF syntax tree with identical alignment functions to template Method: For each aligned array, and for each dimension of that array, carry out the following ste

Compiling programs for distributed-memor

Compiling programs for distributed-memory multiprocessors

✍ David Callahan; Ken Kennedy 📂 Article 📅 1988 🏛 Springer US 🌐 English ⚖ 962 KB

We describe a new approach to programming distributed-memory computers. Rather than having each node in the system explicitly programmed, we derive an efficient message-passing program from a sequential shared-memory program annotated with directions on how elements of shared arrays are distributed

Compiling Array Expressions for Efficien

Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines

✍ S.K.S. Gupta; S.D. Kaushik; C.-H. Huang; P. Sadayappan 📂 Article 📅 1996 🏛 Elsevier Science 🌐 English ⚖ 502 KB

data-parallelism in these languages. Array expressions involve array sections which consist of array elements from a lower index to an upper index at a fixed stride. In order to generate high-performance target code, compilers for distributed-memory machines should produce efficient code for array s

Performance Analysis of the Parallel Kar

Performance Analysis of the Parallel Karatsuba Multiplication Algorithm for Distributed Memory Architectures

✍ GIOVANNI CESARI; ROMAN MAEDER 📂 Article 📅 1996 🏛 Elsevier Science 🌐 English ⚖ 447 KB

We present three parallel implementations of the Karatsuba algorithm for long integer multiplication on a distributed memory architecture and discuss the experimental results obtained on a Paragon computer. The first two implementations have both time complexity O(n) on n log 2 3 processors, but pre

A distributed-memory, high-performance w

A distributed-memory, high-performance workstation

✍ R Bisiani; O Martin 📂 Article 📅 1992 🏛 Elsevier Science 🌐 English ⚖ 638 KB

OpenMP-oriented applications for distrib

OpenMP-oriented applications for distributed shared memory architectures

✍ Ami Marowka; Zhenying Liu; Barbara Chapman 📂 Article 📅 2004 🏛 John Wiley and Sons 🌐 English ⚖ 222 KB

## Abstract The rapid rise of OpenMP as the preferred parallel programming paradigm for small‐to‐medium scale parallelism could slow unless OpenMP can show capabilities for becoming the model‐of‐choice for large scale high‐performance parallel computing in the coming decade. The main stumbling blo