Generating local addresses and communication sets is an important issue in distributed-memory implementations of data-parallel languages such as High Performance Fortran. We demonstrate a storage scheme for an array \(A\) affinely aligned to a template that is distributed across \(p\) processors wit
Efficient Address Generation for Affine Subscripts in Data-Parallel Programs
โ Scribed by Kuei-Ping Shih; Jang-Ping Sheu; Chih-Yung Chang
- Book ID
- 110259876
- Publisher
- Springer US
- Year
- 2000
- Tongue
- English
- Weight
- 324 KB
- Volume
- 17
- Category
- Article
- ISSN
- 0920-8542
No coin nor oath required. For personal study only.
๐ SIMILAR VOLUMES
This paper presents compilation techniques used to compress holes, which are caused by the nonunit alignment stride in a two-level data-processor mapping. Holes are the memory locations mapped by useless template cells. To fully utilize the memory space, memory holes should be removed. In a two-leve
Arrays are mapped to processors through a two-step process-alignment followed by distribution-in data-parallel languages such as High Performance Fortran. This process of mapping creates disjoint pieces of the array that are locally owned by each processor. An HPF compiler that generates code for ar