✦ LIBER ✦

OpenMP-oriented applications for distributed shared memory architectures

✍ Scribed by Ami Marowka; Zhenying Liu; Barbara Chapman

Publisher: John Wiley and Sons
Year: 2004
Tongue: English
Weight: 222 KB
Volume: 16
Category: Article
ISSN: 1532-0626
DOI: 10.1002/cpe.752

No coin nor oath required. For personal study only.

✦ Synopsis

Abstract

The rapid rise of OpenMP as the preferred parallel programming paradigm for small‐to‐medium scale parallelism could slow unless OpenMP can show capabilities for becoming the model‐of‐choice for large scale high‐performance parallel computing in the coming decade.

The main stumbling block for the adaptation of OpenMP to distributed shared memory (DSM) machines, which are based on architectures like cc‐NUMA, stems from the lack of capabilities for data placement among processors and threads for achieving data locality. The absence of such a mechanism causes remote memory accesses and inefficient cache memory use, both of which lead to poor performance.

This paper presents a simple software programming approach called copy‐inside–copy‐back (CC) that exploits the data privatization mechanism of OpenMP for data placement and replacement. This technique enables one to distribute data manually without taking away control and flexibility from the programmer and is thus an alternative to the automat and implicit approaches. Moreover, the CC approach improves on the OpenMP‐SPMD style of programming that makes the development process of an OpenMP application more structured and simpler.

The CC technique was tested and analyzed using the NAS Parallel Benchmarks on SGI Origin 2000 multiprocessor machines. This study shows that OpenMP improves performance of coarse‐grained parallelism, although a fast copy mechanism is essential. Copyright © 2004 John Wiley & Sons, Ltd.

📜 SIMILAR VOLUMES

Study of OpenMP applications on the Infi

Study of OpenMP applications on the InfiniBand-based software distributed shared-memory system

✍ Inho Park; Seon Wook Kim 📂 Article 📅 2005 🏛 Elsevier Science 🌐 English ⚖ 649 KB

A Scalable Distributed Shared Memory Arc

A Scalable Distributed Shared Memory Architecture

✍ S. Krishnamoorthy; A. Choudhary 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 736 KB

Scalability of a multiprocessor architecture depends on its ability to manage interconnection network latency with increasing number of processors. Interconnection network latency can be minimized by reducing the distance traversed by a message in terms of number of nodes and wire lengths. Scalabili

Achieving performance under OpenMP on cc

Achieving performance under OpenMP on ccNUMA and software distributed shared memory systems

✍ B. Chapman; F. Bregier; A. Patil; A. Prabhakar 📂 Article 📅 2002 🏛 John Wiley and Sons 🌐 English ⚖ 248 KB

A hierarchical distributed-shared memory

A hierarchical distributed-shared memory parallel Branch&Bound application with PVM and OpenMP for multiprocessor clusters

✍ Rocco Aversa; Beniamino Di Martino; Nicola Mazzocca; Salvatore Venticinque 📂 Article 📅 2005 🏛 Elsevier Science 🌐 English ⚖ 350 KB

The Lanczos algorithm for the generalize

The Lanczos algorithm for the generalized symmetric eigenproblem on shared-memory architectures

✍ Mark T. Jones; Merrell L. Patrick 📂 Article 📅 1993 🏛 Elsevier Science 🌐 English ⚖ 934 KB

OOPS: an object-oriented particle simula

OOPS: an object-oriented particle simulation class library for distributed architectures

✍ John V.W. Reynders; David W. Forslund; Paul J. Hinker; Marydell Tholburn; David 📂 Article 📅 1995 🏛 Elsevier Science 🌐 English ⚖ 928 KB