๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Performance Coupling: Case Studies for Improving the Performance of Scientific Applications

โœ Scribed by Jonathan Geisler; Valerie Taylor


Publisher
Elsevier Science
Year
2002
Tongue
English
Weight
237 KB
Volume
62
Category
Article
ISSN
0743-7315

No coin nor oath required. For personal study only.

โœฆ Synopsis


Traditional performance optimization techniques have focused on finding the kernel in an application that is the most time consuming and attempting to optimize it. In this paper, we focus on an optimization technique with a more global perspective of the application. In particular, we present a methodology for measuring the interaction, or coupling, between kernels within an application and describe how the measurements can be used to improve the performance of scientific applications. We discuss four case studies to demonstrate the use of this methodology. The first study involves the Conjugate Gradient Benchmark from the NAS Parallel Benchmarks. The coupling measurement aided in the development of a new hybrid data structure and corresponding algorithm that slightly increased the performance of the program. The second study involves the Block Tridiagonal NAS Parallel Benchmark, for which the coupling parameter aided in revising the program to reduce the level-two cache misses by 14%. Next, we introduce improvements to an application in the SpecJVM benchmark suite resulting in 41% reduction in level-one cache misses. Lastly, we present results from the Seis application from the SPEChpc Benchmarks to illustrate the coupling parameters that may result from large-scale scientific applications.


๐Ÿ“œ SIMILAR VOLUMES


Object Mobility for Performance Improvem
โœ Dror Garti; Shem-Tov Cohen; Amnon Barak; Arie Keren; Ricardo Szmit ๐Ÿ“‚ Article ๐Ÿ“… 2000 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 123 KB

This paper describes a framework that employs Java and mobile objects to provide an efficient, parallel-computing environment in a heterogeneous, scalable computing cluster. The framework consists of migratable active objects performing computational tasks and static objects for managing them. The p