𝔖 Bobbio Scriptorium
✦   LIBER   ✦

A fast and accurate method for determining a lower bound on execution time

✍ Scribed by G. Fursin; M. F. P. O'Boyle; O. Temam; G. Watts


Publisher
John Wiley and Sons
Year
2004
Tongue
English
Weight
212 KB
Volume
16
Category
Article
ISSN
1532-0626

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

In performance critical applications, memory latency is frequently the dominant overhead. In many cases, automatic compiler‐based optimizations to improve memory performance are limited and programmers frequently resort to manual optimization techniques. However, this process is tedious and time‐consuming. Furthermore, as the potential benefit from optimization is unknown there is no way to judge the amount of effort worth expending, nor when the process can stop, i.e. when optimal memory performance has been achieved or sufficiently approached. Architecture simulators can provide such information but designing an accurate model of an existing architecture is difficult and simulation times are excessively long. In this article, we propose and implement a technique that is both fast and reasonably accurate for estimating a lower bound on execution time for scientific applications. This technique has been tested on a wide range of programs from the SPEC benchmark suite and two commercial applications, where it has been used to guide a manual optimization process and iterative compilation. We compare our technique with that of a simulator with an ideal memory behaviour and demonstrate that our technique provides comparable information on memory performance and yet is over two orders of magnitude faster. We further show that our technique is considerably more accurate than hardware counters. Copyright Β© 2004 John Wiley & Sons, Ltd.


πŸ“œ SIMILAR VOLUMES


Lower bounds and algorithms for flowtime
✍ Simon Dunstall; Andrew Wirth; Kenneth Baker πŸ“‚ Article πŸ“… 2000 πŸ› Springer US 🌐 English βš– 165 KB πŸ‘ 2 views

We consider the scheduling of N jobs divided into G families for processing on a single machine. No set-up is necessary between jobs belonging to the same family. A set-up must be scheduled when switching from the processing of family i jobs to those of another family j, i = j, the duration of this

ChemInform Abstract: A New Method for Fa
✍ Ming Zhang; Lydia E. Kavraki πŸ“‚ Article πŸ“… 2010 πŸ› John Wiley and Sons βš– 24 KB πŸ‘ 1 views

## Abstract ChemInform is a weekly Abstracting Service, delivering concise information at a glance that was extracted from about 100 leading journals. To access a ChemInform Abstract of an article which was published elsewhere, please select a β€œFull Text” option. The original article is trackable v

A fast algorithm for a k-NN classifier b
✍ Shin'ichiro Omachi; Hirotomo Aso πŸ“‚ Article πŸ“… 2000 πŸ› John Wiley and Sons 🌐 English βš– 173 KB πŸ‘ 3 views

The nearest neighbor rule or k-nearest neighbor rule is a technique of nonparametric pattern recognition. Its algorithm is simple and the error is smaller than twice the Bayes error if there are enough training samples. However, it requires an enormous amount of computation, proportional to the numb